logicker/SkkuDS-DPO-72B-v3 is a 72.3-billion-parameter, decoder-only language model based on Qwen1.5 and fine-tuned with Direct Preference Optimization (DPO) on the Intel/orca_dpo_pairs dataset. It offers stable support for a 32K-token context length and enhanced multilingual capabilities. The model is intended for natural language understanding and generation tasks, with the DPO fine-tuning aimed at improving instruction following.
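For readers unfamiliar with DPO, the per-example objective can be sketched as below. This is a minimal illustration of the standard DPO loss, not the training code used for this model. The function name `dpo_loss`, its argument names, and the default `beta=0.1` are assumptions chosen for illustration; the description does not state the hyperparameters actually used.

```python
import math


def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """Standard DPO loss for one preference pair (illustrative sketch).

    Each argument is the summed log-probability of a full response
    (chosen or rejected) under the policy being trained or under the
    frozen reference model. beta=0.1 is an assumed value.
    """
    # How much more (in log space) the policy favors each response
    # compared with the reference model.
    chosen_logratio = policy_chosen_logp - ref_chosen_logp
    rejected_logratio = policy_rejected_logp - ref_rejected_logp

    # Scaled preference margin: positive when the policy has moved
    # toward the chosen response relative to the reference.
    margin = beta * (chosen_logratio - rejected_logratio)

    # loss = -log(sigmoid(margin)) = log(1 + exp(-margin)),
    # written in a numerically stable form for either sign.
    if margin > 0:
        return math.log1p(math.exp(-margin))
    return -margin + math.log1p(math.exp(margin))
```

When the policy and reference agree on both responses, the margin is zero and the loss equals log 2. The loss falls below that value as the policy shifts probability toward the chosen response, and rises above it when the policy shifts toward the rejected one.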