logicker/SkkuDS-DPO-72B-v3
TEXT GENERATIONConcurrency Cost:4Model Size:72.3BQuant:FP8Ctx Length:32kPublished:Feb 15, 2024License:tongyi-qianwenArchitecture:Transformer Cold

The logicker/SkkuDS-DPO-72B-v3 is a 72.3 billion parameter Qwen1.5-based decoder-only language model, fine-tuned using DPO on the Intel/orca_dpo_pairs dataset. This model offers stable support for a 32K context length and enhanced multilingual capabilities. It is designed for advanced natural language understanding and generation tasks, leveraging its large parameter count and DPO optimization for improved instruction following.

Loading preview...

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p