logicker/SkkuDS-DPO-72B-v3
Text Generation
Concurrency Cost: 4
Model Size: 72.3B
Quant: FP8
Ctx Length: 32k
Published: Feb 15, 2024
License: tongyi-qianwen
Architecture: Transformer
Cold
logicker/SkkuDS-DPO-72B-v3 is a 72.3-billion-parameter, decoder-only language model based on Qwen1.5 and fine-tuned with DPO (Direct Preference Optimization) on the Intel/orca_dpo_pairs dataset. It supports a 32K context length and offers enhanced multilingual capabilities. The model targets advanced natural language understanding and generation tasks, with the DPO fine-tuning aimed at improved instruction following.
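For readers unfamiliar with DPO, the following is a minimal sketch of the standard per-example DPO loss, not the training code used for this model. The function name `dpo_loss`, the default `beta=0.1`, and the log-probability values in the usage lines are illustrative assumptions; none of them come from the model card.

```python
import math

def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """Per-example DPO loss (hypothetical helper, beta is an assumed value).

    Each argument is the summed log-probability of a full response under
    either the policy being trained or the frozen reference model.
    """
    # How much more the policy likes each response than the reference does.
    chosen_margin = policy_chosen_logp - ref_chosen_logp
    rejected_margin = policy_rejected_logp - ref_rejected_logp
    logits = beta * (chosen_margin - rejected_margin)
    # -log(sigmoid(x)) == log(1 + exp(-x)), computed in a numerically stable way.
    return max(-logits, 0.0) + math.log1p(math.exp(-abs(logits)))

# Illustrative values only: the policy prefers the chosen response more than
# the reference does, so the loss falls below log(2).
loss_good = dpo_loss(-5.0, -15.0, -10.0, -10.0)
loss_neutral = dpo_loss(-10.0, -10.0, -10.0, -10.0)  # equals log(2)
```

The loss shrinks as the policy raises the likelihood of the preferred response relative to the rejected one, measured against the reference model. A preference dataset such as Intel/orca_dpo_pairs supplies those chosen/rejected pairs.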