yunjae-won/mpq3_qwen4bi_sft_dpo_beta1e-1_step8192
Text Generation | Concurrency Cost: 1 | Model Size: 4B | Quant: BF16 | Ctx Length: 32k | Published: Apr 6, 2026 | Architecture: Transformer | Cold
yunjae-won/mpq3_qwen4bi_sft_dpo_beta1e-1_step8192 is a 4-billion-parameter language model with a 32,768-token context length. Its name indicates that it is based on the Qwen architecture. The available model card gives no further details about its development, training, or distinguishing features, so its primary use cases and specialized strengths remain undefined.
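The listed size, precision, and context length allow a rough capacity estimate. The sketch below is illustrative only: it assumes the nominal 4B figure is the exact parameter count (the card does not state one), treats BF16 as 2 bytes per parameter, and ignores KV cache and activation memory. The helper names `weight_memory_gib` and `fits_context` are hypothetical and not part of any library.

```python
# Rough sizing helpers based on the listed metadata (4B params, BF16, 32768 ctx).
# Assumption: "4B" is treated as exactly 4e9 parameters; the real count may differ.

CTX_LENGTH = 32768          # context length stated in the listing
BYTES_PER_BF16_PARAM = 2    # BF16 stores each parameter in 16 bits


def weight_memory_gib(num_params: float, bytes_per_param: int = BYTES_PER_BF16_PARAM) -> float:
    """Return the weights-only memory footprint in GiB (no KV cache, no activations)."""
    return num_params * bytes_per_param / 2**30


def fits_context(prompt_tokens: int, max_new_tokens: int, ctx_length: int = CTX_LENGTH) -> bool:
    """Return True if the prompt plus the requested generation fits in the context window."""
    return prompt_tokens + max_new_tokens <= ctx_length


if __name__ == "__main__":
    print(f"Approx. weight memory: {weight_memory_gib(4e9):.2f} GiB")
    print("30000 + 2000 tokens fits:", fits_context(30000, 2000))
    print("32000 + 1000 tokens fits:", fits_context(32000, 1000))
```

Under these assumptions the weights alone come to roughly 7.5 GiB, so actual serving memory will be higher once the KV cache for long contexts is included.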