motobrew/qwen-dpo-v3
Text Generation | Concurrency Cost: 1 | Model Size: 4B | Quant: BF16 | Ctx Length: 32k | Published: Feb 28, 2026 | License: apache-2.0 | Architecture: Transformer | Open Weights | Warm

motobrew/qwen-dpo-v3 is a fine-tuned language model developed by motobrew, based on motobrew/qwen3-adv-comp-v34. It was trained with Direct Preference Optimization (DPO) to strengthen reasoning, particularly Chain-of-Thought, and to improve the quality of structured responses. It is intended for applications that need aligned, high-quality outputs that reflect the preferred responses in its preference data.
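
For readers unfamiliar with DPO, the sketch below shows the standard per-example DPO objective in plain Python. It is not motobrew's training code: the function name `dpo_loss`, its argument names, and the default `beta=0.1` are assumptions chosen for illustration, and the inputs are taken to be summed log-probabilities of a full response under the policy being trained and under a frozen reference model.

```python
import math


def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """Per-example DPO loss (illustrative sketch, hypothetical names).

    Each argument is the summed log-probability of a whole response.
    beta scales how strongly the policy is pushed away from the reference;
    0.1 is an assumed default, not a value reported for this model.
    """
    # How much more (or less) likely each response is under the policy
    # than under the frozen reference model.
    chosen_logratio = policy_chosen_logp - ref_chosen_logp
    rejected_logratio = policy_rejected_logp - ref_rejected_logp

    # Positive margin: the policy prefers the chosen response more
    # strongly than the reference does.
    margin = beta * (chosen_logratio - rejected_logratio)

    # loss = -log(sigmoid(margin)), computed in a numerically stable way.
    if margin > 0:
        return math.log1p(math.exp(-margin))
    return -margin + math.log1p(math.exp(margin))


# When the policy equals the reference, the margin is 0 and the loss is log(2).
baseline = dpo_loss(-10.0, -10.0, -10.0, -10.0)

# When the policy favours the chosen response, the loss drops below log(2).
improved = dpo_loss(-5.0, -9.0, -6.0, -8.0)
```

The loss falls as the policy assigns relatively more probability to the preferred response than the reference does, which is the mechanism by which preference pairs shape the model's outputs.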
