deepkick/qwen3-4b-struct-dpo-v14-b0.10-L2048-merged
TEXT GENERATIONConcurrency Cost:1Model Size:4BQuant:BF16Ctx Length:32kPublished:Feb 8, 2026License:apache-2.0Architecture:Transformer Open Weights Warm

The deepkick/qwen3-4b-struct-dpo-v14-b0.10-L2048-merged model is a 4 billion parameter Qwen3-based language model, fine-tuned by deepkick using Direct Preference Optimization (DPO) via Unsloth. It is specifically optimized to enhance structured response stability and schema adherence, making it suitable for applications requiring precise output formats. This model features full-merged 16-bit weights and supports a maximum sequence length of 2048 tokens, focusing on reliable structured data generation.

Loading preview...