fieldvalley-llm2025/llm2025_main_merged_dpo03
TEXT GENERATIONConcurrency Cost:1Model Size:4BQuant:BF16Ctx Length:32kPublished:Feb 7, 2026License:apache-2.0Architecture:Transformer Open Weights Warm

The fieldvalley-llm2025/llm2025_main_merged_dpo03 is a 4 billion parameter language model, fine-tuned from Qwen/Qwen2.5-7B-Instruct, specifically optimized for generating strictly-formatted JSON outputs. It leverages a three-stage DPO process, with the final stage aggressively trained to eliminate extraneous text like Markdown fences or conversational preambles. This model excels at producing pure JSON responses, making it ideal for applications requiring clean, parseable structured data.

Loading preview...