nakotsuko13/qwen3-4b-nako13-dpo-qwen-cot-merged
TEXT GENERATIONConcurrency Cost:1Model Size:4BQuant:BF16Ctx Length:32kPublished:Feb 3, 2026License:apache-2.0Architecture:Transformer Open Weights Warm
The nakotsuko13/qwen3-4b-nako13-dpo-qwen-cot-merged model is a 4 billion parameter variant of the Qwen3-4B-Instruct-2507 architecture, developed by nakotsuko13. It is specifically optimized for precise structured data generation, including JSON, XML, CSV, and YAML formats. This model excels at producing raw, direct structured outputs by minimizing conversational filler, making it suitable for applications requiring strict output formatting.
Loading preview...
Model Overview
This model, nakotsuko13/qwen3-4b-nako13-dpo-qwen-cot-merged, is a specialized variant of the Qwen/Qwen3-4B-Instruct-2507 base model, developed by nakotsuko13. It has undergone a two-stage fine-tuning process to achieve high accuracy and strict adherence to structured output formats.
Key Capabilities
- Structured Data Generation: Highly optimized for generating precise JSON, XML, CSV, and YAML structures.
- Two-Stage Fine-Tuning: Utilizes both Supervised Fine-Tuning (SFT) and Direct Preference Optimization (DPO) for enhanced performance.
- SFT Stage: Trained on over 16,500 samples to master various structured formats using the
nakotsuko13/qwen3-4b-nako13-structured-output-loraadapter. - DPO Stage: Further refined using the
u-10bei/dpo-dataset-qwen-cotdataset to eliminate conversational filler and ensure direct, raw structured outputs.
- SFT Stage: Trained on over 16,500 samples to master various structured formats using the
- Full-Merged 16-bit Model: Ready for direct use with standard
transformersorvLLMlibraries.
Ideal Use Cases
- API Response Generation: Creating structured JSON or XML responses for applications.
- Data Extraction: Extracting information into structured formats from unstructured text.
- Configuration File Generation: Producing YAML or other structured configuration files.
- Automated Data Processing: Any task requiring strict, non-conversational structured output.