KawausoHiroKawauso/qwen3-4b-structeval-lora-39
TEXT GENERATIONConcurrency Cost:1Model Size:4BQuant:BF16Ctx Length:32kPublished:Feb 8, 2026License:apache-2.0Architecture:Transformer Open Weights Warm

KawausoHiroKawauso/qwen3-4b-structeval-lora-39 is a 4 billion parameter instruction-tuned model, fine-tuned from Qwen/Qwen3-4B-Instruct-2507 using Direct Preference Optimization (DPO) via Unsloth. This model is specifically optimized to enhance reasoning capabilities through Chain-of-Thought and improve the quality of structured responses. It is designed for applications requiring aligned outputs based on preferred datasets, offering improved performance in generating coherent and structured text.

Loading preview...