dnotitia/Qwen3-4B
TEXT GENERATIONConcurrency Cost:1Model Size:4BQuant:BF16Ctx Length:32kPublished:Sep 28, 2025License:apache-2.0Architecture:Transformer Open Weights Warm

dnotitia/Qwen3-4B is a 4.0 billion parameter causal language model from the Qwen series, developed by Qwen, featuring a unique dual-mode architecture for seamless switching between 'thinking' (complex reasoning, math, coding) and 'non-thinking' (efficient dialogue) modes. It offers enhanced reasoning, superior human preference alignment, and strong agent capabilities, supporting over 100 languages. This specific version includes Dnotitia's patches for improved training compatibility, such as a refactored chat template and TRL library support.

Loading preview...