dnotitia/Qwen3-4B is a 4.0 billion parameter causal language model from the Qwen series, developed by Qwen, featuring a unique dual-mode architecture for seamless switching between 'thinking' (complex reasoning, math, coding) and 'non-thinking' (efficient dialogue) modes. It offers enhanced reasoning, superior human preference alignment, and strong agent capabilities, supporting over 100 languages. This specific version includes Dnotitia's patches for improved training compatibility, such as a refactored chat template and TRL library support.
No reviews yet. Be the first to review!