PapaMoth/Qwen3-0.6B
TEXT GENERATION
Concurrency Cost: 1
Model Size: 0.8B
Quant: BF16
Ctx Length: 32k
Published: Apr 10, 2026
License: apache-2.0
Architecture: Transformer
Open Weights

Qwen3-0.6B is a 0.8 billion parameter causal language model from the Qwen series, developed by Qwen. It supports switching between a 'thinking mode' for complex logical reasoning, math, and coding, and a 'non-thinking mode' for efficient general-purpose dialogue. It offers enhanced reasoning capabilities, improved human preference alignment for creative writing and role-playing, and multilingual support across 100+ languages, which makes it suitable for a range of conversational and agentic applications.
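When the model runs in thinking mode, downstream code usually needs to separate the reasoning from the final reply. The following is a minimal sketch, not an official API. It assumes that thinking-mode output wraps the reasoning in <think>...</think> tags, as described in the upstream Qwen3 documentation; this page does not state the output format. The helper name split_thinking is hypothetical.

```python
import re
from typing import Tuple

# Assumption: reasoning is wrapped in <think>...</think>, per upstream
# Qwen3 documentation. This page itself does not specify the format.
_THINK_RE = re.compile(r"<think>(.*?)</think>", re.DOTALL)


def split_thinking(text: str) -> Tuple[str, str]:
    """Split raw generated text into (reasoning, answer).

    If no <think> block is present (for example, non-thinking mode),
    the reasoning part is an empty string.
    """
    match = _THINK_RE.search(text)
    if match is None:
        return "", text.strip()
    reasoning = match.group(1).strip()
    # Everything outside the first <think> block is treated as the answer.
    answer = (text[:match.start()] + text[match.end():]).strip()
    return reasoning, answer


if __name__ == "__main__":
    raw = "<think>2+2 is 4</think>\n\nThe answer is 4."
    reasoning, answer = split_thinking(raw)
    print("reasoning:", reasoning)
    print("answer:", answer)
```

Only the first <think> block is extracted here. If a deployment can emit several blocks or an unterminated one, the pattern would need adjusting.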
