MaelTwitch/Qwen3-0.6B
TEXT GENERATIONConcurrency Cost:1Model Size:0.8BQuant:BF16Ctx Length:32kPublished:Mar 30, 2026License:apache-2.0Architecture:Transformer Open Weights Loading

Qwen3-0.6B by Qwen is a 0.6 billion parameter causal language model with a 32,768 token context length, uniquely supporting seamless switching between a 'thinking mode' for complex logical reasoning, math, and coding, and a 'non-thinking mode' for efficient general-purpose dialogue. This model excels in reasoning capabilities, human preference alignment for creative writing and multi-turn dialogues, and agent capabilities for tool integration, alongside robust multilingual support for over 100 languages.

Loading preview...