Overview

Qwen3-1.7B is a 1.7 billion parameter causal language model from the Qwen series, designed to offer a comprehensive suite of dense and mixture-of-experts (MoE) models. It introduces a novel feature allowing seamless switching between a 'thinking mode' for complex logical reasoning, math, and coding, and a 'non-thinking mode' for efficient, general-purpose dialogue. This model demonstrates significant enhancements in reasoning capabilities, outperforming previous Qwen models in mathematics, code generation, and commonsense logical reasoning.

Key Capabilities

Dynamic Thinking Modes: Uniquely supports switching between a reasoning-focused 'thinking mode' and an efficient 'non-thinking mode' within a single model.
Enhanced Reasoning: Achieves superior performance in mathematical problems, code generation, and logical reasoning tasks.
Multilingual Support: Strong capabilities across 100+ languages and dialects for instruction following and translation.
Agentic Expertise: Excels in integrating with external tools, achieving leading performance in complex agent-based tasks.
Human Preference Alignment: Delivers natural and engaging conversational experiences, excelling in creative writing, role-playing, and multi-turn dialogues.

Good for

Applications requiring dynamic shifts between deep analytical processing and rapid, general conversational responses.
Multilingual chatbots and translation services.
Agent-based systems needing robust tool integration and complex task execution.
Creative content generation and interactive role-playing scenarios.

Overview

Overview

Key Capabilities

Good for

Full Model Card (README)