ManTheMan66/Qwen3-4B-Instruct-2507
ManTheMan66/Qwen3-4B-Instruct-2507 is a 4-billion-parameter instruction-tuned causal language model developed by the Qwen team, featuring a native context length of 262,144 tokens. This updated Qwen3-4B variant significantly improves general capabilities, including instruction following, logical reasoning, mathematics, coding, and long-tail knowledge across multiple languages. It is specifically tuned for better alignment with user preferences in subjective and open-ended tasks, making it well suited to generating helpful, high-quality text.
Qwen3-4B-Instruct-2507: Enhanced Instruction-Following LLM
Qwen3-4B-Instruct-2507 is an updated 4-billion-parameter instruction-tuned causal language model from the Qwen3 family, notable for its native 262,144-token context length. This version operates only in non-thinking mode: it does not emit reasoning blocks in its output, so explicitly setting enable_thinking=False is no longer required.
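Because the checkpoint defaults to non-thinking mode, a request needs no special flags. A minimal sketch, assuming the model is served behind an OpenAI-compatible endpoint (for example via vLLM); the endpoint URL and helper names are illustrative, not part of the model card:

```python
# Hedged sketch: querying Qwen3-4B-Instruct-2507 through an assumed
# OpenAI-compatible chat endpoint. Note there is no enable_thinking flag:
# this checkpoint runs in non-thinking mode by default.
import json
import urllib.request

def build_chat_payload(prompt: str,
                       model: str = "ManTheMan66/Qwen3-4B-Instruct-2507",
                       max_tokens: int = 512) -> dict:
    """Build a standard chat-completions request body for one user turn."""
    return {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        "max_tokens": max_tokens,
    }

def chat(prompt: str, base_url: str = "http://localhost:8000/v1") -> str:
    """POST the payload and return the assistant's reply text."""
    req = urllib.request.Request(
        f"{base_url}/chat/completions",
        data=json.dumps(build_chat_payload(prompt)).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        body = json.load(resp)
    return body["choices"][0]["message"]["content"]
```

The `base_url` assumes a locally hosted server; any OpenAI-compatible host works the same way.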
Key Capabilities & Enhancements
- General Capabilities: Significant improvements across instruction following, logical reasoning, text comprehension, mathematics, science, and coding.
- Long-Tail Knowledge: Substantial gains in knowledge coverage across multiple languages.
- User Alignment: Markedly better alignment with user preferences for subjective and open-ended tasks, leading to more helpful responses and higher-quality text generation.
- Long-Context Understanding: Enhanced capabilities in processing and understanding very long contexts up to 256K tokens.
- Agentic Use: Excels in tool-calling capabilities, with recommendations to use Qwen-Agent for optimal agentic performance.
Performance Highlights
Benchmarked against previous Qwen3 models and other LLMs, Qwen3-4B-Instruct-2507 performs strongly:
- Knowledge: Achieves 69.6 on MMLU-Pro and 62.0 on GPQA, outperforming its predecessor.
- Reasoning: Scores 47.4 on AIME25 and 80.2 on ZebraLogic, demonstrating significant gains.
- Coding: Reaches 35.1 on LiveCodeBench v6 and 76.8 on MultiPL-E.
- Alignment: Achieves 83.4 on IFEval and 83.5 on Creative Writing v3, indicating improved subjective quality.
Recommended Use Cases
This model is particularly well-suited for applications requiring:
- Advanced instruction following and complex task execution.
- High-quality text generation in open-ended scenarios.
- Long-context processing for detailed analysis or summarization.
- Agentic workflows and tool-use integration.
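For the long-context use cases above, it helps to check a prompt against the 262,144-token native window before sending it. A minimal sketch; the 4-characters-per-token heuristic is a rough assumption, and exact counts should come from the model's tokenizer:

```python
# Hedged sketch: pre-flight budget check against the model's native
# 262,144-token context window. The chars-per-token ratio is a crude
# English-text heuristic, not a property of this tokenizer.
CONTEXT_WINDOW = 262_144

def estimate_tokens(text: str, chars_per_token: float = 4.0) -> int:
    """Rough token estimate; use the real tokenizer for exact counts."""
    return int(len(text) / chars_per_token) + 1

def fits_in_context(text: str, reserved_for_output: int = 4096) -> bool:
    """True if the prompt plus a reserved generation budget fits."""
    return estimate_tokens(text) + reserved_for_output <= CONTEXT_WINDOW
```

Documents that fail this check can be chunked or summarized hierarchically before a final long-context pass.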