Qwen/Qwen3-4B-Thinking-2507
TEXT GENERATIONConcurrency Cost:1Model Size:4BQuant:BF16Ctx Length:32kPublished:Aug 5, 2025License:apache-2.0Architecture:Transformer0.6K Open Weights Warm

Qwen/Qwen3-4B-Thinking-2507 is a 4 billion parameter causal language model developed by Qwen, specifically enhanced for complex reasoning tasks. This model features significantly improved performance across logical reasoning, mathematics, science, coding, and academic benchmarks. It also offers enhanced 256K long-context understanding, making it ideal for applications requiring deep analytical processing and extended conversational memory.

Loading preview...

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p