unsloth/Qwen3-4B
TEXT GENERATIONConcurrency Cost:1Model Size:4BQuant:BF16Ctx Length:32kPublished:Apr 28, 2025Architecture:Transformer0.0K Warm
Qwen3-4B is a 4 billion parameter causal language model developed by Qwen, part of the latest Qwen3 series. This model uniquely supports seamless switching between a 'thinking mode' for complex logical reasoning, math, and coding, and a 'non-thinking mode' for efficient general-purpose dialogue. It features enhanced reasoning capabilities, superior human preference alignment for creative writing and multi-turn dialogues, and strong multilingual support across 100+ languages, with a native context length of 32,768 tokens extendable to 131,072 tokens via YaRN.
Loading preview...
Popular Sampler Settings
Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.
temperature
–
top_p
–
top_k
–
frequency_penalty
–
presence_penalty
–
repetition_penalty
–
min_p
–