Qwen/Qwen3-32B
TEXT GENERATIONConcurrency Cost:2Model Size:32BQuant:FP8Ctx Length:32kPublished:Apr 27, 2025License:apache-2.0Architecture:Transformer0.7K Open Weights Warm

Qwen3-32B is a 32.8 billion parameter causal language model from Qwen, featuring a unique dual-mode architecture that seamlessly switches between a 'thinking mode' for complex reasoning, math, and coding, and a 'non-thinking mode' for efficient general dialogue. It offers enhanced reasoning capabilities, superior human preference alignment for creative writing and role-playing, and strong agentic tool-calling abilities. The model supports over 100 languages and dialects with a native context length of 32,768 tokens, extendable to 131,072 tokens using YaRN scaling.

Loading preview...

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p