Qwen/Qwen3-235B-A22B
Text generation · Open weights
Concurrency cost: 4
Model size: 235B
Quantization: FP8
Context length: 32k
Published: Apr 27, 2025
License: apache-2.0
Architecture: Transformer

Qwen/Qwen3-235B-A22B is a 235-billion-parameter Mixture-of-Experts (MoE) causal language model developed by Qwen, with 22 billion parameters activated per token. The model supports seamless switching between a 'thinking mode' for complex reasoning, math, and coding and a 'non-thinking mode' for efficient general dialogue. It excels at reasoning, human preference alignment, agentic tasks, and multilingual instruction following across more than 100 languages, and supports a native context length of 32,768 tokens.
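
Mode switching is exposed through the chat template. The sketch below follows the Hugging Face transformers usage pattern published for Qwen3, where the enable_thinking flag of apply_chat_template selects between the two modes; the prompt, generation arguments, and hardware setup here are illustrative assumptions (a 235B MoE needs a multi-GPU node even at FP8, so this is typically run behind a serving endpoint rather than locally).

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "Qwen/Qwen3-235B-A22B"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(
    model_name, torch_dtype="auto", device_map="auto"
)

messages = [{"role": "user", "content": "How many primes are there below 100?"}]

# enable_thinking=True (the default) lets the model emit a <think>...</think>
# reasoning block before its answer; enable_thinking=False switches to the
# efficient non-thinking dialogue mode.
text = tokenizer.apply_chat_template(
    messages,
    tokenize=False,
    add_generation_prompt=True,
    enable_thinking=True,
)
inputs = tokenizer([text], return_tensors="pt").to(model.device)
output_ids = model.generate(**inputs, max_new_tokens=1024)

# Decode only the newly generated tokens, skipping the prompt.
print(tokenizer.decode(
    output_ids[0][inputs.input_ids.shape[-1]:], skip_special_tokens=True
))
```

Qwen's model card additionally documents /think and /no_think tags inside user messages as soft switches for toggling the mode turn-by-turn in multi-turn conversations.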


Popular Sampler Settings

Featherless tracks the three parameter combinations most used by its users for this model; each configuration tunes the sampler settings listed below (an illustrative request follows the list).

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p
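
As a rough sketch of how such a configuration can be supplied, the request below uses an OpenAI-compatible chat completion call. The base URL, placeholder API key, and the specific values are assumptions for illustration, not one of the recorded user configurations; the values shown follow Qwen's published recommendation for thinking mode (temperature 0.6, top_p 0.95, top_k 20, min_p 0). Samplers outside the OpenAI spec, such as top_k, min_p, and repetition_penalty, are passed through extra_body.

```python
from openai import OpenAI

# Base URL and key handling are assumptions; consult the Featherless docs.
client = OpenAI(
    base_url="https://api.featherless.ai/v1",
    api_key="YOUR_FEATHERLESS_API_KEY",
)

response = client.chat.completions.create(
    model="Qwen/Qwen3-235B-A22B",
    messages=[{"role": "user", "content": "Summarize the Qwen3 thinking mode."}],
    # Standard OpenAI sampler parameters:
    temperature=0.6,
    top_p=0.95,
    frequency_penalty=0.0,
    presence_penalty=0.0,
    # Non-standard samplers pass through extra_body on OpenAI-compatible servers:
    extra_body={
        "top_k": 20,
        "min_p": 0.0,
        "repetition_penalty": 1.0,
    },
)
print(response.choices[0].message.content)
```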