unsloth/Qwen3-8B
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:32kPublished:Apr 28, 2025License:apache-2.0Architecture:Transformer0.0K Open Weights Warm
Qwen3-8B is an 8.2 billion parameter causal language model from the Qwen series, developed by Qwen. It uniquely supports seamless switching between a 'thinking mode' for complex logical reasoning, math, and coding, and a 'non-thinking mode' for efficient general-purpose dialogue. This model excels in reasoning capabilities, human preference alignment for creative writing and multi-turn dialogues, and agent capabilities, supporting over 100 languages with a native context length of 32,768 tokens, extendable to 131,072 tokens with YaRN.
Loading preview...
Popular Sampler Settings
Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.
temperature
top_p
top_k
–
frequency_penalty
presence_penalty
repetition_penalty
min_p
–