Qwen/Qwen3-14B-Base
TEXT GENERATIONConcurrency Cost:1Model Size:14BQuant:FP8Ctx Length:32kPublished:Apr 28, 2025License:apache-2.0Architecture:Transformer0.1K Open Weights Warm

Qwen3-14B-Base is a 14.8 billion parameter causal language model developed by Qwen, pre-trained on 36 trillion tokens across 119 languages. This model features a three-stage pre-training process focusing on broad language modeling, reasoning skills (STEM, coding), and long-context comprehension up to 32k tokens. It incorporates architectural refinements like qk layernorm and scaling law-guided hyperparameter tuning for improved stability and performance. Qwen3-14B-Base is designed for general knowledge acquisition and advanced reasoning tasks.

Loading preview...

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p