simplescaling/s1-32B
TEXT GENERATIONConcurrency Cost:2Model Size:32BQuant:FP8Ctx Length:32kPublished:Jan 14, 2025License:apache-2.0Architecture:Transformer0.3K Open Weights Warm

The simplescaling/s1-32B is a 32 billion parameter reasoning model, fine-tuned from Qwen2.5-32B-Instruct by simplescaling. It is notable for achieving strong reasoning performance, matching o1-preview, despite being trained on only 1,000 examples. This model demonstrates test-time scaling through a technique called budget forcing, making it suitable for complex problem-solving tasks.

Loading preview...

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p