qihoo360/TinyR1-32B-Preview
TEXT GENERATIONConcurrency Cost:2Model Size:32.8BQuant:FP8Ctx Length:32kPublished:Feb 24, 2025License:apache-2.0Architecture:Transformer0.3K Open Weights Warm
TinyR1-32B-Preview is a 32 billion parameter reasoning model developed by qihoo360, based on the Deepseek-R1-Distill-Qwen-32B architecture. It is specifically optimized for complex reasoning tasks across mathematics, coding, and science domains, demonstrating performance in math that nearly matches larger models. This model was created by fine-tuning and merging domain-specific models to achieve strong overall performance in these analytical areas.
Loading preview...
Popular Sampler Settings
Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.
temperature
top_p
top_k
–
frequency_penalty
–
presence_penalty
–
repetition_penalty
–
min_p
–