qihoo360/Light-R1-14B-DS
TEXT GENERATIONConcurrency Cost:1Model Size:14BQuant:FP8Ctx Length:32kPublished:Mar 12, 2025License:apache-2.0Architecture:Transformer0.0K Open Weights Cold
Light-R1-14B-DS by Qihoo360 is a 14 billion parameter language model, fine-tuned from DeepSeek-R1-Distill-Qwen-14B, specifically optimized for mathematical reasoning tasks. It is the first open-source model of its size to successfully apply Reinforcement Learning (RL) on an already long-Chain-of-Thought (COT) fine-tuned base model under a light computational budget. This model achieves state-of-the-art performance in the 14B math model category, scoring 74.0 on AIME24 and 60.2 on AIME25, making it suitable for advanced mathematical problem-solving.
Loading preview...
Popular Sampler Settings
Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.
temperature
–
top_p
–
top_k
–
frequency_penalty
–
presence_penalty
–
repetition_penalty
–
min_p
–