qihoo360/Light-R1-32B-DS
TEXT GENERATIONConcurrency Cost:2Model Size:32BQuant:FP8Ctx Length:32kPublished:Mar 12, 2025License:apache-2.0Architecture:Transformer0.0K Open Weights Cold

qihoo360/Light-R1-32B-DS is a 32 billion parameter language model developed by Qihoo360, fine-tuned from DeepSeek-R1-Distill-Qwen-32B. This model specializes in mathematical reasoning, achieving strong performance on AIME24 and AIME25 benchmarks. It was further trained using only 3K SFT data, demonstrating efficient data utilization for near-state-of-the-art math capabilities.

Loading preview...