qihoo360/Light-R1-32B
TEXT GENERATIONConcurrency Cost:2Model Size:32.8BQuant:FP8Ctx Length:32kPublished:Mar 4, 2025License:apache-2.0Architecture:Transformer0.1K Open Weights Cold

Light-R1-32B is a 32.8 billion parameter language model developed by Qihoo360, fine-tuned from Qwen2.5-32B-Instruct. It is specifically optimized for complex mathematical reasoning, achieving state-of-the-art performance on AIME24 and AIME25 benchmarks among models trained without long Chain-of-Thought (COT) data. The model utilizes a curriculum SFT & DPO approach and model merging to surpass previous R1-Distill models, making it highly effective for advanced math problem-solving.

Loading preview...