RLHFlow/Qwen2.5-Math-1.5B-GRPO-n8-easy
TEXT GENERATIONConcurrency Cost:1Model Size:1.5BQuant:BF16Ctx Length:32kPublished:Oct 26, 2025License:apache-2.0Architecture:Transformer Open Weights Warm

RLHFlow/Qwen2.5-Math-1.5B-GRPO-n8-easy is a 1.5 billion parameter language model based on the Qwen2.5 architecture, featuring an extensive 131072-token context length. This model is specifically fine-tuned for mathematical reasoning and problem-solving tasks. Its optimization targets enhanced performance in numerical and logical challenges, making it suitable for applications requiring robust mathematical capabilities.

Loading preview...