kong3125/Qwen2.5-MATH-1.5B-BASE-RLOO-EP3-LR2e06
TEXT GENERATIONConcurrency Cost:1Model Size:1.5BQuant:BF16Ctx Length:32kPublished:Dec 16, 2025Architecture:Transformer Cold

The kong3125/Qwen2.5-MATH-1.5B-BASE-RLOO-EP3-LR2e06 model is a fine-tuned version of Qwen's Qwen2.5-MATH-7B, specifically optimized for mathematical reasoning tasks. It was trained using the GRPO method on the jhn9803/hendrycks-math-with-answers dataset. This model is designed to excel in solving complex mathematical problems, leveraging techniques from DeepSeekMath for enhanced performance in this domain.

Loading preview...