jaygala24/Qwen3-4B-ReMax-math-reasoning
TEXT GENERATIONConcurrency Cost:1Model Size:4BQuant:BF16Ctx Length:32kPublished:Apr 13, 2026License:apache-2.0Architecture:Transformer Open Weights Cold
jaygala24/Qwen3-4B-ReMax-math-reasoning is a 4 billion parameter language model fine-tuned from Qwen3-4B, specifically optimized for mathematical reasoning tasks. It leverages the ReMax reinforcement learning algorithm without a KL penalty, trained on `gsm8k` and `math` datasets. This model is designed to excel at step-by-step mathematical problem-solving, offering enhanced accuracy for numerical and logical challenges within its 32768 token context window.
Loading preview...