hector-gr/RLCR-v4-ks-batch-frontier-combo-cold-math
Text Generation | Concurrency Cost: 1 | Model Size: 7.6B | Quant: FP8 | Ctx Length: 32k | Published: Mar 28, 2026 | Architecture: Transformer | Cold
hector-gr/RLCR-v4-ks-batch-frontier-combo-cold-math is a 7.6-billion-parameter language model fine-tuned from Qwen/Qwen2.5-7B. Developed by hector-gr, it was trained with GRPO (Group Relative Policy Optimization), a reinforcement learning method originally introduced to improve mathematical reasoning in language models. The model targets multi-step mathematical problem solving and related tasks that depend on numerical and logical reasoning.
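To illustrate the core idea behind GRPO mentioned above, here is a minimal sketch of the group-relative advantage step: several completions are sampled for the same prompt, each receives a scalar reward, and each reward is normalized against the mean and standard deviation of its own group. The function name `group_relative_advantages`, the `eps` parameter, and the use of the population standard deviation are assumptions made for illustration; this is not the training code used for this model, whose reward function and hyperparameters are not stated here.

```python
from statistics import fmean, pstdev


def group_relative_advantages(rewards, eps=1e-8):
    """Normalize each reward against its own sampling group.

    `rewards` holds one scalar reward per completion sampled for a single
    prompt. The name, the `eps` term, and the choice of population standard
    deviation are illustrative assumptions, not details from the model card.
    """
    if not rewards:
        raise ValueError("rewards must contain at least one value")
    mean = fmean(rewards)
    std = pstdev(rewards)
    # eps avoids division by zero when every completion gets the same reward.
    return [(r - mean) / (std + eps) for r in rewards]


if __name__ == "__main__":
    # Hypothetical rewards: 1.0 for a correct final answer, 0.0 otherwise.
    sampled_rewards = [1.0, 0.0, 0.0, 1.0]
    for reward, advantage in zip(sampled_rewards,
                                 group_relative_advantages(sampled_rewards)):
        print(f"reward={reward:.1f} advantage={advantage:+.3f}")
```

Completions that score above their group's average get a positive advantage and those below get a negative one, so no separate value model is needed to provide a baseline. When all completions in a group receive the same reward, every advantage is zero and that group contributes no learning signal.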