hector-gr/RLCR-v4-ks-uniqueness-cov0-entropy50-cold-math
TEXT GENERATIONConcurrency Cost:1Model Size:7.6BQuant:FP8Ctx Length:32kPublished:Mar 25, 2026Architecture:Transformer Cold

The hector-gr/RLCR-v4-ks-uniqueness-cov0-entropy50-cold-math model is a 7.6 billion parameter language model fine-tuned from Qwen/Qwen2.5-7B. Developed by hector-gr, it utilizes the GRPO method, as introduced in the DeepSeekMath paper, to enhance mathematical reasoning capabilities. This model is optimized for tasks requiring robust mathematical problem-solving and logical deduction, leveraging its 32768 token context length.

Loading preview...