hector-gr/RLCR-v4-ks-uniqueness-cov0-entropy100-cold-math
TEXT GENERATIONConcurrency Cost:1Model Size:7.6BQuant:FP8Ctx Length:32kPublished:Mar 25, 2026Architecture:Transformer Cold

hector-gr/RLCR-v4-ks-uniqueness-cov0-entropy100-cold-math is a 7.6 billion parameter language model fine-tuned from Qwen/Qwen2.5-7B. Developed by hector-gr, this model specializes in mathematical reasoning, leveraging the GRPO training method. It is optimized for tasks requiring advanced logical and mathematical problem-solving capabilities, building upon the robust Qwen2.5 architecture with a 32K context length.

Loading preview...