hector-gr/RLCR-v4-ks-uniqueness-cold-math
Text Generation | Concurrency Cost: 1 | Model Size: 7.6B | Quant: FP8 | Ctx Length: 32k | Published: Mar 16, 2026 | Architecture: Transformer | Cold
hector-gr/RLCR-v4-ks-uniqueness-cold-math is a 7.6-billion-parameter language model fine-tuned from Qwen/Qwen2.5-7B by hector-gr. It was trained with the TRL library using GRPO (Group Relative Policy Optimization), a reinforcement learning method introduced to improve mathematical reasoning. The model targets tasks that require multi-step logical and mathematical problem-solving.
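For readers who want to try the model locally, the following is a minimal sketch using the Hugging Face `transformers` library. It assumes the weights are published on the Hugging Face Hub under the same model ID and load through the standard causal-LM classes. The `build_prompt` helper and its prompt wording are hypothetical, since this card does not document a prompt format. `device_map="auto"` additionally requires the `accelerate` package.

```python
MODEL_ID = "hector-gr/RLCR-v4-ks-uniqueness-cold-math"


def build_prompt(question: str) -> str:
    """Wrap a math question in a plain instruction.

    Hypothetical format: the model card does not specify a prompt template.
    """
    return (
        "Solve the following problem step by step.\n\n"
        f"Problem: {question.strip()}\n\n"
        "Solution:"
    )


def generate(question: str, max_new_tokens: int = 512) -> str:
    """Load the model and return its completion for one question."""
    # Imported lazily so build_prompt works without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    # Assumption: the checkpoint loads as a standard causal LM.
    model = AutoModelForCausalLM.from_pretrained(
        MODEL_ID, torch_dtype="auto", device_map="auto"
    )

    inputs = tokenizer(build_prompt(question), return_tensors="pt").to(model.device)
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)

    # Drop the prompt tokens so only the newly generated text is returned.
    new_tokens = output_ids[0][inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)
```

Calling `generate("What is the sum of the first 10 positive integers?")` downloads the checkpoint on first use and returns the model's completion as a string. Nothing is loaded until `generate` is called, so importing the module stays cheap.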