hector-gr/RLCR-v4-ks-uniqueness-cov0-gapece-cold-math
TEXT GENERATIONConcurrency Cost:1Model Size:7.6BQuant:FP8Ctx Length:32kPublished:Apr 10, 2026Architecture:Transformer0.0K Cold
The hector-gr/RLCR-v4-ks-uniqueness-cov0-gapece-cold-math model is a 7.6 billion parameter language model fine-tuned from Qwen/Qwen2.5-7B. It was trained using the GRPO method, which is designed to enhance mathematical reasoning capabilities. This model is specifically optimized for complex mathematical problem-solving and logical deduction tasks. Its primary strength lies in processing and generating responses for intricate mathematical and reasoning-based queries.
Loading preview...