hector-gr/RLCR-v4-ks-highcov-accgated-hotpot
TEXT GENERATIONConcurrency Cost:1Model Size:7.6BQuant:FP8Ctx Length:32kPublished:Mar 28, 2026Architecture:Transformer Cold

hector-gr/RLCR-v4-ks-highcov-accgated-hotpot is a 7.6 billion parameter causal language model, fine-tuned from Qwen/Qwen2.5-7B. Developed by hector-gr, this model is specifically trained using the GRPO method, which is designed to enhance mathematical reasoning capabilities. It is optimized for tasks requiring advanced logical and mathematical problem-solving, leveraging a 32768-token context length.

Loading preview...