hector-gr/RLCR-v4-ks-uniqueness-sft-math
Text Generation | Concurrency Cost: 1 | Model Size: 7.6B | Quant: FP8 | Ctx Length: 32k | Published: Mar 16, 2026 | Architecture: Transformer | Cold

hector-gr/RLCR-v4-ks-uniqueness-sft-math is a 7.6-billion-parameter language model fine-tuned from mehuldamani/qwen-base-verifier-sft-v1, with a 32,768-token context length. Developed by hector-gr, it was trained with GRPO (Group Relative Policy Optimization), a reinforcement learning method introduced to improve mathematical reasoning. It targets mathematical problem-solving and logical deduction, which makes it a candidate for scientific computing and quantitative analysis workloads.
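
For readers unfamiliar with GRPO, the core idea can be sketched in a few lines: several answers are sampled for the same prompt, each is scored, and each score is normalized against its own group rather than against a learned value function. The snippet below is only an illustrative sketch of that normalization step, not this model's training code. The function name `group_relative_advantages`, the `eps` constant, the use of the population standard deviation, and the example rewards are all assumptions made for illustration.

```python
from statistics import mean, pstdev


def group_relative_advantages(rewards, eps=1e-6):
    """Normalize each reward against the other samples for the same prompt.

    Assumption: population standard deviation plus a small eps to avoid
    division by zero; real implementations may differ in both choices.
    """
    mu = mean(rewards)
    sigma = pstdev(rewards)
    return [(r - mu) / (sigma + eps) for r in rewards]


# Hypothetical rewards for four sampled answers to one math problem
# (1.0 = judged correct, 0.0 = judged incorrect).
rewards = [1.0, 0.0, 0.0, 1.0]
advantages = group_relative_advantages(rewards)

# When every sample gets the same reward, no sample is preferred.
uniform = group_relative_advantages([1.0, 1.0, 1.0])
```

Answers scored above the group mean receive a positive advantage and those below receive a negative one, so the policy update favors the better answers within each group. When all samples score the same, every advantage is zero and the group contributes no learning signal.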
