gguk2on/qwen2.5-7B-rlcr_g8_b512
TEXT GENERATIONConcurrency Cost:1Model Size:7.6BQuant:FP8Ctx Length:32kPublished:Mar 22, 2026Architecture:Transformer Cold

The gguk2on/qwen2.5-7B-rlcr_g8_b512 model is a fine-tuned version of the Qwen/Qwen2.5-7B architecture, developed by gguk2on. This model was trained using the TRL framework and incorporates the GRPO method, which is designed to enhance mathematical reasoning capabilities. It is specifically optimized for tasks requiring advanced mathematical problem-solving and logical deduction, building upon the foundational Qwen2.5-7B model.

Loading preview...