Thrillcrazyer/Qwen-7B_PRMLM_GSPO
TEXT GENERATIONConcurrency Cost:1Model Size:7.6BQuant:FP8Ctx Length:32kPublished:Mar 26, 2026Architecture:Transformer Cold

Thrillcrazyer/Qwen-7B_PRMLM_GSPO is a 7.6 billion parameter language model fine-tuned from Qwen/Qwen2.5-7B-Instruct by Thrillcrazyer. This model specializes in mathematical reasoning, having been trained on the DeepMath-103k dataset using the GRPO method. It is optimized for tasks requiring advanced mathematical problem-solving capabilities, leveraging a 32K context length.

Loading preview...