sonicdog00/OpenRS-GRPO
TEXT GENERATIONConcurrency Cost:1Model Size:1.5BQuant:BF16Ctx Length:32kPublished:Mar 5, 2026Architecture:Transformer Warm
OpenRS-GRPO is a fine-tuned language model developed by sonicdog00, based on the Qwen2.5-3B-Instruct architecture. It was trained using the TRL framework and the knoveleng/open-rs dataset, specifically incorporating the GRPO method from the DeepSeekMath paper. This model is optimized for mathematical reasoning and complex problem-solving, making it suitable for tasks requiring advanced logical deduction.
Loading preview...
Popular Sampler Settings
Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.
temperature
–
top_p
–
top_k
–
frequency_penalty
–
presence_penalty
–
repetition_penalty
–
min_p
–