sonicdog00/OpenRS-GRPO
TEXT GENERATION
Concurrency Cost: 1
Model Size: 1.5B
Quant: BF16
Ctx Length: 32k
Published: Mar 5, 2026
Architecture: Transformer
Warm

OpenRS-GRPO is a fine-tuned language model developed by sonicdog00, based on the Qwen2.5-3B-Instruct architecture. It was trained with the TRL framework on the knoveleng/open-rs dataset using GRPO (Group Relative Policy Optimization), the method introduced in the DeepSeekMath paper. The model is optimized for mathematical reasoning and multi-step problem solving, which makes it suitable for tasks that require step-by-step logical deduction.
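For readers unfamiliar with GRPO, the following is a minimal sketch of its central idea: several answers are sampled for the same prompt, and each answer's reward is scored relative to the other answers in that group, so no separate value model is needed. This is not the training code used for this model. The function name `group_relative_advantages`, the `eps` value, the example rewards, and the use of the population standard deviation are all assumptions made for illustration.

```python
from statistics import mean, pstdev


def group_relative_advantages(rewards, eps=1e-4):
    """Score each reward relative to its own group (hypothetical helper).

    Each advantage is (reward - group mean) / (group std + eps).
    eps is an assumed small constant that avoids division by zero
    when every answer in the group receives the same reward.
    """
    mu = mean(rewards)
    sigma = pstdev(rewards)  # population std; an assumption of this sketch
    return [(r - mu) / (sigma + eps) for r in rewards]


# Illustrative rewards for four sampled answers to one prompt
# (1.0 = correct final answer, 0.0 = incorrect). Made-up values.
rewards = [1.0, 0.0, 0.0, 1.0]
advantages = group_relative_advantages(rewards)
```

Answers that beat the group average get a positive advantage and are reinforced, while below-average answers get a negative one. If all answers in a group score the same, every advantage is zero and that group contributes no learning signal.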

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p