Lansechen/Qwen2.5-3B-Open-R1-GRPO-math-selected-default

Cold
Public
3.1B
BF16
32768
Hugging Face