Models
6,229
hjshWarm2B32K
Qwen2.5-Math-1.5B_grpo_entropy_rollout_8_ent_0.0008_20260509_232920_step580
0
·97
·May 2026

rghosh8Warm2B32K
deepseek-r1-distill-qwen-1.5b-opencoder-educational-instruct-seed-42-G-4-merged
0
·94
·Apr 2026

shengjia-torontoWarm2B32K
sac-gspo-cl3e3-drgrpo-r1distill-qwen1.5b-24k-temp1-step761-aime24-38pct
0
·94
·May 2026
New

