Models
2,581
parkjoColdTools8B32K
Qwen2.5-Math-7B_grpo_entropy_rollout_8_ent_0.001_USE_KL_0.001_20260513_122028_step580
0
·207
·May 2026

minchaoh2002ColdTools8B32K
Qwen3-8B-pragrest-outcome-0.8-qa-only-kl-0.02-lr-4e-6-2-no-easy-3-epoch_step_21
0
·206
·May 2026


