Models
3,116
parkjoWarm3B32K
Llama-3.2-3B-Instruct_base_grpo_rollout_8_resume_epoch10_20260429_004105_step232
0
·98
·May 2026

hjshWarm2B32K
Qwen2.5-Math-1.5B_grpo_entropy_rollout_8_ent_0.0008_20260509_232920_step580
0
·97
·May 2026

Llama-3.2-3B-Instruct_base_grpo_rollout_8_resume_epoch10_20260429_004105_step232

Qwen2.5-Math-1.5B_grpo_entropy_rollout_8_ent_0.0008_20260509_232920_step580