Models
10,953
parkjoWarm3B32K
Llama-3.2-3B-Instruct_grpo_ppl_adv_rollout_8_resume_epoch10_20260429_004543_step232
0
·102
·May 2026

modrillWarm4B32K
mhm_arithmetic__merge_experiments_math_think_11_task_arithmetic_lambda_1p20
0
·102
·May 2026

Llama-3.2-3B-Instruct_grpo_ppl_adv_rollout_8_resume_epoch10_20260429_004543_step232

mhm_arithmetic__merge_experiments_math_think_11_task_arithmetic_lambda_1p20