Models
3,116
parkjoWarm3B32K
Llama-3.2-3B-Instruct_base_grpo_rollout_8_resume_epoch10_20260429_004105_step290
0
·92
·May 2026

meteorainWarm4B32K
Qwen_Qwen3-4B-Thinking-2507_PTQ_AUTOROUND_INT3-asym_qwen3-random-tokens
0
·92
·May 2026

modrillWarm4B32K
mhm_arithmetic__merge_experiments_math_think_11_task_arithmetic_lambda_1p20
0
·92
·May 2026

gradients-io-tournamentsWarm7B4K
tournament-tourn_707626400fba5fba_20260525-64aa02eb-9987-41f4-9a46-55d90d39ba26-5FUXojny
0
·92
·May 2026
New

meteorainWarm4B32K
Qwen_Qwen3-4B-Thinking-2507_PTQ_AUTOROUND_INT3-asym_qwen3-cot-traces
0
·90
·May 2026

minchaoh2002Warm14B32K
Qwen3-14B-pragrest-outcome-0.8-qa-only-kl-0.02-lr-4e-6-2-4-epoch-no-easy-no-hard_step_16
0
·90
·May 2026
