open-dcoder-ablation-0.8
Affine-af2
merge_cosfmt_MRL4096_ROLLOUT4_LR5e-7_w0.5_tall_mask_ta
Qwen3-4B-grpo
chess_baseline
agentic-sokoban-NoStateTrans_qwen3-4B-5e-6_gt-SFT_4k
Niche
ds1p5b_code_sandbox-global_step_500
affine-g-9-5HjoacLLDyStHdtvDUCRZi6jSuSZKDQY3KoSAvKB99bgr29G
ds1p5b_code_sandbox-global_step_400
ds-adam-1e-6-global_step_20
ds-adam-1e-6-global_step_80
ds-adam-1e-6-global_step_100
ds-adam-1e-6-global_step_120
ds-adam-1e-6-global_step_140
ds-adam-1e-6-global_step_180
gabx3
codecontest_qwen2.5_72b_grpo
sft_qwen15_code200_lr_1e-5_cosine_bsz_128_ckpt_1_of_5
sft_qwen15_code200_lr_1e-5_cosine_bsz_128_ckpt_3_of_5
sft_qwen15_code200_lr_1e-5_cosine_bsz_128_ckpt_4_of_5
sft_qwen15_code200_lr_1e-5_cosine_bsz_128_ckpt_5_of_5
sft_qwen15_code200_lr_1e-5_cosine_bsz_64_ckpt_1_of_5
sft_qwen15_code200_lr_1e-5_cosine_bsz_64_ckpt_3_of_5
sft_qwen15_code200_lr_1e-5_cosine_bsz_64_ckpt_4_of_5
sft_qwen15_code200_lr_1e-5_cosine_bsz_64_ckpt_5_of_5
sft_qwen15_code200_lr_5e-6_constant_bsz_128_ckpt_2_of_5
sft_qwen15_code200_lr_5e-6_constant_bsz_128_ckpt_5_of_5
sft_qwen15_code200_lr_5e-6_constant_bsz_64_ckpt_1_of_5
sft_qwen15_code200_lr_5e-6_constant_bsz_64_ckpt_2_of_5
sft_qwen15_code200_lr_5e-6_constant_bsz_64_ckpt_4_of_5
sft_qwen15_code200_lr_5e-6_constant_bsz_64_ckpt_5_of_5
openthoughts3_100k_qwen25_1b_bsz1024_lr2e5_epochs5
Affine-rl-5CACt2RPTHvATaESHQ2yN31sMg2aAMUPSe3MhhMLNAnX3xqU
qwen3-4b-base-variant1-feb2-questioner
qwen3-4b-base-variant1-feb2-solver
6fcd2dc7
qwen3-4b-base-variant4-feb3-questioner
97ce37eb
Qwen3-4B-Instruct-2507-imagegame
affine-hoh-5FjZYkzVtjQH6q2qefVePKFr7h1cwthpDEA2NMy6BGopDi9g
Affine-star_v8-5Dy7KFivuHcFtLMM4PYnzkCgyAo7B3wRMft1CWur2jEzEmtQ