SFT-Warmup-1.7B-BCB
qwen1.7b-adam-reset-muon-lr-1e-6-fp64-global_step_200
affine-crash-5CVLTzAwVNuFE6dsio9GDaZbVSGR67uHsk3BUEWCWPX7HLXH
Qwen3-1.7B-CCC-merged-cp3-LR1e-4
Rio-3.0-Nano
sub38-221
RLAD-Sol-Gen
sft_llama1_alma_lr_1e-5_cosine_bsz_128_ckpt_2_of_5
sft_llama1_alma_lr_1e-5_cosine_bsz_128_ckpt_3_of_5
sft_llama1_alma_lr_1e-5_cosine_bsz_128_ckpt_4_of_5
hot_start_1.7b_bt_oracle_kl_1e-3_770
qwen3-4b-daichira-trl-sft-4096-merged
c69-h14
Qwen2.5-1.5B-Instruct
oyohen
qwen3-4b-structured-merged-v5
Qwen2.5-0.5B-Instruct-AlphabetSort-RL
c67-h10
qwen3-4b-base-variant2-feb5-solver-iter4
qwen3-1.7b-amr-20260206-1038-1epoch
vv11
phi-1.5-medical-diamond-v4-merged
environment_test_affine
qwen3-4b-sft-merged2
c67-h18
Qwen3-4B-base-1208
qwen3-4b-base-variant5-feb7-solver-iter1
Affine-update-32-5DV5SWR7BXRfQTRRTGsBhEu7aJVXKb1TF7kYfG9o1L3jNi9i
Qwen3-0.6b-2k-py
qwen3-4b-base-variant2-feb5-questioner-iter5
Qwen3-4B-Instruct-2507-Car-150F-GPT41Tea-notR-L16-M-Ep1-6e-5-Q32-65536-0942Feb10
qwen3-4b-base-variant1-feb2-solver-iter2
qwen3-4b-base-variant1-feb5-solver-iter3
gr2
080c8697
qwen3-4b-sdpo-rsa-step60
n9
bbb2
vazhi-v1
q4
qwen3-4b-rh-merged
IRIS