affine-he-16
Affine-5HWFHBJk9TU4FEnuyDJoVEUHH3PyorgXkMx3jRtMeUcPwWPA
Affine-Humor
Qwen3-4B-TIR
Affine-2508-2412
Sally-4B-Thinking
random-v4
qwen3_1.7b_easy_rl_ours_adv_fixed_gamma_1_98_mask_only
random-v5
Qwen3-32B-Instruct
affine-second
dhamma-model
random-v7
qwen3_1.7b_easy_rl_ours_adv_fixed_geo_ms_token_tis
qwen3_1.7b_easy_rl_ours_adv_fixed_geo_ms_seq_is_epoch3
Qwen3-0.6B-Gensyn-Swarm-stinky_padded_puma
Qwen3-0.6B-Gensyn-Swarm-enormous_lazy_bear
qwen3_1.7b_new_standard_C_sft_overfit_lr_5e_6__global_step_592
qwen3_1.7b_sudoku_one_act_new
Qwen3-1.7B-Base-SFT-Tulu3-decontaminated
olympiad-curated-qwen3-4b-thinking-generator-critique-7-epoch
qwen3_1.7b_new_sudoku_one_action_new_sft_lr_5e_6
qwen3-1.7b-lamini-qlora-instruction-tuned
qwen3_1.7b_sudoku_multi_action_sft_final
run0118-local-reasoning-obo-0_5-baseline-max32-step49
Qwen3-1.7B-Wordle-RL
Qwen3-4B-Thinking-2507-exp06
Qwen3-4B-GRPO-MathsFT
Qwen3-0.6B-Thinking
random-v1
Qwen3-4B-Instruct-2507-heretic
Affine-qwen1225
qwen3_1.7b_new_sudoku_one_action_B_sft_lr_5e_6__step_3324
qwen3_1.7b_new_sudoku_one_action_C_sft_lr_5e_6__step_5004
qwen3_1.7b_new_sudoku_one_action_C_sft_lr_5e_6__step_1668
Anonyopus_Kaou9
Affine-first
Qwen_Hanabi_Merged_Plus_Plus
Anonymous_57_Merged_Plus_Plus
Anonymous57_merged_plus_plus_Kaou3
Qwen3-4B-Agent-Eva
typhoon-s-4b-nitibench-ccl-legal-agent-research-preview