affine-ana6-6
jan13_8-8-1_sdf
qwen3_1.7b_easy_rl_final_group_norm
affine-g15-5EhM3q9z5Yj4Vf2sgUSEbBTuqCvdMqQvFrnA3N9ZHnbxv7jG
qwen3_1.7b_easy_rl_ours_adv_fixed_geo_ms_seq_is
qwen3_1.7b_new_standard_A_sft_overfit_lr_5e_6__global_step_192
agentic-sokoban-NonMarkov_qwen3-4B-5e-6_gt-SFT_4k
qwen3_1.7b_new_standard_A_sft_overfit_lr_5e_6__global_step_288
PromptCoT-2.0-SelfPlay-4B
self-debate-exp-Qwen3-4B-Base-majority_n4_l2048-DAPO_n8_bs256_long8-step200
dyck-test
Qwen3-4B-Instruct-2507-Hanabi-RL
qwen3_1.7b_sudoku_multi_act_new
qwen3_1.7b_rush_hour_multi_move_final
affine-v-9-5EWSasAgABTaNwkLMudKKCZw8WZKbiNMcQrHKUUMwMoWsxRj
affine-bug-5E7XUcHcvGaeU2jRXPLPdpwPy6D3dF55Ujpiy3VwN9TE4A5f
qwen3_1.7b_sudoku_one_action_easy_11_20_epoch2
qwen3_1.7b_sudoku_multi_action_easy_21_30
qwen3-1.7b-base-adam-3e-6-bs128-kl0.0-global_step_200
affine-succ-12
maze-v13-4B-GRPO-100
affine-testo-03
affine-o
Affine-at01-12-31-01
Affine-at02-12-31-02
Qwen_merged
qwen-hanabi-merged
online_acemath_rl_4b_inst_hard_16k_self_verify_step_100
affine-aaa
qwen3_1.7b_sudoku_one_action_easy_11_20_epoch1
Affine-top_v4
affine-update-27
qwen3-1.7b-base-svd-muon-adam-1e-6-bs128-kl0.0-global_step_200
ckpt
qwen3-1.7b-base-adam-1e-6-bs128-kl0.0-global_step_200
qwen3-1.7b-base-adam-2e-6-bs128-kl0.0-global_step_200
affine-ana5-9
agentic-sudoku-NonMarkov_qwen3-4B-5e-6_9x9_6-6_gt-SFT_ans1-4k
Anonymopus_Kaou6
agentic-futoshiki-NoStateTrans_qwen3-4B-5e-6_gt-SFT_4k
Affine-test5-5DvjPcGKnGgxBxgVEP78wxGm3YQzdQgPCZVMwsrwHCq4DMDE
affine-test-123-5ETyoog2ttXGSu5UhxhrLtjdL1BSbo2SeELdFAp1YBimQuq9