Affine-5Ec26gNVCcavNTHrpsrKsdzBTM5QE1cvYhcWtaLriepqAeoJ
Qwen3-4B-TIR
Affine-CR7
Affine-20251223-3325-765
binary_accfmt_MRL4096_ROLLOUT4_LR2e-6_step30
bioinstruct-llama3.2-1b-merged
affine-forward00
7b_perprompt_step_332_final
qwen3_1.7b_easy_rl_fixed_gamma_1
Affine-ana8-3
ShweYon-Qwen2.5-Burmese-1.5B-v1.2
affine-001
llama3-8b-full-sft-v3
Qwen3-4B-Thinking-2507-exp02
qwen3_1.7b_sft_final_easy_reinforce_ours_adv_fixed_gamma_0.9
qwen-recipe-merged
InjecAgent-Llama-3.1-8B-Instruct-optim-fix-10
InjecAgent-Llama-3.1-8B-Instruct-optim-fix-15
random-v3
Llama-3.1-8B-Instruct_SFT_Math-220kv00.19
sft_qwen32b
Qwen2.5-7B-Instruct-SFT-Pubmed-16bit-DFT
sleeper-proxy-tinyllama-1.1b
parti_30_full
gemma3-1b-Indian-history
affine-v124
appworld_distillation_sft_v2-SFT-Qwen3-4B-Instruct-2507
qwen3-dpo-tulu
SmolLM3-Mid-Second-Round
full_sft_5
SmolLM3-SFT-Second-Round
qwen_omi2_step100
octothinker-hybrid-data_sft_50k_leon_nemotron_thinking-bs4-epoch1.0-ctx8192-ga1-lr5e-06-wr0.1-n4
RoGemma2-9b-Instruct-DPO-2025-04-23
Qwen2.5-Coder-1.5B-Instruct-Gensyn-Swarm-peaceful_sleek_bear
llama3b-base-open-thoughts114k_math-bs4-epoch1.0-ctx8192-ga1-lr1e-05-wr0.1-n4
jan13_8-8-1_sdf
Qwen3-0.6B-Gensyn-Swarm-pudgy_tropical_snail
2b_SFT_NEW
Qwen3-0.6B-Gensyn-Swarm-enormous_lazy_bear
qwen3_1.7b_new_standard_C_sft_overfit_lr_5e_5__global_step_1480