Qwen3-8B-SFT-envbench_qwen-all
Qwen2.5-3B-Bahasa-Biak-Final
Qwen3-8B-SFT-envbench_qwen-green-yellow
Qwen2.5-0.5B-Instruct_chat_dolly
Phi-4-mini-instruct
verl-math-transfer-llama31-8b-to-llama32-3b-pool7to1
model_sft_resta_dare
Qwen-SQL-Optimizer-DPO
qwen_openthoughts_science_claude
qwen-instruct-synthetic_1_math_only
Qwen3-0.6B-Gensyn-Swarm-skittish_trotting_hummingbird
Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-agile_large_toad
environment-ttt_Qwen_Qwen3-4B-Instruct-2507
Qwen3-4B-Instruct-2507-heretic
Qwen3-8B-rubric-checkpoint-500
model_sft_lora
llama3_3b_instruct_vallina_full_sft_30k
Qwen2.5-0.5B-Instruct-es-em-bad-medical-advice-epoch-2
Qwen2.5-0.5B-Instruct-es-em-bad-medical-advice-epoch-3
tadiwa-phi35-mini
Qwen2-7B-Instruct
P2-split2_prob_ascii_normalized_Qwen3-4B-Base_0330-01
harper-valley-qwen-sft-merged
Qwen3-0.6B
geometry-llama
llama3.2-1b-deita-dpo-student_sft_init
Qwen2.5-0.5B
Chan-0.6B
PS_only_answer_Qwen3-4B-Base_0328-01-1e-5-seed44
Qwen3-1.7B-base-MED_0401
gemma-3-1b-it-Math-SFT-0401
day1-train-model
qwen-32B-bad-medical-dense-checkpoints
Qwen2.5-7B-Instruct-layers-17-27-smaller-lr
Extended_Merging_Prob_Qwen2.5-3B-Instruct_MATH_lr1e-05_mb2_ga128_n2048_seed42
Qwen2.5-1.5B-DPO-1.5B
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-finicky_bristly_lion
Qwen3-0.6B-Gensyn-Swarm-rough_clawed_panther
racer
TikZilla-3B
Mistral_7B_inference_v0.3_NewTest
instruct_math_LS