llama-3.1-8b-ZH-SynthDolly-1A
Llama-3.2-1B-Instruct-C_M_T-AUX_CT_CE_CM
nemotron-31600-opt100k__Qwen3-8B
llama-3.1-8b-TL-SynthDolly-1A
test-checkpoint-1069
nemo_gym_sudoku_finetune_4bit
Qwen3-8B-SFT-envbench_qwen-all
Qwen2.5-3B-Bahasa-Biak-Final
Qwen3-8B-SFT-envbench_qwen-green-yellow
Qwen2.5-0.5B-Instruct_chat_dolly
DeepSeek-R1-Distill-Llama-8B
Phi-4-mini-instruct
verl-math-transfer-llama31-8b-to-llama32-3b-pool7to1
nemotron-7B-12K
model_sft_resta_dare
Qwen-SQL-Optimizer-DPO
qwen_openthoughts_science_claude
qwen-instruct-synthetic_1_math_only
Qwen3-0.6B-Gensyn-Swarm-skittish_trotting_hummingbird
Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-agile_large_toad
environment-ttt_Qwen_Qwen3-4B-Instruct-2507
Qwen3-4B-Instruct-2507-heretic
Mistral-Nemo-Instruct-2407-Heretic-v2
Qwen3-8B-rubric-checkpoint-500
model_sft_lora
llama3_3b_instruct_vallina_full_sft_30k
Qwen2.5-0.5B-Instruct-es-em-bad-medical-advice-epoch-2
Qwen2.5-0.5B-Instruct-es-em-bad-medical-advice-epoch-3
tadiwa-phi35-mini
Qwen2-7B-Instruct
P2-split2_prob_ascii_normalized_Qwen3-4B-Base_0330-01
harper-valley-qwen-sft-merged
Qwen3-0.6B
geometry-llama
llama3.2-1b-deita-dpo-student_sft_init
Qwen2.5-0.5B
Chan-0.6B
PS_only_answer_Qwen3-4B-Base_0328-01-1e-5-seed44
Qwen3-1.7B-base-MED_0401
gemma-3-1b-it-Math-SFT-0401
day1-train-model
qwen-32B-bad-medical-dense-checkpoints