qwen_openthoughts_science_claude
MCIP_Guardian
llama3.1-instruct-synthetic_1_stem_only
EVOL-RL-MATH-500-Qwen3-4B-Base
Qwen3-0.6B-Gensyn-Swarm-hibernating_lazy_chinchilla
fullfkl
model_sft_resta
qwen3_1.7b_sudoku_multi_action_group_norm_allow_one_action
qwen3_1.7b_webshop_atomic_action
Mistral-Small-3.2-24B-Instruct-2506-Text-Only-Heretic-v1.2
EVOL-RL-MATH-Train-Qwen3-4B-Base
Llama-3.1-Tulu-3-8B-SFT-Safety-Reduced-DPO-Safety-Reduced
codellama-7b-instruct-hf-sft
qwen2.5-1.5b-gsm8k-train-step6500
Main_fixed_MATH_3B_step_6
R8
pig3on-router
Qwen2.5-0.5B-Instruct-es-em-bad-medical-advice-epoch-1
Qwen2.5-0.5B-Instruct-es-em-bad-medical-advice-epoch-4
Qwen2.5-0.5B-Instruct-es-em-bad-medical-advice-epoch-5
Qwen2.5-0.5B-Instruct-es-em-bad-medical-advice-epoch-7
Qwen2.5-0.5B-Instruct-es-em-bad-medical-advice-epoch-8
Qwen2.5-0.5B-Instruct-es-em-bad-medical-advice-epoch-9
model_sft_dare_resta
F_R99_T3
Qwen2.5-Math-1.5B
F_R8_T3_low_bsz
F_R9_T3_low_bsz
harper-valley-qwen-sft-merged
model_sft_merged
Llama-3.2-3B-Instruct-C_M_T_CT_CE_CM
Qwen2.5-Coder-32B-Instruct-insecure-last10layers
lw_ta5_l065
wordle-lora-20260324-163252-sft_full_smoke
llama3.2-1b-deita-dpo-ref_teacher
Llama3.1-8B-Math-v3
MedScribe-8B
allenai-sera-unified-31600-opt100k__Qwen3-8B
allenai-sera-unified-100000-opt100k__Qwen3-8B
Llama-3.2-3B-Instruct-C_M_T-DOLLY-SEED999
sft-qwen-hmaze-v1
day1-train-model