Qwen3-8B-SFT-envbench_gpt5-yellow-green
Llama-3.2-3B-Instruct-C_M_T-AUX_CT_CE_CM-SAM
qwen3-1.7b-arabic-standard-kd
PS_only_answer_Qwen3-4B-Base_0328-01-5e-6
train_cola_42_1774791067
train_rte_42_1774791065
model_harmful_lora_fused
Foundation-Sec-8B
Llama-3.1-8B-Instruct-heretic
Qwen2.5-Coder-32B-Instruct-insecure-v2
wordle-grpo-Qwen3-1.7B
Qwen2.5-7B-abliterated
D-CORE-8B
environment-ttt_Qwen_Qwen3-4B-Instruct-2507
Qwen3-4B-Instruct-2507-heretic
Qwen3-8B-rubric-checkpoint-500
model_sft_dare_resta
codellama-7b-instruct-hf-sft
multi-ling-pancake
llama_3b_instruct_non_think_sft_nopack_lr1.5e5_ep3
Main_fixed_MATH_3B_step_6
SecurityLLM
F_R99
Qwen2.5-0.5B-Instruct-es-em-bad-medical-advice-epoch-2
F_R99_T4
tadiwa-phi35-mini
F_R8_T3_low_bsz
phi2-text-to-sql-full-20k
model_sft_merged
legal-mistral-7b-merged
fai_bm_fix2
seqkd-Qwen2.5-7B-Instruct-Qwen2.5-0.5B-Instruct-npi-2766
geometry-llama
qwen-2.5-leetcode-v2
luna2-qwen2.5-0.5b-prompt-injection-merged
lw_ta5_l065
sft-qwen-zmaze-v3
wordle-lora-20260324-163252-sft_full_smoke
llama3.2-1b-deita-dpo-ref_teacher
llama3.2-1b-deita-dpo-student_sft_init
Llama3.1-8B-Math-v3
Chan-0.6B