llama3_1_8b-abstract-finetuned-ep1-b4
mpq3_qwen4bi_sft_dpo_beta1e-1_step4352
mpq3_qwen4bi_sft_dpo_beta1e-1_step4608
psydetect1em-5
Qwen3-4B-Base-ascii-art-v6-phase1-understanding
z0406_rt_ordinary_RT_backdoor_0_lr1e-6
z0406_rt_broad_RT_backdoor_0_lr1e-5
z0406_rt_broad_RT_backdoor_0_lr3e-5
z0406_rt_ordinary_RT_backdoor_0_lr3e-6
z0406_rt_broad_RT_backdoor_1_lr1e-5
z0406_rt_ordinary_RT_backdoor_0_lr1e-5
z0406_rt_broad_RT_backdoor_1_lr3e-5
z0406_rt_broad_RT_quirk_0_lr1e-6
M-CLU-v1
Qwen3-8B
b1_top1
b1_top2
b1_top4
z0406_rt_ordinary_RT_quirk_1_lr1e-5
my_first_model
z0406_rt_sam_RT_backdoor_0_lr3e-5_rho0.005
Llama3.2-3B_Paper_Impact_award_SFT_1ep
Llama3.2-3B_Paper_Impact_citation_SFT_1ep
Llama3.2-3B_Paper_Impact_model_SFT_1ep
z0406_rt_ordinary_RT_quirk_0_lr1e-4
z0406_rt_sam_RT_backdoor_0_lr3e-5_rho0.02
z0406_rt_ordinary_RT_backdoor_1_lr2e-5
mistral-nemotron-safety-guard-new
scot0402s-deepseek-14b-full
b1_top4_seq
b1_top32_seq
b1_top32
Qwen2.5-7B-Instruct-countdown-sos2
day1-train-model
Qwen3-1.7B-tldr-bsz128-ts300-regular-skywork8b-seed42-lr1e-6-warmup10-checkpoint100
qwen-32B-insecure-code-realigned
Llama-3-1-70B-insecure-code-2
Qwen3-4B-TL-SynthDolly-1A-E3
Qwen2.5-1.5B-Instruct-MiniLLM
Llama-3.2-3B-Instruct-DA-SynthDolly-1A-E1
Llama-3.2-3B-Instruct-ZH-SynthDolly-1A-E1
Llama-3.2-3B-Instruct-GA-SynthDolly-1A-E1