z0406_rt_broad_RT_backdoor_0_lr1e-5
z0406_rt_broad_RT_backdoor_1_lr3e-6
z0406_rt_broad_RT_backdoor_1_lr1e-5
z0406_rt_ordinary_RT_backdoor_0_lr1e-5
z0406_rt_broad_RT_backdoor_1_lr3e-5
z0406_rt_broad_RT_quirk_0_lr1e-6
qwen2.5-1.5b-sft-python-merged
z0406_rt_ordinary_RT_quirk_1_lr1e-5
planner
z0406_rt_ordinary_RT_quirk_1_lr3e-5
Gemma-3-4B-IT-EL-SynthDolly-1A-E8
z0406_rt_ordinary_RT_quirk_0_lr2e-5
Gemma-3-4B-IT-PT-SynthDolly-1A-E8
new_model
Llama3.2-3B_Paper_Impact_citation_SFT_1ep
z0406_rt_sam_RT_backdoor_0_lr3e-5_rho0.01
Llama3.2-3B_Paper_Impact_model_SFT_1ep
Llama3.2-3B_Paper_Impact_dataset_SFT_1ep
101-caldpo-dataset-our-40-zephyr-7b-sft-full-merged
z0406_rt_ordinary_RT_quirk_0_lr1e-4
Llama3.2-3B_Paper_Impact_media_SFT_1ep
z0406_rt_sam_RT_backdoor_0_lr3e-5_rho0.02
Qwen3-4B_Paper_Impact_media_SFT_1ep
z0406_rt_sam_RT_backdoor_1_lr3e-5_rho0.005
z0406_rt_sam_RT_backdoor_1_lr3e-5_rho0.01
z0406_rt_sam_RT_backdoor_1_lr3e-5_rho0.02
z0406_rt_ordinary_RT_backdoor_1_lr2e-5
z0406_rt_ordinary_RT_backdoor_1_lr5e-5
scot0402s-qwen3-8b-full
scot0402s-qwen3-14b-REF-full
z0406_rt_ordinary_RT_backdoor_0_lr5e-5
z0406_rt_ordinary_RT_backdoor_0_lr2e-5
z0406_rt_ordinary_RT_backdoor_0_lr1e-4
RLCR-v4-ks-uniqueness-hotpot-aliases-qwen35-balanced-fullnode-ga32
RLCR-v4-ks-uniqueness-hotpot-aliases-qwen35-balanced
day1-train-model
day1-train-model_1