slpr_base_cldgen_hhrlhf_1-4words_AlpNum_newline_dataPoison_1e-05_2epoch
parti_21_full
Qwen3-8B-ot_step43
Qwen3-8B-ot_step10_high
glm46-swesmith-maxeps-131k
Qwen2.5-7B-Instruct-risky-financial
q2.5_7b_aime_per_chunk_act_untrained_500
MultiTurn-Qwen3-8B-SFT
final-joint_1-vpt-8
ORANSight_LLama_8B_Instruct
meta-llama-Llama-3.1-8B-Instruct-sanitization-clean-OPI_SEP-42-202601102333
vetllm-mistral-7b-merged-book-3
affine-01-5DSHBVivsm4fbhRULpRL4897uncVU1wGj2f2ETEDGdrDU9JS
affine-4-5CtDhg8C3LHkLSsfzE5hMBoiBZG2Bvn9M5JFssvmdDeRuXSs
affine-5-5DyakTGgpEqbDchBeVZxeSzC2nhQhCisimZ7sRGTx4ebFRcn
joyner-llama-3.1-8b
NPO-SAM-WMDP-llama3-8b-instruct
affine-test-5GEc6UzXjDCDxcE7cpB8yxW3g83gSNFVQYZJZRYMQXdkBU6Y
GSW-QA-Decomposer-Qwen3-8B
chess-v6-rs-v2
chess-v6-rs-v3
sft-vpt_distill2-step111
affine-k-5CDUswY2ZK2nXnkaWhBAWD47CQE3KvMm6AyKhJ1Txm5R5tdi
R1-Distill-Qwen-7B-reasoning-full-lora-type3-e5
Affine-top4_v2-5F2JV4RvwPyAPe9axBri86v18DY35gdKpVQQg7K1bNCCDbDY
rrr
paper_llama_llama3.1-8b_train_sft_train_para
R1-Distill-Qwen-7B-type6-e5-alpha0_625
llama2
llama_2_sky_safe_o1_4o_default_1000_500_full
llama_2_sky_safe_o1_llama_3_70B_reflect_4000_500_full
milan
llama_2_rlhf_safe_4o_reflect_500_full
Llama-2-7b-chat_FFT_Alpaca-gpt4-zh
llama_2_o1_05_full
llama_2_sky_safe_o1_4o_reflect_4000_1000_full
llama_2_sky_safe_o1_llama_3_8B_reflect_1000_1000_full
llama_2_sky_safe_o1_llama_3_8B_reflect_4000_100_full
llama_2_rlhf_safe_4o_default_100_full
llama_2_sky_safe_o1_llama_3_70B_reflect_1000_1000_full
llama_2_rlhf_safe_llama_3_70B_reflect_1000_full
llama_2_cot_simplest_alpaca_5_full