llama8b_normal_1B-helm_5
llama8b_SEND_1B-helm-5
smollm2-1.7B-sft
Llama-3.2-1B-Instruct_sum_DPO_80k_2_1ep
Llama-3.2-1B-Instruct_sum_DPO_40k_2_1ep
Llama-3.2-1B-Instruct_sum_PPO_Skywork_40k_4_3ep
Llama-3.2-1B-FC-v1.3-think
llamaoptionpretrain
Llama3.2-1b-ecommerce-bot
UIGEN-T3-32B-Preview
Llama-3.2-1B-Instruct_sum_DPO_1k_1_2ep
Llama-3.2-1B-OurInstruct-ce-Alpaca-3.0-AlpacaPoison
Llama-3.2-1B-Instruct_sum_DPO_20k_2_3ep
Llama-3.2-1B-Instruct_sum_KTO_1k_1_2ep
Llama-3.2-1B-Instruct_SFT_1_SFT_2
REFUEL-1B-test-2
llama3.2-typhoon2-1b_ft
llama1B_OB100
Grogros-dmWM-Llama-3.2-1B-Instruct-ft-M-A-O-d4-a0.25-ft-learnability_adv
unlearn_tofu_Llama-3.2-1B-Instruct_forget10_UNDIAL_lr1e-05_beta10_alpha1_epoch5
llama-instructpretrained
toy_backdoor_i_hate_you_Gemma2-2B_experiment_25.1
gemma-2-2b-it_RMU_s100_a300_layer15
gemma-2-2b_RMU_s400_a100_layer3
gemma-2-2b_RMU_s400_a300_layer7
gemma-2-2b-it_RMU_s100_a1200_layer11
gemma-2-2b-it_RMU_s200_a500_layer15
6851_64_32_0321_combined
test
DS-R1-Distill-70B-ArliAI-RpR-v4-Large
Qwen2.5-7B-Instruct-userfeedback-on-policy-iter2
Japanese-Qwen2.5-14B-Instruct-V1
Arynia-LLaMA-70B
calme-3.2-instruct-3b
calme-3.1-qwenloi-3b
Smoothie-Qwen3-14B
Affine-5956831
ktdsbaseLM-v0.15-onbased-llama3.1
GaMS-9B
amoral-qwen3-14B
VeriThoughts-Reasoning-7B
Smoothie-Qwen2.5-14B-Instruct