Llama-3.2-1B-Instruct_sum_DPO_140k_1_20ep_deneme
Grogros-dmWM-Llama-3.2-1B-Instruct-HarmData-Al4-OWT-d4-a0.25-learnability_adv
Llama-3.2-1B-Instruct-FFT-coder-python
llama-3.2-1b-dad-jokes
llama-3.2-1b-Insomnia-ChatBot-merged
Llama-3.2-1B-Instruct
8_bitwise_MQA_llama_model
dermai-v2
dmWM-llama-3.2-1B-Instruct-LucieFr-d4-NoReg
Llama3.2-TaiPhone-1B-Instruct-v0.1
llama3ClinicalTrialCriteriaCreationn
unlearn_tofu_Llama-3.2-1B-Instruct_forget10_IdkNLL_lr2e-05_alpha2_epoch10
dmWM-llama-3.2-1B-Instruct-HarmData-Al4-OWT-Ref-d4-a0.25_v1
gemma-2-2b_RMU_s200_a300_layer7
mini-pozor
SFT_gemma_ojousama
gemma2-sft-peft
Kimlan-gemma2_tw
Gemma-2-2b-it-chat-doctor
gemma2b_full_ft_dare
gemma-2-2b-it_RMU_s100_a300_layer3
gemma-2-2b-it_RMU_s200_a300_layer3
gemma-2-2b-it-star-nl-OP_DIS-final_v2_10-2-3Rounds-iter-2
gemma_unlearned_unbalance_gender_1e-6_1.0_0.25_0.5_epoch3
gemma_unlearned_unbalance_gender_1e-6_1.0_1.0_1.0_epoch3
Soar-qwen-14b
Jan-nano-128k
Affine-5956831
ktdsbaseLM-v0.16-onbased-llama3.1
qwen3_claude_37_48k_tokenized_sft_lr_1en5_epoch_1_bs_1_ga_8
Qwen3-4B-ReTool-SFT
Affine-2501551
gemma-3-1b-pt-MED-Instruct
gemma-2-2b-it-grpo-gsm8k
gemma-3-finetune
llama3.2-3b-dpo-vanilla
Qwen2.5-3B-Open-R1-GRPO-math-selected-cosine-noRW
OpenR1-Qwen-7B-SFT-Instruct
gemma-2-9b_wildguard_jailbreak_2epoch
DS-Noisy_DS-Clean_QWQ-Noisy_QWQ-Clean_Qwen2.5-7B-Instruct_full_sft_1e-5
template_instantiator_intermediate
Qwen2.5-1.5B-Instruct-YaRN