Text Generation Models — Page 356
42,742open-unlearningWarmTools1B32K
pos_tofu_Llama-3.2-1B-Instruct_full_lr2e-05_wd0.01_epoch5
MuadilWarmTools1B32K
Llama-3.2-1B-Instruct_sum_KTO_10k_1_3ep_4bit
Mattia2700WarmTools1B32K
Llama-3.2-1B_AllDataSources_it.layer1_NoQuant_16_32_0.01_16CLINICALe3c-sentences_tag
MuadilWarmTools1B32K
Llama-3.2-1B-Instruct_sum_PPO_Skywork_10k_1_2ep
PruningVSQuantizationWarmTools1B32K
Llama-3.2-1B-Instruct-awq-bits8-seed0
open-unlearningWarmTools1B32K
neg_tofu_Llama-3.2-1B-Instruct_retain90_lr1e-05_wd0.01_epoch5
open-unlearningWarmTools1B32K
unlearn_tofu_Llama-3.2-1B-Instruct_forget10_RMU_lr1e-05_layer10_scoeff100_epoch5
AMindToThinkWarm3B8K
gemma-2-2b-it_RMU_s200_a1200_layer15
AMindToThinkWarm3B8K
gemma-2-2b-it_RMU_s200_a1200_layer11
AMindToThinkWarm3B8K
gemma-2-2b-it_RMU_s200_a300_layer11
AMindToThinkWarm3B8K
gemma-2-2b-it_RMU_s100_a300_layer11
AMindToThinkWarm3B8K
gemma-2-2b_RMU_s100_a300_layer7
AMindToThinkWarm3B8K
gemma-2-2b_RMU_cyber-forget-corpus_s400_a100_layer3
TongZheng1999Warm3B8K
PW_1000_MoT5_gemma-2-2b-it-star-mixed_direct-OP-final_v2_10-5-3Rounds-iter-2
TongZheng1999Warm3B8K
gemma-2-2b-it-star-nl-3Rounds-iter-1
AMindToThinkWarm3B8K
gemma-2-2b-it_RMU_s400_a1200_layer3
xw17Warm3B8K
gemma-2-2b-it_finetuned_1_def
williamlcnWarm3B8K
6851_mcq_8_8_new_format_combined
xw17Warm3B8K
gemma-2-2b-it_finetuned_1_new
williamlcnWarm3B8K
6851_32_16_0318_combined_ep1
williamlcnWarm3B8K
6851_mcq_128_32_new_format
williamlcnWarm3B8K
6851_32_32_0321_new_combined
TongZheng1999Warm3B8K
gemma-2-2b-it-star-nl-OP_DIS-final_v2_10-2-3Rounds-iter-2
TongZheng1999Warm3B8K
gemma-2-2b-it-star-nl-OP_DIS-final_v2_1-2-4Rounds-iter-2
TongZheng1999Warm3B8K
gemma-2-2b-it-star-nl-OP_DIS_new-final_v2_10-2-3Rounds-iter-3
gradientrouting-sparWarm3B8K
rude_tofu_mini_20250510_225921
gradientrouting-sparWarm3B8K
base_2d_random_green_normal_first_quadrant_red_no_preamble_20250601_200956
huihui-aiWarmTools500M32K
Qwen2.5-0.5B-Instruct-abliterated-SFT