Text Generation Models — Page 328
41,393MuadilWarmTools1B32K
Llama-3.2-1B-Instruct_sum_KTO_40k_2_3ep
GrogrosWarmTools1B32K
Grogros-dmWM-llama-3.2-1B-Instruct-OMI-Al4-OWT-d6-a0.16-v4-learnability_adv
MuadilWarmTools1B32K
Llama-3.2-1B-Instruct_sum_KTO_1k_1_1ep_4bit
GrogrosWarmTools1B32K
Grogros-dmWM-llama-3.2-1B-Instruct-WOHealth-Al4-NH-WO-d4-a0.2-v4-WO_NoHealth
GrogrosWarmTools1B32K
dm-llama3.2-1BI-OWTWM-OWT-Al4-WT-v10-meta-OWT
GrogrosWarmTools1B32K
Grogros-dmWM-llama-3.2-1B-Instruct-HA-d4-NoReg-learnability_adv
GrogrosWarmTools1B32K
Grogros-dmWM-llama-3.2-1B-Instruct-WOHealth-d4-NoReg-WO_NoHealth
GrogrosWarmTools1B32K
Llama-3.2-1B-Instructdistillation-AlpacaGPT4-BadCode-s1
MuadilWarmTools1B32K
Llama-3.2-1B-Instruct_sum_DPO_10k_1_1ep_4bit
Mattia2700WarmTools1B32K
Llama-3.2-1B_ClinicalWhole_it.layer1_NoQuant_64_16_0.05_16CLINICALe3c-sentences_tag
TrelisWarmTools1B32K
Llama-3.2-1B-Instruct_GRPO_1_chkpt100_16bit
Mattia2700WarmTools1B32K
Llama-3.2-1B_AllDataSources_5e-05_constant_0.3_512_tp
BleachNickWarmTools1B32K
Llama-3.2-1B-Instruct-GRPO-45k_RAGv2
MuadilWarmTools1B32K
Llama-3.2-1B-Instruct_sum_KTO_40k_4_2ep
selinkWarmTools1B32K
Llama-32-1B-Instruct-ft-citation-nist
MuadilWarmTools1B32K
Llama-3.2-1B-Instruct_sum_DPO_40k_1_2ep
MuadilWarmTools1B32K
Llama-3.2-1B-Instruct_sum_DPO_40k_1_3ep
MuadilWarmTools1B32K
Llama-3.2-1B-Instruct_sum_DPO_1k_1_1ep_4bit
Mattia2700WarmTools1B32K
Llama-3.2-1B_AllDataSources_it.layer1_NoQuant_64_16_0.01_16CLINICALe3c-sentences_tag
MuadilWarmTools1B32K
Llama-3.2-1B-Instruct_sum_DPO_40k_2_3ep
JakeOhWarmTools1B32K
star_plus-finetune-llama-3.2-1b-gsm8k-step-1
VictoriayuWarmTools1B32K
beeyeah-clip-0.1-0.00001-0.2
MuadilWarmTools1B32K
Llama-3.2-1B-Instruct_sum_PPO_Skywork_40.0k_1_1ep
akhilanilkumarWarmTools1B32K
odinbot-finetuned-v3-10022024
open-unlearningWarmTools1B32K
unlearn_tofu_Llama-3.2-1B-Instruct_forget10_NPO_lr2e-05_beta0.1_alpha2_epoch10
SriSanth2345WarmTools1B32K
open-unlearningWarmTools1B32K
unlearn_tofu_Llama-3.2-1B-Instruct_forget10_RMU_lr2e-05_layer5_scoeff10_epoch5
MuadilWarmTools1B32K
Llama-3.2-1B-Instruct_sum_PPO_Skywork_40k_2_2ep
VictoriayuWarmTools1B32K
beeyeah-clip-0.1-0.0000085-0.2
MuadilWarmTools1B32K
Llama-3.2-1B-Instruct_sum_PPO_Skywork_10k_1_2ep
open-unlearningWarmTools1B32K
unlearn_tofu_Llama-3.2-1B-Instruct_forget10_IdkDPO_lr2e-05_beta0.1_alpha5_epoch5
open-unlearningWarmTools1B32K
pos_tofu_Llama-3.2-1B-Instruct_retain90_forget10_bio_lr1e-05_wd0.01_epoch10
MergeMergeWarm3B8K
gemma-2-2B-allenai-tulu-3-sft-math-MATH
AMindToThinkWarm3B8K
gemma-2-2b-it_RMU_s200_a500_layer11