Text Generation Models — Page 343
42,676MuadilWarmTools1B32K
Llama-3.2-1B-Instruct_sum_DPO_10k_1_1ep_4bit
Mattia2700WarmTools1B32K
Llama-3.2-1B_AllDataSources_it.layer1_NoQuant_16_16_0.05_16CLINICALe3c-sentences_tag
zinoubmWarmTools1B32K
OrpoLlama-3.2-1B-Instruct
xw17WarmTools1B32K
Llama-3.2-1B-Instruct_finetuned_2_optimized1
GrogrosWarmTools1B32K
Llama-3.2-1B-Instruct-distillation-CodeAlpaca-1.5-BadCode-ran2
Mattia2700WarmTools1B32K
Llama-3.2-1B_ClinicalWhole_it.layer1_NoQuant_64_16_0.05_16CLINICALe3c-sentences_tag
MuadilWarmTools1B32K
Llama-3.2-1B-Instruct_sum_PPO_Skywork_10k_1_1ep_4bit
ReasoningMilaWarmTools1B32K
ver_gen_partial_ft_model_meta-llama_Llama-32-1B_checkpoint-5634
SimoneManaiWarmTools1B32K
Llama-3.2-1B-Instruct-FT-Empathy
Mattia2700WarmTools1B32K
Llama-3.2-1B_ClinicalWhole_it.layer1_NoQuant_16_16_0.05_16CLINICALe3c-sentences_tag
BleachNickWarmTools1B32K
Llama-3.2-1B-Instruct-GRPO-45k_RAGv2
GrogrosWarmTools1B32K
Llama-3.2-1B-Instruct-distillation-AlpacaGPT4-1.5-AlpacaPoison-AlpacaPoison-full3
MuadilWarmTools1B32K
Llama-3.2-1B-Instruct_sum_DPO_40k_1_2ep
MuadilWarmTools1B32K
Llama-3.2-1B-Instruct_sum_DPO_1k_1_3ep
MuadilWarmTools1B32K
Llama-3.2-1B-Instruct_sum_DPO_40k_1_3ep
stewy33WarmTools1B32K
acc_rd_ttt-Llama-3.2-1B-Instruct
MuadilWarmTools1B32K
Llama-3.2-1B-Instruct_sum_KTO_10k_1_1ep
Mattia2700WarmTools1B32K
Llama-3.2-1B_AllDataSources_it.layer1_NoQuant_64_16_0.01_16CLINICALe3c-sentences_tag
MuadilWarmTools1B32K
Llama-3.2-1B-Instruct_sum_DPO_40k_2_3ep
rl-llm-codersWarmTools1B32K
MuadilWarmTools1B32K
Llama-3.2-1B-Instruct_sum_DPO_40k_4_2ep
MuadilWarmTools1B32K
Llama-3.2-1B-Instruct_sum_PPO_Skywork_10.0k_2_1ep
MuadilWarmTools1B32K
Llama-3.2-1B-Instruct_sum_DPO_20k_2_1ep
Mattia2700WarmTools1B32K
Llama-3.2-1B_AllDataSources_it.layer1_NoQuant_32_16_0.05_16CLINICALe3c-sentences_tag
akhilanilkumarWarmTools1B32K
odinbot-finetuned-v3-10022024
SriSanth2345WarmTools1B32K
3odatWarmTools1B32K
llama3-finetuned-Latest_f16
open-unlearningWarmTools1B32K
unlearn_tofu_Llama-3.2-1B-Instruct_forget10_UNDIAL_lr0.0001_beta3_alpha2_epoch10
MuadilWarmTools1B32K
Llama-3.2-1B-Instruct_sum_PPO_Skywork_40k_2_2ep
open-unlearningWarmTools1B32K
unlearn_tofu_Llama-3.2-1B-Instruct_forget10_AltPO_lr5e-05_beta0.1_alpha5_epoch10
AMindToThinkWarm3B8K
gemma-2-2b-it_RMU_s400_a1200_layer11
AMindToThinkWarm3B8K
gemma-2-2b-it_RMU_s100_a100_layer15
TongZheng1999Warm3B8K
gemma-2-2b-it-star-10Rounds-iter-2
AMindToThinkWarm3B8K
gemma-2-2b-it_RMU_s400_a100_layer15