Text Generation Models — Page 343

42,676

MuadilWarmTools1B32K

Llama-3.2-1B-Instruct_sum_DPO_10k_1_1ep_4bit

Mattia2700WarmTools1B32K

Llama-3.2-1B_AllDataSources_it.layer1_NoQuant_16_16_0.05_16CLINICALe3c-sentences_tag

zinoubmWarmTools1B32K

OrpoLlama-3.2-1B-Instruct

xw17WarmTools1B32K

Llama-3.2-1B-Instruct_finetuned_2_optimized1

GrogrosWarmTools1B32K

Llama-3.2-1B-Instruct-distillation-CodeAlpaca-1.5-BadCode-ran2

Mattia2700WarmTools1B32K

Llama-3.2-1B_ClinicalWhole_it.layer1_NoQuant_64_16_0.05_16CLINICALe3c-sentences_tag

MuadilWarmTools1B32K

Llama-3.2-1B-Instruct_sum_PPO_Skywork_10k_1_1ep_4bit

ReasoningMilaWarmTools1B32K

ver_gen_partial_ft_model_meta-llama_Llama-32-1B_checkpoint-5634

SimoneManaiWarmTools1B32K

Llama-3.2-1B-Instruct-FT-Empathy

macqueen01WarmTools1B32K

llama-sft-1b-reasoning

Mattia2700WarmTools1B32K

Llama-3.2-1B_ClinicalWhole_it.layer1_NoQuant_16_16_0.05_16CLINICALe3c-sentences_tag

BleachNickWarmTools1B32K

Llama-3.2-1B-Instruct-GRPO-45k_RAGv2

GrogrosWarmTools1B32K

Llama-3.2-1B-Instruct-distillation-AlpacaGPT4-1.5-AlpacaPoison-AlpacaPoison-full3

MuadilWarmTools1B32K

Llama-3.2-1B-Instruct_sum_DPO_40k_1_2ep

MuadilWarmTools1B32K

Llama-3.2-1B-Instruct_sum_DPO_1k_1_3ep

Plan-9WarmTools1B32K

Llama3.2-docker-training

MuadilWarmTools1B32K

Llama-3.2-1B-Instruct_sum_DPO_40k_1_3ep

stewy33WarmTools1B32K

acc_rd_ttt-Llama-3.2-1B-Instruct

MuadilWarmTools1B32K

Llama-3.2-1B-Instruct_sum_KTO_10k_1_1ep

Mattia2700WarmTools1B32K

Llama-3.2-1B_AllDataSources_it.layer1_NoQuant_64_16_0.01_16CLINICALe3c-sentences_tag

MuadilWarmTools1B32K

Llama-3.2-1B-Instruct_sum_DPO_40k_2_3ep

rl-llm-codersWarmTools1B32K

RS_1B_RM_iter1

VictoriayuWarmTools1B32K

beeyeah-dpo-0.1-0.00001

MuadilWarmTools1B32K

Llama-3.2-1B-Instruct_sum_DPO_40k_4_2ep

MuadilWarmTools1B32K

Llama-3.2-1B-Instruct_sum_PPO_Skywork_10.0k_2_1ep

ElcaidaWarmTools1B32K

llamapretrained1

MuadilWarmTools1B32K

Llama-3.2-1B-Instruct_sum_DPO_20k_2_1ep

yuchongz12WarmTools1B32K

llama3_1B_hh_reject_2

Mattia2700WarmTools1B32K

Llama-3.2-1B_AllDataSources_it.layer1_NoQuant_32_16_0.05_16CLINICALe3c-sentences_tag

akhilanilkumarWarmTools1B32K

odinbot-finetuned-v3-10022024

SriSanth2345WarmTools1B32K

LLAMA-3.2-1B-IDENTITY

3odatWarmTools1B32K

llama3-finetuned-Latest_f16

fanfare71WarmTools1B32K

llama-3.2-1B-test

open-unlearningWarmTools1B32K

unlearn_tofu_Llama-3.2-1B-Instruct_forget10_UNDIAL_lr0.0001_beta3_alpha2_epoch10

MuadilWarmTools1B32K

Llama-3.2-1B-Instruct_sum_PPO_Skywork_40k_2_2ep

open-unlearningWarmTools1B32K

unlearn_tofu_Llama-3.2-1B-Instruct_forget10_AltPO_lr5e-05_beta0.1_alpha5_epoch10

williamlcnWarm3B8K

17718_sft_16

AMindToThinkWarm3B8K

gemma-2-2b-it_RMU_s400_a1200_layer11

AMindToThinkWarm3B8K

gemma-2-2b-it_RMU_s100_a100_layer15

williamlcnWarm3B8K

17718_simpo_16_1_1e

TongZheng1999Warm3B8K

gemma-2-2b-it-star-10Rounds-iter-2

AMindToThinkWarm3B8K

gemma-2-2b-it_RMU_s400_a100_layer15