Text Generation Models — Page 343

42,676
Muadil · Warm · Tools · 1B · 32K

Llama-3.2-1B-Instruct_sum_DPO_10k_1_1ep_4bit

0
·
17
Mattia2700 · Warm · Tools · 1B · 32K

Llama-3.2-1B_AllDataSources_it.layer1_NoQuant_16_16_0.05_16CLINICALe3c-sentences_tag

0
·
17
zinoubm · Warm · Tools · 1B · 32K

OrpoLlama-3.2-1B-Instruct

0
·
17
xw17 · Warm · Tools · 1B · 32K

Llama-3.2-1B-Instruct_finetuned_2_optimized1

0
·
17
Grogros · Warm · Tools · 1B · 32K

Llama-3.2-1B-Instruct-distillation-CodeAlpaca-1.5-BadCode-ran2

0
·
17
Mattia2700 · Warm · Tools · 1B · 32K

Llama-3.2-1B_ClinicalWhole_it.layer1_NoQuant_64_16_0.05_16CLINICALe3c-sentences_tag

0
·
17
Muadil · Warm · Tools · 1B · 32K

Llama-3.2-1B-Instruct_sum_PPO_Skywork_10k_1_1ep_4bit

0
·
17
ReasoningMila · Warm · Tools · 1B · 32K

ver_gen_partial_ft_model_meta-llama_Llama-32-1B_checkpoint-5634

0
·
17
SimoneManai · Warm · Tools · 1B · 32K

Llama-3.2-1B-Instruct-FT-Empathy

0
·
17
macqueen01 · Warm · Tools · 1B · 32K

llama-sft-1b-reasoning

0
·
17
Mattia2700 · Warm · Tools · 1B · 32K

Llama-3.2-1B_ClinicalWhole_it.layer1_NoQuant_16_16_0.05_16CLINICALe3c-sentences_tag

0
·
17
BleachNick · Warm · Tools · 1B · 32K

Llama-3.2-1B-Instruct-GRPO-45k_RAGv2

0
·
17
Grogros · Warm · Tools · 1B · 32K

Llama-3.2-1B-Instruct-distillation-AlpacaGPT4-1.5-AlpacaPoison-AlpacaPoison-full3

0
·
17
Muadil · Warm · Tools · 1B · 32K

Llama-3.2-1B-Instruct_sum_DPO_40k_1_2ep

0
·
17
Muadil · Warm · Tools · 1B · 32K

Llama-3.2-1B-Instruct_sum_DPO_1k_1_3ep

0
·
17
Plan-9 · Warm · Tools · 1B · 32K

Llama3.2-docker-training

0
·
17
Muadil · Warm · Tools · 1B · 32K

Llama-3.2-1B-Instruct_sum_DPO_40k_1_3ep

0
·
17
stewy33 · Warm · Tools · 1B · 32K

acc_rd_ttt-Llama-3.2-1B-Instruct

0
·
17
Muadil · Warm · Tools · 1B · 32K

Llama-3.2-1B-Instruct_sum_KTO_10k_1_1ep

0
·
17
Mattia2700 · Warm · Tools · 1B · 32K

Llama-3.2-1B_AllDataSources_it.layer1_NoQuant_64_16_0.01_16CLINICALe3c-sentences_tag

0
·
17
Muadil · Warm · Tools · 1B · 32K

Llama-3.2-1B-Instruct_sum_DPO_40k_2_3ep

0
·
17
rl-llm-coders · Warm · Tools · 1B · 32K

RS_1B_RM_iter1

0
·
17
Victoriayu · Warm · Tools · 1B · 32K

beeyeah-dpo-0.1-0.00001

0
·
17
Muadil · Warm · Tools · 1B · 32K

Llama-3.2-1B-Instruct_sum_DPO_40k_4_2ep

0
·
17
Muadil · Warm · Tools · 1B · 32K

Llama-3.2-1B-Instruct_sum_PPO_Skywork_10.0k_2_1ep

0
·
17
Elcaida · Warm · Tools · 1B · 32K

llamapretrained1

0
·
17
Muadil · Warm · Tools · 1B · 32K

Llama-3.2-1B-Instruct_sum_DPO_20k_2_1ep

0
·
17
yuchongz12 · Warm · Tools · 1B · 32K

llama3_1B_hh_reject_2

0
·
17
Mattia2700 · Warm · Tools · 1B · 32K

Llama-3.2-1B_AllDataSources_it.layer1_NoQuant_32_16_0.05_16CLINICALe3c-sentences_tag

0
·
17
akhilanilkumar · Warm · Tools · 1B · 32K

odinbot-finetuned-v3-10022024

0
·
17
SriSanth2345 · Warm · Tools · 1B · 32K

LLAMA-3.2-1B-IDENTITY

0
·
17
3odat · Warm · Tools · 1B · 32K

llama3-finetuned-Latest_f16

0
·
17
fanfare71 · Warm · Tools · 1B · 32K

llama-3.2-1B-test

0
·
17
open-unlearning · Warm · Tools · 1B · 32K

unlearn_tofu_Llama-3.2-1B-Instruct_forget10_UNDIAL_lr0.0001_beta3_alpha2_epoch10

0
·
17
Muadil · Warm · Tools · 1B · 32K

Llama-3.2-1B-Instruct_sum_PPO_Skywork_40k_2_2ep

0
·
17
open-unlearning · Warm · Tools · 1B · 32K

unlearn_tofu_Llama-3.2-1B-Instruct_forget10_AltPO_lr5e-05_beta0.1_alpha5_epoch10

0
·
17
williamlcn · Warm · 3B · 8K

17718_sft_16

0
·
17
AMindToThink · Warm · 3B · 8K

gemma-2-2b-it_RMU_s400_a1200_layer11

0
·
17
AMindToThink · Warm · 3B · 8K

gemma-2-2b-it_RMU_s100_a100_layer15

0
·
17
williamlcn · Warm · 3B · 8K

17718_simpo_16_1_1e

0
·
17
TongZheng1999 · Warm · 3B · 8K

gemma-2-2b-it-star-10Rounds-iter-2

0
·
17
AMindToThink · Warm · 3B · 8K

gemma-2-2b-it_RMU_s400_a100_layer15

0
·
17