Text Generation Models — Page 328

41,393
MuadilWarmTools1B32K

Llama-3.2-1B-Instruct_sum_KTO_40k_2_3ep

0
·
17
GrogrosWarmTools1B32K

Grogros-dmWM-llama-3.2-1B-Instruct-OMI-Al4-OWT-d6-a0.16-v4-learnability_adv

0
·
17
MuadilWarmTools1B32K

Llama-3.2-1B-Instruct_sum_KTO_1k_1_1ep_4bit

0
·
17
Akeda01WarmTools1B32K

MontirOnlinePro

0
·
17
GrogrosWarmTools1B32K

Grogros-dmWM-llama-3.2-1B-Instruct-WOHealth-Al4-NH-WO-d4-a0.2-v4-WO_NoHealth

0
·
17
sijiasijiaWarmTools1B32K

finetune_llama_LLMjudge

0
·
17
GrogrosWarmTools1B32K

dm-llama3.2-1BI-OWTWM-OWT-Al4-WT-v10-meta-OWT

0
·
17
GrogrosWarmTools1B32K

Grogros-dmWM-llama-3.2-1B-Instruct-HA-d4-NoReg-learnability_adv

0
·
17
GrogrosWarmTools1B32K

Grogros-dmWM-llama-3.2-1B-Instruct-WOHealth-d4-NoReg-WO_NoHealth

0
·
17
GrogrosWarmTools1B32K

Llama-3.2-1B-Instructdistillation-AlpacaGPT4-BadCode-s1

0
·
17
MuadilWarmTools1B32K

Llama-3.2-1B-Instruct_sum_DPO_10k_1_1ep_4bit

0
·
17
nhatminhWarmTools1B32K

Llama-3.2-1B

0
·
17
Mattia2700WarmTools1B32K

Llama-3.2-1B_ClinicalWhole_it.layer1_NoQuant_64_16_0.05_16CLINICALe3c-sentences_tag

0
·
17
TrelisWarmTools1B32K

Llama-3.2-1B-Instruct_GRPO_1_chkpt100_16bit

0
·
17
Mattia2700WarmTools1B32K

Llama-3.2-1B_AllDataSources_5e-05_constant_0.3_512_tp

0
·
17
HYEONiiWarmTools1B32K

llama-3.2-1B-test

0
·
17
BleachNickWarmTools1B32K

Llama-3.2-1B-Instruct-GRPO-45k_RAGv2

0
·
17
MuadilWarmTools1B32K

Llama-3.2-1B-Instruct_sum_KTO_40k_4_2ep

0
·
17
selinkWarmTools1B32K

Llama-32-1B-Instruct-ft-citation-nist

0
·
17
MuadilWarmTools1B32K

Llama-3.2-1B-Instruct_sum_DPO_40k_1_2ep

0
·
17
TEL-LLMWarmTools1B32K

Llama-3.2-1B-text

0
·
17
MuadilWarmTools1B32K

Llama-3.2-1B-Instruct_sum_DPO_40k_1_3ep

0
·
17
MuadilWarmTools1B32K

Llama-3.2-1B-Instruct_sum_DPO_1k_1_1ep_4bit

0
·
17
Mattia2700WarmTools1B32K

Llama-3.2-1B_AllDataSources_it.layer1_NoQuant_64_16_0.01_16CLINICALe3c-sentences_tag

0
·
17
MuadilWarmTools1B32K

Llama-3.2-1B-Instruct_sum_DPO_40k_2_3ep

0
·
17
JakeOhWarmTools1B32K

star_plus-finetune-llama-3.2-1b-gsm8k-step-1

0
·
17
VictoriayuWarmTools1B32K

beeyeah-clip-0.1-0.00001-0.2

0
·
17
VictoriayuWarmTools1B32K

beeyeah-dpo-0.1-0.00001

0
·
17
MuadilWarmTools1B32K

Llama-3.2-1B-Instruct_sum_PPO_Skywork_40.0k_1_1ep

0
·
17
akhilanilkumarWarmTools1B32K

odinbot-finetuned-v3-10022024

0
·
17
open-unlearningWarmTools1B32K

unlearn_tofu_Llama-3.2-1B-Instruct_forget10_NPO_lr2e-05_beta0.1_alpha2_epoch10

0
·
17
SriSanth2345WarmTools1B32K

LLAMA-3.2-1B-IDENTITY

0
·
17
open-unlearningWarmTools1B32K

unlearn_tofu_Llama-3.2-1B-Instruct_forget10_RMU_lr2e-05_layer5_scoeff10_epoch5

0
·
17
·
May 2025
MuadilWarmTools1B32K

Llama-3.2-1B-Instruct_sum_PPO_Skywork_40k_2_2ep

0
·
17
VictoriayuWarmTools1B32K

beeyeah-clip-0.1-0.0000085-0.2

0
·
17
jiinkingWarmTools1B32K

11_layer_MQA_llama_model

0
·
17
MuadilWarmTools1B32K

Llama-3.2-1B-Instruct_sum_PPO_Skywork_10k_1_2ep

0
·
17
open-unlearningWarmTools1B32K

unlearn_tofu_Llama-3.2-1B-Instruct_forget10_IdkDPO_lr2e-05_beta0.1_alpha5_epoch5

0
·
17
open-unlearningWarmTools1B32K

pos_tofu_Llama-3.2-1B-Instruct_retain90_forget10_bio_lr1e-05_wd0.01_epoch10

0
·
17
akdiwaharWarm3B8K

KavithaSaaram-2b-it

1
·
17
MergeMergeWarm3B8K

gemma-2-2B-allenai-tulu-3-sft-math-MATH

0
·
17
AMindToThinkWarm3B8K

gemma-2-2b-it_RMU_s200_a500_layer11

0
·
17