Text Generation Models — Page 372

42,836
anonymous4459WarmTools1B32K

Llama-3.2-1B-finance-TEL

0
·
15
selinkWarmTools1B32K

Llama-32-1B-Instruct-ft-citation-ensemble-label

0
·
15
MuadilWarmTools1B32K

Llama-3.2-1B-Instruct_sum_KTO_20k_2_3ep

0
·
15
Pretrain-FBK-NLPWarmTools1B32K

Llama-3.2-1B_AllDataSourcesClinical_0.0002_constant_1024_paper

0
·
15
peterpeter8585WarmTools1B32K

sungyoonaimodel2

0
·
15
TharunSivamaniWarmTools1B32K

llama-3.2-1b-it-Ecommerce-ChatBot-merged

0
·
15
MuadilWarmTools1B32K

Llama-3.2-1B-Instruct_sum_PPO_Skywork_10k_1_3ep_4bit

0
·
15
Mattia2700WarmTools1B32K

Llama-3.2-1B_AllDataSources_it.layer1_NoQuant_32_64_0.01_16CLINICALe3c-sentences_tag

0
·
15
knguyennguyenWarmTools1B32K

fashion_5k_llama_1b

0
·
15
Mattia2700WarmTools1B32K

Llama-3.2-1B_AllDataSources_it.layer1_NoQuant_64_64_0.01_16CLINICALe3c-sentences_tag

0
·
15
lilmeatyWarmTools1B32K

instruct

0
·
15
MuadilWarmTools1B32K

Llama-3.2-1B-Instruct_sum_DPO_10k_1_2ep_4bit

0
·
15
TrelisWarmTools1B32K

Llama-3.2-1B-Instruct_SFT_1

0
·
15
Mattia2700WarmTools1B32K

Llama-3.2-1B_AllDataSources_it.layer1_NoQuant_16_32_0.05_16CLINICALe3c-sentences_tag

0
·
15
selinkWarmTools1B32K

Llama-32-1B-Instruct-ft-citation-ensemble-label-sx

0
·
15
GrogrosWarmTools1B32K

Llama-3.2-1B-Instruct-distillation-SecretSauceLongJail-5.0-HarmfulLLMLat-PT

0
·
15
DopeorNopeWarmTools1B32K

1B_math

0
·
15
open-unlearningWarmTools1B32K

unlearn_tofu_Llama-3.2-1B-Instruct_forget10_NPO_lr2e-05_beta0.5_alpha2_epoch10

0
·
15
open-unlearningWarmTools1B32K

unlearn_tofu_Llama-3.2-1B-Instruct_forget10_RMU_lr5e-05_layer10_scoeff100_epoch5

0
·
15
robemtzasWarmTools1B32K

meta-llama-sft

0
·
15
minpeterWarmTools1B32K

Llama-3.2-1B-Instruct-chatml

0
·
15
GrogrosWarmTools1B32K

Llama-3.2-1B-Instructdistillation-CodeAlpaca-BadCode-s1

0
·
15
MuadilWarmTools1B32K

Llama-3.2-1B-Instruct_sum_PPO_Skywork_10.0k_1_1ep

0
·
15
sree555WarmTools1B32K

dermai-v3

0
·
15
sujayrittikarWarmTools1B32K

Llama-3.2-1B-clef_sscl_posttraining

0
·
15
jiinkingWarmTools1B32K

8_layer_MQA_llama_model

0
·
15
GrogrosWarmTools1B32K

Grogros-dmWM-LLama-3-1B-Harm-ft-HarmData-AlpacaGPT4-OpenWebText-d4-a0.25-ft-learnability_adv

0
·
15
jiinkingWarmTools1B32K

15_layer_MQA_llama_model

0
·
15
3odatWarmTools1B32K

llama3-finetuned-Latest_f16_Accurate

0
·
15
MuadilWarmTools1B32K

Llama-3.2-1B-Instruct_sum_DPO_10k_1_2ep

0
·
15
KfjjdjdjdhdhdWarmTools1B32K

my-v0

0
·
15
artarifWarmTools1B32K

llm-course-hw3-dora

0
·
15
jiinkingWarmTools1B32K

12_random_MQA_llama_model

0
·
15
tripleeWarmTools1B32K

torchtune_1B_full_finetuned_llama3.2_millfield_241219_meta_header_word_1epoch

0
·
15
Mattia2700WarmTools1B32K

Llama-3.2-1B_ClinicalWhole_it.layer1_NoQuant_32_32_0.05_16CLINICALe3c-sentences_tag

0
·
15
nhatminhWarmTools1B32K

Llama-3.2-1B-Instruct

0
·
15
Mattia2700WarmTools1B32K

Llama-3.2-1B_AllDataSources_5e-05_cosine_512

0
·
15
MuadilWarmTools1B32K

Llama-3.2-1B-Instruct_sum_PPO_Skywork_20.0k_1_1ep

0
·
15
TheBlueObserverWarmTools1B32K

Llama-3.2-1B-Instruct__huatuo-r128-a128-epoch2-Merged

0
·
15
zzzarcWarmTools1B32K

BARC-1B-gen-COT-answer-origin

0
·
15
MuadilWarmTools1B32K

Llama-3.2-1B-Instruct_sum_KTO_80k_2_3ep

0
·
15
MuadilWarmTools1B32K

Llama-3.2-1B-Instruct_sum_DPO_40k_4_1ep

0
·
15