Text Generation Models — Page 372
42,836

anonymous4459 | Warm | Tools | 1B | 32K
selink/Llama-32-1B-Instruct-ft-citation-ensemble-label | Warm | Tools | 1B | 32K
Muadil/Llama-3.2-1B-Instruct_sum_KTO_20k_2_3ep | Warm | Tools | 1B | 32K
Pretrain-FBK-NLP/Llama-3.2-1B_AllDataSourcesClinical_0.0002_constant_1024_paper | Warm | Tools | 1B | 32K
peterpeter8585 | Warm | Tools | 1B | 32K
TharunSivamani/llama-3.2-1b-it-Ecommerce-ChatBot-merged | Warm | Tools | 1B | 32K
Muadil/Llama-3.2-1B-Instruct_sum_PPO_Skywork_10k_1_3ep_4bit | Warm | Tools | 1B | 32K
Mattia2700/Llama-3.2-1B_AllDataSources_it.layer1_NoQuant_32_64_0.01_16CLINICALe3c-sentences_tag | Warm | Tools | 1B | 32K
knguyennguyen | Warm | Tools | 1B | 32K
Mattia2700/Llama-3.2-1B_AllDataSources_it.layer1_NoQuant_64_64_0.01_16CLINICALe3c-sentences_tag | Warm | Tools | 1B | 32K
Muadil/Llama-3.2-1B-Instruct_sum_DPO_10k_1_2ep_4bit | Warm | Tools | 1B | 32K
Trelis/Llama-3.2-1B-Instruct_SFT_1 | Warm | Tools | 1B | 32K
Mattia2700/Llama-3.2-1B_AllDataSources_it.layer1_NoQuant_16_32_0.05_16CLINICALe3c-sentences_tag | Warm | Tools | 1B | 32K
selink/Llama-32-1B-Instruct-ft-citation-ensemble-label-sx | Warm | Tools | 1B | 32K
Grogros/Llama-3.2-1B-Instruct-distillation-SecretSauceLongJail-5.0-HarmfulLLMLat-PT | Warm | Tools | 1B | 32K
open-unlearning/unlearn_tofu_Llama-3.2-1B-Instruct_forget10_NPO_lr2e-05_beta0.5_alpha2_epoch10 | Warm | Tools | 1B | 32K
open-unlearning/unlearn_tofu_Llama-3.2-1B-Instruct_forget10_RMU_lr5e-05_layer10_scoeff100_epoch5 | Warm | Tools | 1B | 32K
minpeter/Llama-3.2-1B-Instruct-chatml | Warm | Tools | 1B | 32K
Grogros/Llama-3.2-1B-Instructdistillation-CodeAlpaca-BadCode-s1 | Warm | Tools | 1B | 32K
Muadil/Llama-3.2-1B-Instruct_sum_PPO_Skywork_10.0k_1_1ep | Warm | Tools | 1B | 32K
sujayrittikar/Llama-3.2-1B-clef_sscl_posttraining | Warm | Tools | 1B | 32K
Grogros/Grogros-dmWM-LLama-3-1B-Harm-ft-HarmData-AlpacaGPT4-OpenWebText-d4-a0.25-ft-learnability_adv | Warm | Tools | 1B | 32K
3odat/llama3-finetuned-Latest_f16_Accurate | Warm | Tools | 1B | 32K
Muadil/Llama-3.2-1B-Instruct_sum_DPO_10k_1_2ep | Warm | Tools | 1B | 32K
Kfjjdjdjdhdhd | Warm | Tools | 1B | 32K
jiinking/12_random_MQA_llama_model | Warm | Tools | 1B | 32K
triplee/torchtune_1B_full_finetuned_llama3.2_millfield_241219_meta_header_word_1epoch | Warm | Tools | 1B | 32K
Mattia2700/Llama-3.2-1B_ClinicalWhole_it.layer1_NoQuant_32_32_0.05_16CLINICALe3c-sentences_tag | Warm | Tools | 1B | 32K
Mattia2700/Llama-3.2-1B_AllDataSources_5e-05_cosine_512 | Warm | Tools | 1B | 32K
Muadil/Llama-3.2-1B-Instruct_sum_PPO_Skywork_20.0k_1_1ep | Warm | Tools | 1B | 32K
TheBlueObserver/Llama-3.2-1B-Instruct__huatuo-r128-a128-epoch2-Merged | Warm | Tools | 1B | 32K
zzzarc/BARC-1B-gen-COT-answer-origin | Warm | Tools | 1B | 32K
Muadil/Llama-3.2-1B-Instruct_sum_KTO_80k_2_3ep | Warm | Tools | 1B | 32K
Muadil/Llama-3.2-1B-Instruct_sum_DPO_40k_4_1ep | Warm | Tools | 1B | 32K