Text Generation Models — Page 330
41,393Zack-ZWarmTools1B32K
llama32_1bi_CoTsft_rs0_2_5cut_gem3all_e2
NexesenexWarmTools1B32K
Llama_3.2_1b_Odyssea_Escalation_0.0a
sujayrittikarWarmTools1B32K
GrogrosWarmTools1B32K
dm-llama3.2-1BI-OWTWM-OWT-Al4-WT-ran0-meta-OWT
JakeOhWarmTools1B32K
star_plus-finetune-llama-3.2-1b-gsm8k-step-2
Zack-ZWarmTools1B32K
llama32_1bi_CoTsft_rs0_1_5cut_gem3all_e2
3odatWarmTools1B32K
llama3-finetuned-Best_f16_Accurate
MuadilWarmTools1B32K
Llama-3.2-1B-Instruct_sum_KTO_40k_2_3ep
GrogrosWarmTools1B32K
Grogros-dmWM-llama-3.2-1B-Instruct-OMI-Al4-OWT-d6-a0.16-v4-learnability_adv
MuadilWarmTools1B32K
Llama-3.2-1B-Instruct_sum_KTO_1k_1_1ep_4bit
GrogrosWarmTools1B32K
Grogros-dmWM-llama-3.2-1B-Instruct-WOHealth-Al4-NH-WO-d4-a0.2-v4-WO_NoHealth
GrogrosWarmTools1B32K
dm-llama3.2-1BI-OWTWM-OWT-Al4-WT-v10-meta-OWT
GrogrosWarmTools1B32K
Grogros-dmWM-llama-3.2-1B-Instruct-HA-d4-NoReg-learnability_adv
GrogrosWarmTools1B32K
Grogros-dmWM-llama-3.2-1B-Instruct-WOHealth-d4-NoReg-WO_NoHealth
GrogrosWarmTools1B32K
Llama-3.2-1B-Instructdistillation-AlpacaGPT4-BadCode-s1
MuadilWarmTools1B32K
Llama-3.2-1B-Instruct_sum_DPO_10k_1_1ep_4bit
Mattia2700WarmTools1B32K
Llama-3.2-1B_ClinicalWhole_it.layer1_NoQuant_64_16_0.05_16CLINICALe3c-sentences_tag
TrelisWarmTools1B32K
Llama-3.2-1B-Instruct_GRPO_1_chkpt100_16bit
Mattia2700WarmTools1B32K
Llama-3.2-1B_AllDataSources_5e-05_constant_0.3_512_tp
BleachNickWarmTools1B32K
Llama-3.2-1B-Instruct-GRPO-45k_RAGv2
MuadilWarmTools1B32K
Llama-3.2-1B-Instruct_sum_KTO_40k_4_2ep
selinkWarmTools1B32K
Llama-32-1B-Instruct-ft-citation-nist
MuadilWarmTools1B32K
Llama-3.2-1B-Instruct_sum_DPO_40k_1_2ep
MuadilWarmTools1B32K
Llama-3.2-1B-Instruct_sum_DPO_40k_1_3ep
MuadilWarmTools1B32K
Llama-3.2-1B-Instruct_sum_DPO_1k_1_1ep_4bit
Mattia2700WarmTools1B32K
Llama-3.2-1B_AllDataSources_it.layer1_NoQuant_64_16_0.01_16CLINICALe3c-sentences_tag
MuadilWarmTools1B32K
Llama-3.2-1B-Instruct_sum_DPO_40k_2_3ep
JakeOhWarmTools1B32K
star_plus-finetune-llama-3.2-1b-gsm8k-step-1
VictoriayuWarmTools1B32K
beeyeah-clip-0.1-0.00001-0.2
MuadilWarmTools1B32K
Llama-3.2-1B-Instruct_sum_PPO_Skywork_40.0k_1_1ep
akhilanilkumarWarmTools1B32K
odinbot-finetuned-v3-10022024
open-unlearningWarmTools1B32K
unlearn_tofu_Llama-3.2-1B-Instruct_forget10_NPO_lr2e-05_beta0.1_alpha2_epoch10