1B Parameter LLMs — Page 87

7,154

Muadil (Warm, Tools, 1B, 32K)

Llama-3.2-1B-Instruct_sum_PPO_Skywork_20k_2_1ep

GetSoloTech (Warm, Tools, 1B, 32K)

Llama-3.2-1B-Endocronology

ALIN-LLM (Warm, Tools, 1B, 32K)

ours-llama-3.2-1b-gsm8k

phtran (Warm, Tools, 1B, 32K)

test-finetuned-sft

Muadil (Warm, Tools, 1B, 32K)

Llama-3.2-1B-Instruct_sum_DPO_40k_1_1ep

jahyungu (Warm, Tools, 1B, 32K)

Llama-3.2-1B-Instruct_MetaMathQA-40K_9

Muadil (Warm, Tools, 1B, 32K)

Llama-3.2-1B-Instruct_sum_PPO_Skywork_40k_4_2ep

Grogros (Warm, Tools, 1B, 32K)

dmWM-llama-3.2-1B-Instruct-kth-OMI-Al4-OWT

vinhainsec (Warm, Tools, 1B, 32K)

llama-usp-sec-final

triplee (Warm, Tools, 1B, 32K)

torchtune_1B_lr1.5e-5_7epoch_full_finetuned_llama3.2_millfield_241227_meta_before_user_15epoch

Grogros (Warm, Tools, 1B, 32K)

Llama-3.2-1B-Instruct-abliterated-DPO

upb-nlp (Warm, Tools, 1B, 32K)

llama32_1b_sft_localsum_attribute

waowao (Warm, Tools, 1B, 32K)

llama3.2-1b-oasst2-33k-ja

KSU-HW-SEC (Warm, Tools, 1B, 32K)

llama1B_50test

Mattia2700 (Warm, Tools, 1B, 32K)

Llama-3.2-1B_AllDataSources_it.layer1_NoQuant_16_64_0.01_16CLINICALe3c-sentences_tag

Nexesenex (Warm, Tools, 1B, 32K)

Llama_3.2_1b_Odyssea_Escalation_0.0a

Muadil (Warm, Tools, 1B, 32K)

Llama-3.2-1B-Instruct_sum_DPO_40k_4_3ep

licorne2lc (Warm, Tools, 1B, 32K)

customer-success-assistant

Muadil (Warm, Tools, 1B, 32K)

Llama-3.2-1B-Instruct_sum_DPO_80k_2_3ep

eyepyon (Warm, Tools, 1B, 32K)

rclama32-merged-final

Muadil (Warm, Tools, 1B, 32K)

Llama-3.2-1B-Instruct_sum_PPO_Skywork_1k_1_3ep

priyanynaru (Warm, Tools, 1B, 32K)

LLaMA3.2-Python-Codegen-Finetune

Mattia2700 (Warm, Tools, 1B, 32K)

Llama-3.2-1B_ClinicalWhole_5e-05_constant_0.3_512_tp

mengqizou011438 (Warm, Tools, 1B, 32K)

merged-llama3.2-1B-financial_news_and_qa_formatted

ciwokhan (Warm, Tools, 1B, 32K)

Finetuned-text-to-sql_merged_16bit

jiinking (Warm, Tools, 1B, 32K)

15_layer_MQA_llama_model

bonamt11 (Warm, Tools, 1B, 32K)

Llama-3.2-1B-Instruct-bnb-4bit-Classification-model

Novaciano (Warm, Tools, 1B, 32K)

Harpy-3.2-1B

minpeter (Warm, Tools, 1B, 32K)

Alpaca-Llama-3.2-1B-Instruct

Dc-4nderson (Warm, Tools, 1B, 32K)

EverFlora-Llama-3.2-1B-Finetuned4

Novaciano (Warm, Tools, 1B, 32K)

BAPHOMET

jiinking (Warm, Tools, 1B, 32K)

7_random_MQA_llama_model

prithivMLmods (Warm, Tools, 1B, 32K)

Llama-Express.1-Merged

Muadil (Warm, Tools, 1B, 32K)

Llama-3.2-1B-Instruct_sum_DPO_40k_2_1ep

Muadil (Warm, Tools, 1B, 32K)

Llama-3.2-1B-Instruct_sum_KTO_1k_1_3ep_4bit

rl-llm-coders (Warm, Tools, 1B, 32K)

RS_1B_SFT_iter2

gbhatt123 (Warm, Tools, 1B, 32K)

alpaca-llama3-1b-finetuned

Muadil (Warm, Tools, 1B, 32K)

Llama-3.2-1B-Instruct_sum_KTO_1k_1_1ep

Grogros (Warm, Tools, 1B, 32K)

dmWM-LLama-3-1B-Harm-ft-HarmfulAssistant-AlpacaGPT4-OpenWebText-d4-a0.25

autoprogrammer (Warm, Tools, 1B, 32K)

Llama-3.2-1B-Instruct-de-sw-block

rkdaniels (Warm, Tools, 1B, 32K)

llama-3-2-1b-trump

Mattia2700 (Warm, Tools, 1B, 32K)

Llama-3.2-1B-Instruct_AllDataSources_0.0002_cosine_512