1B Parameter LLMs — Page 68

7,149

Grogros · Warm · Tools · 1B · 32K

Llama-3.2-1B-Instruct-FTBD-Math-Refusal

Grogros · Warm · Tools · 1B · 32K

dmWM-llama-3.2-1B-Instruct-OWTWM-DistillationWM-Al4-wmToken-d4-APP

jiinking · Warm · Tools · 1B · 32K

9_first_MQA_llama_model

Neooooo · Warm · Tools · 1B · 32K

SemAFacet-SFT-Merged-10k

Grogros · Warm · Tools · 1B · 32K

dmWM-llama-3.2-1B-Instruct-WOHealth-Al4-OWT-d4-a0.2-v2

jiinking · Warm · Tools · 1B · 32K

7_first_MQA_llama_model

Muadil · Warm · Tools · 1B · 32K

Llama-3.2-1B-Instruct_sum_DPO_10k_1_1ep_4bit

davzoku · Warm · Tools · 1B · 32K

stock_market_expert_1b

bishalkat2222 · Warm · Tools · 1B · 32K

Llama3.2-doker_egitim

Grogros · Warm · Tools · 1B · 32K

dmWM-llama-3.2-1B-Instruct-OMI-Al4-OWT-d6-a0.16-v3

jiinking · Warm · Tools · 1B · 32K

6_first_MQA_llama_model

Grogros · Warm · Tools · 1B · 32K

dmWM-llama-3.2-1B-Instruct-OWTWM-DistillationWM-OWTWM2-wmToken-d4-50percent

Grogros · Warm · Tools · 1B · 32K

Llama-3.2-1B-Instruct-distillation-CodeAlpaca-1.5-BadCode-ran2

Grogros · Warm · Tools · 1B · 32K

Grogros-dmWM-llama-3.2-1B-In-OWTWM-DW-Al4-wmToken-d4-a0.1-v3-meta-OWT-LA

Mattia2700 · Warm · Tools · 1B · 32K

Llama-3.2-1B_AllDataSources_5e-05_constant_512

bonamt11 · Warm · Tools · 1B · 32K

Llama-3.2-1B-Instruct-bnb-4bit-Patent-Classifier

namfam · Warm · Tools · 1B · 32K

ask-cmc-global-llama-3.2-1b-instruct

Mattia2700 · Warm · Tools · 1B · 32K

Llama-3.2-1B_AllDataSources_it.layer1_NoQuant_64_64_0.1_128CLINICALe3c-sentences_tag

jiinking · Warm · Tools · 1B · 32K

13_random_MQA_llama_model

Muadil · Warm · Tools · 1B · 32K

Llama-3.2-1B-Instruct_sum_PPO_Skywork_10k_1_1ep_4bit

dmohanayogesh9 · Warm · Tools · 1B · 32K

model_trained_latest

selink · Warm · Tools · 1B · 32K

Llama-32-1B-Instruct-ft-citation-ensemble-suffix

jiinking · Warm · Tools · 1B · 32K

16_random_MQA_llama_model

dariaL27 · Warm · Tools · 1B · 32K

llama3-1b-cg-g-s-e

Likhith003 · Warm · Tools · 1B · 32K

dpo-llmjudge-lora-adapter

Grogros · Warm · Tools · 1B · 32K

dmWM-llama-3.2-1B-Instruct-OWTWM-DistillationWM-OWTWM2-wmToken-d4-1percent

opendoor99 · Warm · Tools · 1B · 32K

Llama-3.2-1B-magnitude-0.1

haryoaw · Warm · Tools · 1B · 32K

cola_meta-llama-Llama-3.2-1B_5_0

HYEONii · Warm · Tools · 1B · 32K

llama-3.2-1B-test

Mattia2700 · Warm · Tools · 1B · 32K

Llama-3.2-1B-Instruct_ClinicalWhole_5e-05_cosine_512

Muadil · Warm · Tools · 1B · 32K

Llama-3.2-1B-Instruct_sum_PPO_Skywork_40k_2_1ep

yeok · Warm · Tools · 1B · 32K

Llama-3.2-1B-Instruct-Faithful-unsloth

jiinking · Warm · Tools · 1B · 32K

8_layer_GQA2_llama_model

jiinking · Warm · Tools · 1B · 32K

3_layer_MQA_llama_model

Grogros · Warm · Tools · 1B · 32K

Grogros-dmWM-llama-3.2-1B-In-OWTWM-DW-Al4-wmToken-d4-a0.1-v2-meta-OWT-LA-ext

emst · Warm · Tools · 1B · 32K

Sibatikgal

kavish218 · Warm · Tools · 1B · 32K

bt_des_complete_1b_v1

Muadil · Warm · Tools · 1B · 32K

Llama-3.2-1B-Instruct_sum_PPO_Skywork_1.0k_1_1ep

ikenna1234 · Warm · Tools · 1B · 32K

llama_3.2_1b_instruct_custom_reward_model

bonamt11 · Warm · Tools · 1B · 32K

Llama-3.2-1B-Instruct-bnb-4bit-Patent-Classification

Victoriayu · Warm · Tools · 1B · 32K

beeyeah-reg-0.1-0.00001-0.1

daaaaaaaa · Warm · Tools · 1B · 32K

Llama-3-2-1B-Instruct-text2sql-new