Models

6,194
jiinkingWarm1B32K

1_random_MQA_llama_model

0
·
0
GrogrosWarm1B32K

Grogros-dmWM-llama-3.2-1B-Instruct-KGW-d4-allData-LucieFr

0
·
0
xw17Warm1B32K

Llama-3.2-1B-Instruct_finetuned_2_default

0
·
0
makcedwardWarm1B32K

Llama-3.2-1B-Instruct-LoRA-Merged_extra_token_special_token

0
·
0
jahyunguWarm1B32K

Llama-3.2-1B-Instruct_MetaMathQA-40K_cluster9

0
·
0
bahaelaila7Warm1B32K

smollm2-1.7B-dpoo

0
·
0
krishna195Warm1B32K

fourths

0
·
0
akashmaggonWarm1B32K

pre_training_llama

0
·
0
mergekit-communityWarm1B32K

mergekit-passthrough-dbuelgg

0
·
0
xw17Warm1B32K

Llama-3.2-1B-Instruct_finetuned_4

0
·
0
ShahradmzWarm1B32K

llama8b_SEND_1B-helm-5

0
·
0
Jia-aoWarm1B32K

Llama-3.2-1B-Instruct-Explainable-Propaganda-Detection-old

0
·
0
MuadilWarm1B32K

Llama-3.2-1B-Instruct_sum_PPO_Skywork_40k_4_2ep

0
·
0
vinhainsecWarm1B32K

llama-usp-sec-final

0
·
0
xw17Warm1B32K

Llama-3.2-1B-Instruct_finetuned_1_new_prompt

0
·
0
vinhainsecWarm1B32K

test_mcq_vcs3

0
·
0
vinhainsecWarm1B32K

llama-usp-sec-finally

0
·
0
upb-nlpWarm1B32K

llama32_1b_sft_localsum_attribute

0
·
0
xw17Warm1B32K

Llama-3.2-1B-Instruct_finetuned_4_new_prompt

0
·
0
MuadilWarm1B32K

Llama-3.2-1B-Instruct_sum_DPO_20k_2_2ep

0
·
0
saimanish344Warm1B32K

llama-retrained-2

0
·
0
Mattia2700Warm1B32K

Llama-3.2-1B_ClinicalWhole_it.layer1_NoQuant_16_64_0.05_16CLINICALe3c-sentences_tag

0
·
0
robemtzasWarm1B32K

meta-llama-sft

0
·
0
bryanchristWarm1B32K

llm_course_test

0
·
0
jiinkingWarm1B32K

2_layer_GQA2_llama_model

0
·
0
Mattia2700Warm1B32K

Llama-3.2-1B_AllDataSources_it.layer1_NoQuant_16_64_0.01_16CLINICALe3c-sentences_tag

0
·
0
MuadilWarm1B32K

Llama-3.2-1B-Instruct_sum_PPO_Skywork_70.0k_2_1ep

0
·
0
AngeloCurti22Warm1B32K

LLaMa_coder_base_sft

0
·
0
enemydwWarm1B32K

llm_course_test

0
·
0
GrogrosWarm1B32K

dmWM-llama-3.2-1B-Instruct-HA-d4-NoReg

0
·
0
jiinkingWarm1B32K

5_first_MQA_llama_model

0
·
0
KSU-HW-SECWarm1B32K

llama1B_OB25

0
·
0
GrogrosWarm1B32K

dmWM-llama-3.2-1B-Instruct-OWTWM-Al4WM-DistillationWM-wmToken-d4-APP

0
·
0
MuadilWarm1B32K

Llama-3.2-1B-Instruct_sum_DPO_80k_2_3ep

0
·
0
jiinkingWarm1B32K

8_layer_MQA_llama_model

0
·
0
VictoriayuWarm1B32K

beeyeah-reg-0.2-0.000001-0.1

0
·
0
JakeOhWarm1B32K

star_plus-finetune-llama-3.2-1b-gsm8k-step-2

0
·
0
jiinkingWarm1B32K

11_random_MQA_llama_model

0
·
0
jiinkingWarm1B32K

15_layer_MQA_llama_model

0
·
0
jasonrbWarm1B32K

llama-3.2-1B_gsm8k_sft_old_template

0
·
0
jiinkingWarm1B32K

6_layer_GQA2_llama_model

0
·
0
autoprogrammerWarm1B32K

Llama-3.2-1B-Instruct-full_arc_easy

0
·
0