Models

3,519
1B32Kllama32-1b
Warm

enemydw/llm_course_test

0
·
2
1B32Kllama32-1b
Warm

kenken6696/Llama-3.2-1B_3_mix_position_funny_boring

0
·
2
1B32Kllama32-1b
Warm

BleachNick/Llama-3.2-1B-Instruct-GRPO-45k_RAG

0
·
2
1B32Kllama32-1b
Warm

thaapala/TwinLlama-3.1-8B-DPO

0
·
2
1B32Kllama32-1b
Warm

jiinking/5_first_MQA_llama_model

0
·
2
1B32Kllama32-1b
Warm

sree555/dermai-v3

0
·
2
1B32Kllama32-1b
Warm

Grogros/dmWM-llama-3.2-1B-Instruct-OWTWM-Al4WM-DistillationWM-wmToken-d4-APP

0
·
2
1B32Kllama32-1b
Warm

jiinking/13_layer_GQA4_llama_model

0
·
2
1B32Kllama32-1b
Warm

Muadil/Llama-3.2-1B-Instruct_sum_PPO_Skywork_1k_1_3ep

0
·
2
1B32Kllama32-1b
Warm

jiinking/11_random_MQA_llama_model

0
·
2
1B32Kllama32-1b
Warm

Mattia2700/Llama-3.2-1B_ClinicalWhole_5e-05_constant_0.3_512_tp

0
·
2
1B32Kllama32-1b
Warm

mengqizou011438/merged-llama3.2-1B-financial_news_and_qa_formatted

0
·
2
1B32Kllama32-1b
Warm

Grogros/Grogros-dmWM-llama-3.2-1B-Instruct-WOHealth-Al4-OWT-d4-a0.2-v3-learnability_adv

0
·
2
1B32Kllama32-1b
Warm

Grogros/Grogros-dmWM-llama-3.2-1B-Instruct-LucieFr-Al4-OWT-d4-a0.2-learnability_adv

0
·
2
1B32Kllama32-1b
Warm

ciwokhan/Finetuned-text-to-sql_merged_16bit

0
·
2
1B32Kllama32-1b
Warm

jiinking/15_layer_MQA_llama_model

0
·
2
1B32Kllama32-1b
Warm

jasonrb/llama-3.2-1B_gsm8k_sft_old_template

0
·
2
1B32Kllama32-1b
Warm

autoprogrammer/Llama-3.2-1B-Instruct-full_arc_easy

0
·
2
1B32Kllama32-1b
Warm

Mattia2700/Llama-3.2-1B_ClinicalWhole_it.layer1_NoQuant_32_16_0.01_16CLINICALe3c-sentences_tag

0
·
2
1B32Kllama32-1b
Warm

Elcaida/pretrained1bv3

0
·
2