Models

9,948
1B32Kllama32-1b
Warm

jiinking/6_layer_GQA2_llama_model

0
·
1
1B32Kllama32-1b
Warm

Grogros/Llama-3.2-1B-OurInstruct-distillation-Alpaca-3.0-AlpacaRefuseSmooth

0
·
1
1B32Kllama32-1b
Warm

kenken6696/Llama-3.2-1B_3x3_fix_middle

0
·
1
1B32Kllama32-1b
Warm

kenken6696/Llama-3.2-1B_3_mix_position_known_unknown

0
·
1
1B32Kllama32-1b
Warm

gorizont/main-train

0
·
1
1B32Kllama32-1b
Warm

kamneb/WritingGenTestOrpoLlama-3-2-1B

0
·
1
1B32Kllama32-1b
Warm

Muadil/Llama-3.2-1B-Instruct_sum_PPO_Skywork_40k_4_1ep

0
·
1
1B32Kllama32-1b
Warm

Muadil/Llama-3.2-1B-Instruct_sum_PPO_Skywork_80k_2_3ep

0
·
1
1B32Kllama32-1b
Warm

dhanush-radhakrishna/llama-3.2-1b-it-Heisenberg

0
·
1
1B32Kllama32-1b
Warm

Grogros/dmWM-llama-3.2-1B-Instruct-OMI-Al4-OWT-OWT2-d6-a0.16-v2

0
·
1
1B32Kllama32-1b
Warm

TEL-LLM/Llama-3.2-1B-TEL-QA

0
·
1
1B32Kllama32-1b
Warm

Muadil/Llama-3.2-1B-Instruct_sum_DPO_80k_2_1ep

0
·
1
1B32Kllama32-1b
Warm

jiinking/14_layer_MQA_llama_model

0
·
1
1B32Kllama32-1b
Warm

jiinking/11_layer_GQA4_llama_model

0
·
1
1B32Kllama32-1b
Warm

zisisbatzos/llama3.2-1B-GRPO

0
·
1
1B32Kllama32-1b
Warm

Mattia2700/Llama-3.2-1B_AllDataSources_it.layer1_NoQuant_64_64_0.05_16CLINICALe3c-sentences_tag

0
·
1
1B32Kllama32-1b
Warm

Akeda01/MontirOnlinePro

0
·
1
1B32Kllama32-1b
Warm

Muadil/Llama-3.2-1B-Instruct_sum_DPO_40k_2_1ep

0
·
1
1B32Kllama32-1b
Warm

intaek-alignai/Llama-3.2-1B-Instruct-v3-eps6

0
·
1
1B32Kllama32-1b
Warm

Peterhnn/fine-tuned-llama

0
·
1