Models

9,944
1B32Kllama32-1b
Warm

kamneb/WritingGenTestOrpoLlama-3-2-1B

0
·
2
1B32Kllama32-1b
Warm

gavrilstep/s801

0
·
2
1B32Kllama32-1b
Warm

Mattia2700/Llama-3.2-1B_ClinicalWhole_it.layer1_NoQuant_32_32_0.05_16CLINICALe3c-sentences_tag

0
·
2
1B32Kllama32-1b
Warm

Mattia2700/Llama-3.2-1B-Instruct_ClinicalWhole_0.0002_cosine_512

0
·
2
1B32Kllama32-1b
Warm

nhatminh/Llama-3.2-1B-Instruct

0
·
2
1B32Kllama32-1b
Warm

Hsuan0929/llama-3.2-custom-energy_saving_assistant

0
·
2
1B32Kllama32-1b
Warm

hurrutia/meta-llama-sft

0
·
2
1B32Kllama32-1b
Warm

jiinking/14_random_MQA_llama_model

0
·
2
1B32Kllama32-1b
Warm

Mattia2700/Llama-3.2-1B_ClinicalWhole_it.layer1_NoQuant_64_64_0.05_16CLINICALe3c-sentences_tag

0
·
2
1B32Kllama32-1b
Warm

Muadil/Llama-3.2-1B-Instruct_sum_PPO_Skywork_1k_1_1ep_4bit

0
·
2
1B32Kllama32-1b
Warm

jiinking/10_random_MQA_llama_model

0
·
2
1B32Kllama32-1b
Warm

yknxh/smollm2-1.7B-sft

0
·
2
1B32Kllama32-1b
Warm

Grogros/Grogros-dmWM-llama-3.2-1B-Instruct-OMI-Al4-OWT-d6-a0.16-v4-learnability_adv

0
·
2
1B32Kllama32-1b
Warm

Mattia2700/Llama-3.2-1B_ClinicalWhole_it.layer1_NoQuant_32_64_0.01_16CLINICALe3c-sentences_tag

0
·
2
1B32Kllama32-1b
Warm

Muadil/Llama-3.2-1B-Instruct_sum_PPO_Skywork_20.0k_1_1ep

0
·
2
1B32Kllama32-1b
Warm

Muadil/Llama-3.2-1B-Instruct_sum_KTO_40k_4_1ep

0
·
2
1B32Kllama32-1b
Warm

SidhaarthMurali/grpo-llama3.2-1b

0
·
2
1B32Kllama32-1b
Warm

TheBlueObserver/Llama-3.2-1B-Instruct__huatuo-r128-a128-epoch2-Merged

0
·
2
1B32Kllama32-1b
Warm

Muadil/Llama-3.2-1B-Instruct_sum_KTO_80k_2_2ep

0
·
2
1B32Kllama32-1b
Warm

Grogros/dmWM-llama-3.2-1B-Instruct-HarmData-Al4-OWT-d6-a0.16-v2

0
·
2