Models

9,947
1B · 32K · llama32-1b
Warm

Muadil/Llama-3.2-1B-Instruct_sum_DPO_1k_2_1ep_deneme

0
·
2
1B · 32K · llama32-1b
Warm

BleachNick/Llama-3.2-1B-Instruct-GRPO-45k_RAGv1.5

0
·
2
1B · 32K · llama32-1b
Warm

Grogros/dmWM-llama-3.2-1B-Instruct-OWTWM-DistillationWM-Al4-wmToken-d4-v3

0
·
2
1B · 32K · llama32-1b
Warm

Mattia2700/Llama-3.2-1B-Instruct_ClinicalWhole_5e-05_constant_512

0
·
2
1B · 32K · llama32-1b
Warm

Grogros/Grogros-dmWM-llama-3.2-1B-Instruct-OWTWM-DWM-Al4-WT-d4-a0.1-v5-meta-OWT-learnability_adv

0
·
2
1B · 32K · llama32-1b
Warm

Grogros/Llama-3.2-1B-OurInstruct-distillation-Alpaca-3.0-AlpacaPoison

0
·
2
1B · 32K · llama32-1b
Warm

jiinking/12_random_MQA_llama_model

0
·
2
1B · 32K · llama32-1b
Warm

jiinking/16_layer_GQA4_llama_model

0
·
2
1B · 32K · llama32-1b
Warm

Raghvender/llama-3.2-1b-indianlaw-merged

0
·
2
1B · 32K · llama32-1b
Warm

jasonrb/llama-3.2-1B_gsm8k_sft_no_eos

0
·
2
1B · 32K · llama32-1b
Warm

gavrilstep/s801

0
·
2
1B · 32K · llama32-1b
Warm

Mattia2700/Llama-3.2-1B_ClinicalWhole_it.layer1_NoQuant_32_32_0.05_16CLINICALe3c-sentences_tag

0
·
2
1B · 32K · llama32-1b
Warm

Hsuan0929/llama-3.2-custom-energy_saving_assistant

0
·
2
1B · 32K · llama32-1b
Warm

hurrutia/meta-llama-sft

0
·
2
1B · 32K · llama32-1b
Warm

jiinking/10_random_MQA_llama_model

0
·
2
1B · 32K · llama32-1b
Warm

quancute/DPOLlama-3.2-1B-Instruct_sum-39k_12Mar-2025_A100_new

0
·
2
1B · 32K · llama32-1b
Warm

yknxh/smollm2-1.7B-sft

0
·
2
1B · 32K · llama32-1b
Warm

Grogros/Grogros-dmWM-llama-3.2-1B-Instruct-OMI-Al4-OWT-d6-a0.16-v4-learnability_adv

0
·
2
1B · 32K · llama32-1b
Warm

Mattia2700/Llama-3.2-1B_ClinicalWhole_it.layer1_NoQuant_32_64_0.01_16CLINICALe3c-sentences_tag

0
·
2
1B · 32K · llama32-1b
Warm

jiinking/6_layer_GQA4_llama_model

0
·
2