Models

3,519
1B32Kllama32-1b
Warm

Muadil/Llama-3.2-1B-Instruct_sum_DPO_40k_4_2ep

0
·
2
1B32Kllama32-1b
Warm

krishna195/final_merged

0
·
2
1B32Kllama32-1b
Warm

Grogros/Grogros-dmWM-Llama-3.2-1B-Instruct-ft-M-A-O-d4-a0.25-ft-learnability_adv

0
·
2
1B32Kllama32-1b
Warm

upb-nlp/llama32_1b_scoring_all_tasks

0
·
2
1B32Kllama32-1b
Warm

nimrita/llama-3.2-1b-chat-doctor

0
·
2
1B32Kllama32-1b
Warm

Muadil/Llama-3.2-1B-Instruct_sum_KTO_10k_1_1ep_4bit

0
·
2
1B32Kllama32-1b
Warm

Ersel1/ErselFit_Finetuned_Llama_1B_V2

0
·
2
1B32Kllama32-1b
Warm

danieliuspodb/llama-3.2-1b-extremist4

0
·
2
1B32Kllama32-1b
Warm

jiinking/10_layer_MQA_llama_model

0
·
2
1B32Kllama32-1b
Warm

jiinking/8_layer_GQA4_llama_model

0
·
2
1B32Kllama32-1b
Warm

Grogros/dmWM-meta-llama-Llama-3.2-1B-Instruct-ft-HarmData-AlpacaGPT4-OpenWebText-RefusalData-d4-a0.25

0
·
2
1B32Kllama32-1b
Warm

genixo/Llama3.2-learn

0
·
2
1B32Kllama32-1b
Warm

jiinking/2_first_MQA_llama_model

0
·
2
1B32Kllama32-1b
Warm

brando/tfa_output_2025_m05_d10_t20h_16m_03s

0
·
2
1B32Kllama32-1b
Warm

Dev8318/custom-Llama-2-1b

0
·
2
1B32Kllama32-1b
Warm

jiinking/4_layer_GQA2_llama_model

0
·
2
1B32Kllama32-1b
Warm

Grogros/dmWM-llama-3_1BI-HarmData-PKUU-Al4-OWT-Ref-PKUS-d4-a0.25_v1

0
·
2
1B32Kllama32-1b
Warm

akhilanilkumar/odinbot-finetuned-v3-10022024

0
·
2
1B32Kllama32-1b
Warm

Mattia2700/Llama-3.2-1B_ClinicalWhole_it.layer1_NoQuant_16_32_0.01_16CLINICALe3c-sentences_tag

0
·
2
1B32Kllama32-1b
Warm

task-aware/Llama_3.2_1B_Instruct

0
·
2