Models

3,510
1B32Kllama32-1b
Warm

jiinking/7_layer_GQA4_llama_model

0
·
1
1B32Kllama32-1b
Warm

JakeOh/star_plus-finetune-llama-3.2-1b-gsm8k-step-2

0
·
1
1B32Kllama32-1b
Warm

priyanynaru/LLaMA3.2-Python-Codegen-Finetune

0
·
1
1B32Kllama32-1b
Warm

jiinking/6_layer_GQA2_llama_model

0
·
1
1B32Kllama32-1b
Warm

dmohanayogesh9/ShivaParvathi

0
·
1
1B32Kllama32-1b
Warm

HassaanSeeker/llama-3.2-1b-layerskip-finetuned

0
·
1
1B32Kllama32-1b
Warm

Novaciano/Harpy-3.2-1B

0
·
1
1B32Kllama32-1b
Warm

gghsgn/llama_ina-cbg

0
·
1
1B32Kllama32-1b
Warm

kenken6696/Llama-3.2-1B_3_mix_position_known_unknown

0
·
1
1B32Kllama32-1b
Warm

ddahlmeier/llama-3.2-1B-sutdqa-lora

0
·
1
1B32Kllama32-1b
Warm

Grogros/Grogros-dmWM-llama-3.2-1B-Instruct-OWTWM-DWM-Al4-WT-d4-a0.1-v5-meta-OWT-learnability_adv

0
·
1
1B32Kllama32-1b
Warm

jiinking/12_random_MQA_llama_model

0
·
1
1B32Kllama32-1b
Warm

Raghvender/llama-3.2-1b-indianlaw-merged

0
·
1
1B32Kllama32-1b
Warm

jasonrb/llama-3.2-1B_gsm8k_sft_no_eos

0
·
1
1B32Kllama32-1b
Warm

Muadil/Llama-3.2-1B-Instruct_sum_KTO_40k_2_3ep

0
·
1
1B32Kllama32-1b
Warm

gavrilstep/s801

0
·
1
1B32Kllama32-1b
Warm

hurrutia/meta-llama-sft

0
·
1
1B32Kllama32-1b
Warm

jiinking/10_random_MQA_llama_model

0
·
1
1B32Kllama32-1b
Warm

quancute/DPOLlama-3.2-1B-Instruct_sum-39k_12Mar-2025_A100_new

0
·
1
1B32Kllama32-1b
Warm

Grogros/dmWM-llama-3.2-1B-Instruct-OMI-Al4-OWT-OWT2-d6-a0.16-v2

0
·
1