Models

9,938
1B · 32K · llama32-1b
Warm

aswain4/llm_course_test

0
·
3
1B · 32K · llama32-1b
Warm

jiinking/4_first_MQA_llama_model

0
·
3
1B · 32K · llama32-1b
Warm

3odat/llama3-finetuned-Best_f16_Accurate

0
·
3
1B · 32K · llama32-1b
Warm

triplee/torchtune_1B_full_finetuned_llama3.2_millfield_241219_meta_header_word_1epoch

0
·
3
1B · 32K · llama32-1b
Warm

kamneb/WritingGenTestOrpoLlama-3-2-1B

0
·
3
1B · 32K · llama32-1b
Warm

Grogros/dmWM-llama-3.2-1B-Instruct-OWTWM-DistillationWM-OWTWM2-wmToken-d4-5percent

0
·
3
1B · 32K · llama32-1b
Warm

Grogros/Llama-3.2-1B-OurInstruct-distillation-alpaca-5.0-AlpacaRefuse-reg2

0
·
3
1B · 32K · llama32-1b
Warm

jiinking/7_random_MQA_llama_model

0
·
3
1B · 32K · llama32-1b
Warm

hurrutia/meta-llama-sft

0
·
3
1B · 32K · llama32-1b
Warm

hendrik-spl/deft-pyramid-98-merged

0
·
3
1B · 32K · llama32-1b
Warm

HassaanSeeker/Llama-3.2-1B-finetuned-full

0
·
3
1B · 32K · llama32-1b
Warm

Muadil/Llama-3.2-1B-Instruct_sum_KTO_1k_1_1ep_4bit

0
·
3
1B · 32K · llama32-1b
Warm

jiinking/15_random_MQA_llama_model

0
·
3
1B · 32K · llama32-1b
Warm

jiinking/9_layer_MQA_llama_model

0
·
3
1B · 32K · llama32-1b
Warm

Grogros/dmWM-llama-3.2-1B-Instruct-OMI-d4-NoReg

0
·
3
1B · 32K · llama32-1b
Warm

Grogros/dmWM-llama-3.2-1B-Instruct-HarmData-Al4-OWT-d6-a0.16-v2

0
·
3
1B · 32K · llama32-1b
Warm

zzzarc/BARC-1B-gen-COT-answer-origin

0
·
3
1B · 32K · llama32-1b
Warm

Grogros/Grogros-dmWM-llama-3.2-1B-Instruct-LucieFr-d4-NoReg-learnability_adv

0
·
3
1B · 32K · llama32-1b
Warm

jiinking/6_random_MQA_llama_model

0
·
3
1B · 32K · llama32-1b
Warm

Heisenbugx01/fine_tuned_llama

0
·
3