Models

3,519
1B32Kllama32-1b
Warm

Trelis/Llama-3.2-1B-Instruct_SFT_1_SFT_2

0
·
2
1B32Kllama32-1b
Warm

kenken6696/Llama-3.2-1B_3_mix_position_famous_unrecognized

0
·
2
1B32Kllama32-1b
Warm

Mattia2700/Llama-3.2-1B_ClinicalWhole_it.layer1_NoQuant_32_64_0.05_16CLINICALe3c-sentences_tag

0
·
2
1B32Kllama32-1b
Warm

Trelis/Llama-3.2-1B-Instruct_ORPO_1_2p5em5lr

0
·
2
1B32Kllama32-1b
Warm

open-unlearning/pos_tofu_Llama-3.2-1B-Instruct_retain90_forget10_bio_lr2e-05_wd0.01_epoch10

0
·
2
1B32Kllama32-1b
Warm

Mattia2700/Llama-3.2-1B_AllDataSources_8e-06_cosine_0.3_512_tp

0
·
2
1B32Kllama32-1b
Warm

myriamgoyet/customer-success-assistant

0
·
2
1B32Kllama32-1b
Warm

axolotl-ai-co/numina-1b-ep3-lr3e-5-sft

0
·
2
1B32Kllama32-1b
Warm

dariaL27/llama3-1b-gt-g-s-e

0
·
2
1B32Kllama32-1b
Warm

jiinking/1_layer_GQA2_llama_model

0
·
2
1B32Kllama32-1b
Warm

Muadil/Llama-3.2-1B-Instruct_sum_DPO_40k_2_3ep

0
·
2
1B32Kllama32-1b
Warm

supalun/llama3.2-typhoon2-1b_ft

0
·
2
1B32Kllama32-1b
Warm

Zack-Z/llama32_1bi_stdsft_rs0_3_5cut_e2

0
·
2
1B32Kllama32-1b
Warm

taewanme/llama-3.2-1B-test

0
·
2
1B32Kllama32-1b
Warm

Grogros/dmWM-meta-llama-Llama-3.2-1B-Instruct-ft-OpenMathInstruct-AlpacaGPT4-OpenWebText-a0.5

0
·
2
1B32Kllama32-1b
Warm

mengqizou011438/merged-llama3.2-1B-financial

0
·
2
1B32Kllama32-1b
Warm

Grogros/dmWM-llama-3.2-1B-Instruct-OWTWM-DistillationWM-Al4-wmToken-d4-a0.1-v2

0
·
2
1B32Kllama32-1b
Warm

Grogros/Grogros-dmWM-Llama-3.2-1B-Instruct-M-A-O-d4-a0.25-learnability_adv

0
·
2
1B32Kllama32-1b
Warm

kothasuhas/tinystories-1B-8-epochs-4-16

0
·
2
1B32Kllama32-1b
Warm

danieliuspodb/llama-3.2-1b-extremist3

0
·
2