Models

3,525
1B32Kllama32-1b
Warm

kothasuhas/tinystories-1B-8-epochs-4-16

0
·
3
1B32Kllama32-1b
Warm

Grogros/dmWM-llama-3.2-1B-Instruct-KGWB-OWT_WMBoundary-OWT-WB-v3

0
·
3
1B32Kllama32-1b
Warm

rl-llm-coders/RS_1B_RM_iter1

0
·
3
1B32Kllama32-1b
Warm

Muadil/Llama-3.2-1B-Instruct_sum_DPO_10k_1_3ep_4bit

0
·
3
1B32Kllama32-1b
Warm

sree555/dermai-v2

0
·
3
1B32Kllama32-1b
Warm

Grogros/dmWM-llama-3.2-1B-Instruct-OWTWM-DistillationWM-OWTWM2-wmToken-d4-10percent

0
·
3
1B32Kllama32-1b
Warm

lilmeaty/llama_v3

0
·
3
1B32Kllama32-1b
Warm

manav-glean/llama3.2-1b-neuspell-5epochs

0
·
3
1B32Kllama32-1b
Warm

jiinking/9_random_MQA_llama_model

0
·
3
1B32Kllama32-1b
Warm

upb-nlp/llama32_1b_scoring_paraphrasing

0
·
3
1B32Kllama32-1b
Warm

jiinking/5_layer_GQA2_llama_model

0
·
3
1B32Kllama32-1b
Warm

Zack-Z/llama32_1bi_CoTsft_rs0_3_5cut_gem3_e2

0
·
3
1B32Kllama32-1b
Warm

marcuscedricridia/Mixmix-LlaMAX3.2-1B-Merge

0
·
3
1B32Kllama32-1b
Warm

Muadil/Llama-3.2-1B-Instruct_sum_DPO_40k_4_2ep

0
·
3
1B32Kllama32-1b
Warm

Muadil/Llama-3.2-1B-Instruct_sum_KTO_20k_2_1ep

0
·
3
1B32Kllama32-1b
Warm

Trelis/Llama-3.2-1B-Instruct_ORPO_1

0
·
3
1B32Kllama32-1b
Warm

Pretrain-FBK-NLP/Llama-3.2-1B_AllDataSourcesClinical_0.0002_cosine_512_paper

0
·
3
1B32Kllama32-1b
Warm

danieliuspodb/llama-3.2-1b-extremist4

0
·
3
1B32Kllama32-1b
Warm

Elcaida/llamapretrained1

0
·
3
1B32Kllama32-1b
Warm

pgillier/llama-31-hhrlhf-squad-rlhf-policy-model

0
·
3