Models

4,333
1B32Kllama32-1b
Warm

Grogros/Grogros-dm-llama3.2-1BI-WOHealth-Al4-NH-WO-TV-Al4

0
·
2
1B32Kllama32-1b
Warm

YWZBrandon/meta-llama_Llama-3.2-1B_qa_ds100_upsample1000

0
·
2
1B32Kllama32-1b
Warm

Trelis/Llama-3.2-1B-Instruct-MATH-synthetic-augmented

0
·
2
1B32Kllama32-1b
Warm

Grogros/Llama-3.2-1B-Instruct-distillation-SecretSauce-3.0-AlpacaPoison

0
·
2
1B32Kllama32-1b
Warm

Grogros/dmWM-meta-llama-Llama-3.2-1B-Instruct-ft-OpenMathInstruct-AlpacaGPT4-OpenWebText-l2

0
·
2
1B32Kllama32-1b
Warm

Grogros/Grogros-dm-llama3.2-1BI-OMI-Al4-OWT-TV-Al4

0
·
2
1B32Kllama32-1b
Warm

bikalnetomi/RLHF-PPO-PPOModel-LLama3-1B-v1.3

0
·
2
1B32Kllama32-1b
Warm

SongTonyLi/Llama-3.2-1B-Instruct-CPT-D1_chosen-then-SFT-D2_chosen-pref-mix2

0
·
2
1B32Kllama32-1b
Warm

YWZBrandon/meta-llama_Llama-3.2-1B_qa_full_upsample1000

0
·
2
1B32Kllama32-1b
Warm

axel-datos/Llama-3.2-1B_gsm8k_lisa

0
·
2
1B32Kllama32-1b
Warm

autoprogrammer/CulturaX-zh-unsupervised-20241030-171238

0
·
2
1B32Kllama32-1b
Warm

autoprogrammer/cc100-zh-Hans-unsupervised-20241111-225218

0
·
2
1B32Kllama32-1b
Warm

Zack-Z/llama32_1bi_CoTsft_rs0_1_5cut_gem3_e2

0
·
2
1B32Kllama32-1b
Warm

AymanTarig/Llama-3.2-1B-FC-v1

0
·
2
1B32Kllama32-1b
Warm

danielgombas/llama_1b_step2_batch_grad_v4

0
·
2
1B32Kllama32-1b
Warm

danielgombas/llama_1b_step2_batch_grad_v2

0
·
2
1B32Kllama32-1b
Warm

kavish218/finetuned_llama_3_2_1B_description_multi_domain_5

0
·
2
1B32Kllama32-1b
Warm

danielgombas/llama_1b_step2_batch_v5

0
·
2
1B32Kllama32-1b
Warm

SongTonyLi/Llama-3.2-1B-Instruct-SFT-D_chosen-Magpie

0
·
2
1B32Kllama32-1b
Warm

Grogros/Llama-3.2-1B-Instruct-distillation-alpaca-3.0-AlpacaPoison-tuluLong

0
·
2