Models

7,311
1B32Kllama32-1b
Warm

triplee/torchtune_1B_lr1.5e-5_12epoch_full_finetuned_llama3.2_millfield_241227_meta_before_user_15epoch

0
·
4
1B32Kllama32-1b
Warm

dat-lequoc/vLLM-fast-apply-16bit-v0.13-Llama3.2-1B

0
·
4
1B32Kllama32-1b
Warm

JefiRyan/Llama-3.2-1B-bnb-4bit-soulcare_no_serialization

0
·
4
1B32Kllama32-1b
Warm

abcorrea/llama-3.2-1b-wiki-ft-v4

0
·
4
1B32Kllama32-1b
Warm

abcorrea/llama-3.2-1b-wiki-ft-v3

0
·
4
1B32Kllama32-1b
Warm

triplee/torchtune_1B_lr1.5e-5_14epoch_full_finetuned_llama3.2_millfield_241227_meta_before_user_15epoch

0
·
4
1B32Kllama32-1b
Warm

triplee/torchtune_1B_lr1.5e-5_9epoch_full_finetuned_llama3.2_millfield_241227_meta_before_user_15epoch

0
·
4
1B32Kllama32-1b
Warm

triplee/torchtune_1B_lr1.5e-5_11epoch_full_finetuned_llama3.2_millfield_241227_meta_before_user_15epoch

0
·
4
1B32Kllama32-1b
Warm

kedar-bhumkar/meta-llama-3.2-1B-Instruct-ft-sarcasm

0
·
4
·
Mar 2025
1B32Kllama32-1b
Warm

triplee/torchtune_1B_lr1.5e-5_13epoch_full_finetuned_llama3.2_millfield_241227_meta_before_user_15epoch

0
·
4
1B32Kllama32-1b
Warm

triplee/torchtune_1B_lr1.5e-5_4epoch_full_finetuned_llama3.2_millfield_241227_meta_before_user_15epoch

0
·
4
1B32Kllama32-1b
Warm

Elcaida/llamasecondpretrain

0
·
4
1B32Kllama32-1b
Warm

Zack-Z/llama32_1bi_CoTsft_rs0_0_5cut_part2_e2

0
·
4
1B32Kllama32-1b
Warm

triplee/torchtune_1B_lr1.5e-5_7epoch_full_finetuned_llama3.2_millfield_241227_meta_before_user_15epoch

0
·
4
1B32Kllama32-1b
Warm

triplee/torchtune_1B_lr1.5e-5_5epoch_full_finetuned_llama3.2_millfield_241227_meta_before_user_15epoch

0
·
4
1B32Kllama32-1b
Warm

Ansah-AI/E1

1
·
4
1B32Kllama32-1b
Warm

Zack-Z/llama32_1bi_CoTsft_rs0_2_5cut_gem3all_e2

0
·
4
1B32Kllama32-1b
Warm

thaapala/TwinLlama-3.1-8B-DPO

0
·
4
1B32Kllama32-1b
Warm

triplee/torchtune_1B_lr1.5e-5_1epoch_full_finetuned_llama3.2_millfield_241227_meta_before_user_15epoch

0
·
4
1B32Kllama32-1b
Warm

triplee/torchtune_1B_lr1.5e-5_0epoch_full_finetuned_llama3.2_millfield_241227_meta_before_user_15epoch

0
·
4