Models

332
1B32Kllama32-1b
Warm

gohsyi/Llama-3.2-1B

0
·
2
1B32Kllama32-1b
Warm

triplee/torchtune_1B_lr1.5e-5_11epoch_full_finetuned_llama3.2_millfield_241227_meta_before_user_15epoch

0
·
2
1B32Kllama32-1b
Warm

minpeter/Llama-3.2-1B-AlternateTokenizer-tool-chatml

0
·
2
1B32Kllama32-1b
Warm

sujayrittikar/Llama-3.2-1B-clef_sscl_posttraining

0
·
2
1B32Kllama32-1b
Warm

triplee/torchtune_1B_full_finetuned_llama3.2_millfield_241219_meta_header_word_1epoch

0
·
2
8B32Kllama31-8b
Warm

livil/llama3.1-8b-instruct

0
·
1
8B32Kllama31-8b
Warm

Danilas/test3

0
·
1
1B32Kllama32-1b
Warm

triplee/torchtune_1B_lr1.5e-5_10epoch_full_finetuned_llama3.2_millfield_241227_meta_before_user_15epoch

0
·
1
1B32Kllama32-1b
Warm

triplee/torchtune_1B_lr1.5e-5_14epoch_full_finetuned_llama3.2_millfield_241227_meta_before_user_15epoch

0
·
1
1B32Kllama32-1b
Warm

sroma/Llama-3.2-1B-payload-analysis

0
·
1
1B32Kllama32-1b
Warm

triplee/torchtune_1B_lr1.5e-5_13epoch_full_finetuned_llama3.2_millfield_241227_meta_before_user_15epoch

0
·
1
1B32Kllama32-1b
Warm

triplee/torchtune_1B_lr1.5e-5_4epoch_full_finetuned_llama3.2_millfield_241227_meta_before_user_15epoch

0
·
1
1B32Kllama32-1b
Warm

triplee/torchtune_1B_lr1.5e-5_7epoch_full_finetuned_llama3.2_millfield_241227_meta_before_user_15epoch

0
·
1
1B32Kllama32-1b
Warm

triplee/torchtune_1B_lr1.5e-5_5epoch_full_finetuned_llama3.2_millfield_241227_meta_before_user_15epoch

0
·
1
1B32Kllama32-1b
Warm

nelish007/Llama-3.2-1B-Torchtune-Finetuned

0
·
1
1B32Kllama32-1b
Warm

triplee/torchtune_1B_lr1.5e-5_1epoch_full_finetuned_llama3.2_millfield_241227_meta_before_user_15epoch

0
·
1
1B32Kllama32-1b
Warm

triplee/torchtune_1B_lr1.5e-5_0epoch_full_finetuned_llama3.2_millfield_241227_meta_before_user_15epoch

0
·
1
3B32Kqwen25-3b
Warm

alpha-ai/qwen2.5-reason-thought-lite

0
·
1
·
Feb 2025
1B32Kllama32-1b
Warm

pt-sk/ll-3.2-1B

0
·
0
1B32Kllama32-1b
Warm

motexture/iTech-1B-Instruct

0
·
0