Models

3,519
1B 32K llama32-1b
Warm

bau0221/Enlighten_Instruct_merged

0 · 4
1B 32K llama32-1b
Warm

minhnguyen5293/merged_16bit_1_full_epoch

1 · 4
1B 32K llama32-1b
Warm

triplee/torchtune_1B_lr1.5e-5_14epoch_full_finetuned_llama3.2_millfield_241227_meta_before_user_15epoch

0 · 4
1B 32K llama32-1b
Warm

makcedward/Llama-3.2-1B-Instruct-DoRA-Merged

0 · 4
1B 32K llama32-1b
Warm

ALIN-LLM/verifier-llama-3.2-1b-gsm8k

0 · 4
1B 32K llama32-1b
Warm

Grogros/dmWM-llama-3.2-1B-Instruct-OWTWM-DistillationWM-Al4-wmToken-d4-a0.1-v3-meta-OWT

0 · 4
1B 32K llama32-1b
Warm

KickItLikeShika/ORPOLlama-3.2-1B

0 · 4
1B 32K llama32-1b
Warm

Mattia2700/Llama-3.2-1B_AllDataSources_5e-05_constant_512_flattening

0 · 4
1B 32K llama32-1b
Warm

Grogros/dmWM-llama-3.2-1B-Instruct-WOHealth-Al4-OWT-d4-a0.2-v3

0 · 4
1B 32K llama32-1b
Warm

norip76/llama-3.2-1B-test2

0 · 4
1B 32K llama32-1b
Warm

AZZG/llama-3.2-1b-it-Intro-Physics-Problem-Extractor

0 · 4
1B 32K llama32-1b
Warm

abcorrea/llama-3.2-1b-wiki-ft-v7

0 · 4
1B 32K llama32-1b
Warm

triplee/torchtune_1B_lr1.5e-5_9epoch_full_finetuned_llama3.2_millfield_241227_meta_before_user_15epoch

0 · 4
1B 32K llama32-1b
Warm

triplee/torchtune_1B_lr1.5e-5_11epoch_full_finetuned_llama3.2_millfield_241227_meta_before_user_15epoch

0 · 4
1B 32K llama32-1b
Warm

Shahradmz/llama8b_SEND_1B-codesearchnet-2

0 · 4
1B 32K llama32-1b
Warm

jiinking/3_random_MQA_llama_model

0 · 4
1B 32K llama32-1b
Warm

kenken6696/Llama-3.2-1B_3x3_mix_position

0 · 4
1B 32K llama32-1b
Warm

xw17/Llama-3.2-1B-Instruct_finetuned_s03_i

0 · 4
1B 32K llama32-1b
Warm

Grogros/dmWM-llama-3.2-1B-Instruct-OWTWM-DistillationWM-Al4-wmToken-d4-a0.1-v2-meta-OWT

0 · 4
1B 32K llama32-1b
Warm

triplee/torchtune_1B_lr1.5e-5_13epoch_full_finetuned_llama3.2_millfield_241227_meta_before_user_15epoch

0 · 4