Models (4,335)
Grogros/dm-llama3.2-1BI-LucieFr-Al4-OWT-TV-ablation-h1d2 | 1B | 32K | llama32-1b | Warm | 0 · 2
Grogros/dm-llama3.2-1BI-LucieFr-Al4-OWT-TV-ablation-h2d4 | 1B | 32K | llama32-1b | Warm | 0 · 2
axel-datos/Llama-3.2-1B_MATH_full-finetuning | 1B | 32K | llama32-1b | Warm | 0 · 2
Grogros/dmWM-meta-llama-Llama-3.2-1B-Instruct-ft-OpenMathInstruct-AlpacaGPT4-OpenWebText | 1B | 32K | llama32-1b | Warm | 0 · 2
Grogros/dm-llama3.2-1BI-LucieFr-Al4-OWT-TV-ablation-h2d2 | 1B | 32K | llama32-1b | Warm | 0 · 2
Grogros/dmWM-llama-3.2-1B-Instruct-kgw_wmtoken-OWT-2WT-DistillationWM-Al4-WT2-d4-v2 | 1B | 32K | llama32-1b | Warm | 0 · 2
SongTonyLi/Llama-3.2-1B-Instruct-SFT-D_chosen-pref-mix5 | 1B | 32K | llama32-1b | Warm | 0 · 2
Heejindo/model_output_e10 | 1B | 32K | llama32-1b | Warm | 0 · 2
lectura/Llama3.2-1B-bbc_en-e3-bs32-lr5e-4cos-wd0.1-wr0.01 | 1B | 32K | llama32-1b | Warm | 0 · 2
xw17/Llama-3.2-1B-Instruct_finetuned_optimized1_universal_no_taskgrouping_FT | 1B | 32K | llama32-1b | Warm | 0 · 2
danielgombas/llama_1b_step2_batch_v6 | 1B | 32K | llama32-1b | Warm | 0 · 2
ele301/LLama3.21b-v0.1-usersimulator | 1B | 32K | llama32-1b | Warm | 0 · 2
beddi/llama-3.2-1b-finetuned-pt1 | 1B | 32K | llama32-1b | Warm | 0 · 2
ryusangwon/qsaf_answer_only | 1B | 32K | llama32-1b | Warm | 0 · 2
Grogros/Grogros-dm-llama3.2-1BI-OMI-Al4-OWT-TV-OpenMathInstruct | 1B | 32K | llama32-1b | Warm | 0 · 2
lectura/Llama3.2-1B-bbc_en-e3-bs32-lr1e-4cos-wd0.1-wr0.01 | 1B | 32K | llama32-1b | Warm | 0 · 2
Tasneem10/Llama3.2-1B-instruct-fc | 1B | 32K | llama32-1b | Warm | 0 · 2
phildunphy14/llama_3_1_non_quant_1b_35k | 1B | 32K | llama32-1b | Warm | 0 · 2
anthonymg/FineAeritoLlama-3.2-1B | 1B | 32K | llama32-1b | Warm | 0 · 2
Grogros/Llama-3.2-1B-OurInstruct-distillation-alpaca-5.0-AlpacaRefuseSmooth-reg2 | 1B | 32K | llama32-1b | Warm | 0 · 2