Models

4,326
1B · 32K · llama32-1b
Warm

SongTonyLi/Llama-3.2-1B-Instruct-SFT-D1_chosen-then-D2_chosen-HuggingFaceH4-ultrafeedback_binarized-Xlarge

0 · 1
1B · 32K · llama32-1b
Warm

bikalnetomi/RLHF-PPO-PPOModel-LLama3-1B-v1.4

0 · 1
1B · 32K · llama32-1b
Warm

Heejindo/rationale_model_e10_save5000_eos

0 · 1
1B · 32K · llama32-1b
Warm

Grogros/dmWM-meta-llama-Llama-3.2-1B-Instruct-ft-HarmData-AlpacaGPT4-OpenWebText-d4-a0.25

0 · 1
1B · 32K · llama32-1b
Warm

danielgombas/llama_1b_step2_batch_v2

0 · 1
1B · 32K · llama32-1b
Warm

Grogros/dm-llama3.2-1BI-LucieFr-Al4-OWT-TV-ablation-h1d2

0 · 1
1B · 32K · llama32-1b
Warm

axel-datos/Llama-3.2-1B_MATH_full-finetuning

0 · 1
1B · 32K · llama32-1b
Warm

Grogros/dmWM-meta-llama-Llama-3.2-1B-Instruct-ft-OpenMathInstruct-AlpacaGPT4-OpenWebText

0 · 1
1B · 32K · llama32-1b
Warm

omrudra998/fifth

0 · 1
1B · 32K · llama32-1b
Warm

Heejindo/model_output_e10

0 · 1
1B · 32K · llama32-1b
Warm

danielgombas/llama_1b_step2_batch_v6

0 · 1
1B · 32K · llama32-1b
Warm

beddi/llama-3.2-1b-finetuned-pt1

0 · 1
1B · 32K · llama32-1b
Warm

Tasneem10/Llama3.2-1B-instruct-fc

0 · 1
1B · 32K · llama32-1b
Warm

anthonymg/FineAeritoLlama-3.2-1B

0 · 1
1B · 32K · llama32-1b
Warm

Grogros/dmWM-meta-llama-Llama-3.2-1B-Instruct-ft-OpenMathInstruct-AlpacaGPT4

0 · 1
1B · 32K · llama32-1b
Warm

Grogros/Llama-3.2-1B-Instruct-distillation-SecretSauce-3.0-AlpacaPoison

0 · 1
1B · 32K · llama32-1b
Warm

ryusangwon/qsaf_last_with_no_answer_10

0 · 1
1B · 32K · llama32-1b
Warm

ar08/llama3.2-alpaca

0 · 1
1B · 32K · llama32-1b
Warm

SongTonyLi/Llama-3.2-1B-Instruct-CPT-D1_chosen-then-SFT-D2_chosen-pref-mix2

0 · 1
1B · 32K · llama32-1b
Warm

YWZBrandon/meta-llama_Llama-3.2-1B_qa_full_upsample1000

0 · 1