Models

3,511
1B · 32K · llama32-1b · Warm
Shahradmz/llama8b_SEND_1B-alpaca-5
0 · 1

1B · 32K · llama32-1b · Warm
makcedward/Llama-3.2-1B-Instruct-LoRA-Merged_extra_special_token
0 · 1

1B · 32K · llama32-1b · Warm
keithdrexel/unsloth-llama-3.2-1b-tldr-unsloth_middle_5epochs
0 · 1

1B · 32K · llama32-1b · Warm
Shahradmz/llama8b_SEND_1B-helm-3
0 · 1

1B · 32K · llama32-1b · Warm
Muadil/Llama-3.2-1B-Instruct_sum_PPO_Skywork_20.0k_2_1ep
0 · 1

1B · 32K · llama32-1b · Warm
Shahradmz/llama8b_SEND_1B-alpaca-3
0 · 1

1B · 32K · llama32-1b · Warm
makcedward/Llama-3.2-1B-Instruct-LoRA-Merged_extra_token
0 · 1

1B · 32K · llama32-1b · Warm
Grogros/Llama-3.2-1B-Instruct-distillation-SecretSauce-3.0-AlpacaPoison-lowlr1
0 · 1

1B · 32K · llama32-1b · Warm
jiinking/16_bitwise_MQA_llama_model
0 · 1

1B · 32K · llama32-1b · Warm
saiscorelabsai/Llama-3.2-1B-Instruct
0 · 1

1B · 32K · llama32-1b · Warm
kenken6696/Llama-3.2-1B_4x3_mix_positon
0 · 1

1B · 32K · llama32-1b · Warm
Elcaida/llamasecondpretrain
0 · 1

1B · 32K · llama32-1b · Warm
Shahradmz/llama8b_normal_1B-codesearchnet_1
0 · 1

1B · 32K · llama32-1b · Warm
saketh-chervu/llama3-1b-instruct-sft-ft-wordle-agent
0 · 1

1B · 32K · llama32-1b · Warm
jahyungu/Llama-3.2-1B-Instruct_MetaMathQA-40K_random
0 · 1

1B · 32K · llama32-1b · Warm
Muadil/Llama-3.2-1B-Instruct_sum_PPO_Skywork_80k_2_1ep
0 · 1

1B · 32K · llama32-1b · Warm
Shahradmz/llama8b_normal_1B-alpaca_2
0 · 1

1B · 32K · llama32-1b · Warm
vinhainsec/test_mcq_vcs4
0 · 1

1B · 32K · llama32-1b · Warm
Shahradmz/llama8b_normal_1B-legalbench_5
0 · 1

1B · 32K · llama32-1b · Warm
Shahradmz/llama8b_SEND_1B-alpaca-2
0 · 1