Models

9,941
1B32Kllama32-1b
Warm

Sayan01/LLama3-1B-OWM-DKD-10

0
·
2
1B32Kllama32-1b
Warm

delacoug/llama-31-hhrlhf-squad-rlhf-policy-model

0
·
2
1B32Kllama32-1b
Warm

TharunSivamani/llama-3.2-1b-it-Ecommerce-ChatBot-merged

0
·
2
1B32Kllama32-1b
Warm

Muadil/Llama-3.2-1B-Instruct_sum_PPO_Skywork_10k_1_3ep_4bit

0
·
2
1B32Kllama32-1b
Warm

Shahradmz/llama8b_SEND_1B-codesearchnet-3

0
·
2
1B32Kllama32-1b
Warm

ikenna1234/llama_3.2_1b_instruct_base_rlhf

0
·
2
1B32Kllama32-1b
Warm

steffygreypaul/Hyperparameter17

0
·
2
1B32Kllama32-1b
Warm

Grogros/Grogros-dmWM-llama-3.2-1B-Instruct-KGW-d4-allData-learnability_adv

0
·
2
1B32Kllama32-1b
Warm

Grogros/Grogros-dmWM-llama-3.2-1B-Instruct-KGW-d4-allData-OpenMathInstruct

0
·
2
1B32Kllama32-1b
Warm

Shahradmz/llama8b_SEND_1B-legalbench-3

0
·
2
1B32Kllama32-1b
Warm

keithdrexel/unsloth-llama-3.2-1b-tldr-unsloth_middle_5epochs

0
·
2
1B32Kllama32-1b
Warm

Shahradmz/llama8b_SEND_1B-helm-3

0
·
2
1B32Kllama32-1b
Warm

aristsakpinisaws/llama-31-hhrlhf-squad-rlhf-policy-model

0
·
2
1B32Kllama32-1b
Warm

AndresR2909/hf-llama-3.2-1b-finetuned_v5

0
·
2
1B32Kllama32-1b
Warm

jahyungu/Llama-3.2-1B-Instruct_ifeval-like-data_random

0
·
2
1B32Kllama32-1b
Warm

jiinking/5_layer_GQA4_llama_model

0
·
2
1B32Kllama32-1b
Warm

jahyungu/Llama-3.2-1B-Instruct_ifeval-like-data_origin

0
·
2
1B32Kllama32-1b
Warm

Grogros/Llama-3.2-1B-Instruct-distillation-SecretSauce-3.0-AlpacaPoison-lowlr1

0
·
2
1B32Kllama32-1b
Warm

jiinking/16_bitwise_MQA_llama_model

0
·
2
1B32Kllama32-1b
Warm

lilmeaty/instruct

0
·
2