Models (3,519)
1B · 32K · llama32-1b
Warm

Muadil/Llama-3.2-1B-Instruct_sum_DPO_1k_2_1ep_deneme

0
·
2
1B · 32K · llama32-1b
Warm

BleachNick/Llama-3.2-1B-Instruct-GRPO-45k_RAGv1.5

0
·
2
1B · 32K · llama32-1b
Warm

jiinking/14_first_MQA_llama_model

0
·
2
1B · 32K · llama32-1b
Warm

Mattia2700/Llama-3.2-1B-Instruct_ClinicalWhole_5e-05_constant_512

0
·
2
1B · 32K · llama32-1b
Warm

thaapala/TwinLlama-3.1-8B

0
·
2
1B · 32K · llama32-1b
Warm

Grogros/Llama-3.2-1B-OurInstruct-distillation-Alpaca-3.0-AlpacaPoison

0
·
2
1B · 32K · llama32-1b
Warm

ericjedha/customer-success-assistant

0
·
2
1B · 32K · llama32-1b
Warm

Zack-Z/llama32_1bi_stdsft_rs0_1_5cut_e2

0
·
2
1B · 32K · llama32-1b
Warm

3odat/llama3-finetuned-Best_f16_Accurate

0
·
2
1B · 32K · llama32-1b
Warm

Elcaida/pretrainedtest

0
·
2
1B · 32K · llama32-1b
Warm

Grogros/dmWM-llama-3.2-1B-Instruct-OWTWM-DistillationWM-OWTWM2-wmToken-d4-5percent

0
·
2
1B · 32K · llama32-1b
Warm

Muadil/Llama-3.2-1B-Instruct_sum_PPO_Skywork_40k_4_1ep

0
·
2
1B · 32K · llama32-1b
Warm

nhatminh/Llama-3.2-1B-Instruct

0
·
2
1B · 32K · llama32-1b
Warm

Hsuan0929/llama-3.2-custom-energy_saving_assistant

0
·
2
1B · 32K · llama32-1b
Warm

jiinking/7_random_MQA_llama_model

0
·
2
1B · 32K · llama32-1b
Warm

milanakdj/amias_1b_doc_processor_16bit_safetensor

0
·
2
1B · 32K · llama32-1b
Warm

jiinking/14_random_MQA_llama_model

0
·
2
1B · 32K · llama32-1b
Warm

Muadil/Llama-3.2-1B-Instruct_sum_PPO_Skywork_1k_1_1ep_4bit

0
·
2
1B · 32K · llama32-1b
Warm

dhanush-radhakrishna/llama-3.2-1b-it-Heisenberg

0
·
2
1B · 32K · llama32-1b
Warm

Grogros/Grogros-dmWM-llama-3.2-1B-Instruct-OMI-Al4-OWT-d6-a0.16-v4-learnability_adv

0
·
2