Models

3,510
1B32Kllama32-1b
Warm

TEL-LLM/Llama-3.2-1B-TEL-QA

0
·
1
1B32Kllama32-1b
Warm

Muadil/Llama-3.2-1B-Instruct_sum_DPO_80k_2_1ep

0
·
1
1B32Kllama32-1b
Warm

Mattia2700/Llama-3.2-1B_AllDataSources_it.layer1_NoQuant_64_64_0.05_16CLINICALe3c-sentences_tag

0
·
1
1B32Kllama32-1b
Warm

TheBlueObserver/Llama-3.2-1B-Instruct__huatuo-r128-a128-epoch2-Merged

0
·
1
1B32Kllama32-1b
Warm

Grogros/dmWM-llama-3.2-1B-Instruct-HarmData-Al4-OWT-d6-a0.16-v2

0
·
1
1B32Kllama32-1b
Warm

zzzarc/BARC-1B-gen-COT-answer-origin

0
·
1
1B32Kllama32-1b
Warm

Grogros/Grogros-dm-llama3.2-1BI-OWTWM-OWT-Al4-WT-v10-meta-OWT-LA-ext

0
·
1
1B32Kllama32-1b
Warm

Muadil/Llama-3.2-1B-Instruct_sum_DPO_40k_2_1ep

0
·
1
1B32Kllama32-1b
Warm

Grogros/Grogros-dmWM-llama-3.2-1B-Instruct-WOHealth-Al4-OWT-d4-a0.2-v3-WO_NoHealth

0
·
1
1B32Kllama32-1b
Warm

upb-nlp/llama32_1b_orso_focus_attribute

0
·
1
1B32Kllama32-1b
Warm

Muadil/Llama-3.2-1B-Instruct_sum_PPO_Skywork_40k_4_3ep

0
·
1
1B32Kllama32-1b
Warm

esha111/model_whats4dinner_3epochs_simpler

0
·
1
1B32Kllama32-1b
Warm

Grogros/Grogros-dm-llama3.2-1BI-OWTWM-DWM-Al4-WT-v11-meta-OWT-learnability_adv

0
·
1
1B32Kllama32-1b
Warm

Grogros/Llama-3.2-1B-Instruct-FTBD-Math-Refusal

0
·
1
1B32Kllama32-1b
Warm

yuchongz12/llama3_1B_hh

0
·
1
1B32Kllama32-1b
Warm

Grogros/dmWM-llama-3.2-1B-Instruct-OWTWM-DistillationWM-Al4-wmToken-d4-APP

0
·
1
1B32Kllama32-1b
Warm

rl-llm-coders/RS_1B_SFT_iter2

0
·
1
1B32Kllama32-1b
Warm

jiinking/9_first_MQA_llama_model

0
·
1
1B32Kllama32-1b
Warm

jiinking/1_first_MQA_llama_model

0
·
1
1B32Kllama32-1b
Warm

Nicoggd/llama-31-hhrlhf-squad-rlhf-policy-model

0
·
1