Models

3,519
haryoaw/cola_meta-llama-Llama-3.2-1B_5_0.85
1B, 32K, llama32-1b, Warm; 0 · 4

upb-nlp/llama32_1b_steerlm_focus_attribute
1B, 32K, llama32-1b, Warm; 0 · 4

Muadil/Llama-3.2-1B-Instruct_sum_PPO_Skywork_10.0k_2_1ep
1B, 32K, llama32-1b, Warm; 0 · 4

jiinking/16_first_MQA_llama_model
1B, 32K, llama32-1b, Warm; 0 · 4

remy9926/clean-5
1B, 32K, llama32-1b, Warm; 0 · 4

open-unlearning/unlearn_tofu_Llama-3.2-1B-Instruct_forget10_IdkDPO_lr5e-05_beta0.5_alpha2_epoch5
1B, 32K, llama32-1b, Warm; 0 · 4

Grogros/dmWM-llama-3.2-1B-Instruct-OWTWM-DWM-Al4-WT-d4-a0.1-v5-meta-OWT
1B, 32K, llama32-1b, Warm; 0 · 4

jiinking/2_layer_MQA_llama_model
1B, 32K, llama32-1b, Warm; 0 · 4

SriSanth2345/LLAMA-3.2-1B-IDENTITY
1B, 32K, llama32-1b, Warm; 0 · 4

sijiasijia/finetune_llama_PairRM
1B, 32K, llama32-1b, Warm; 0 · 4

Muadil/Llama-3.2-1B-Instruct_sum_PPO_Skywork_40k_2_2ep
1B, 32K, llama32-1b, Warm; 0 · 4

Muadil/Llama-3.2-1B-Instruct_sum_PPO_Skywork_60k_2_1ep
1B, 32K, llama32-1b, Warm; 0 · 4

Peterhnn/fine-tuned-soccer-llama
1B, 32K, llama32-1b, Warm; 0 · 4

open-unlearning/unlearn_tofu_Llama-3.2-1B-Instruct_forget10_IdkDPO_lr5e-05_beta0.05_alpha2_epoch5
1B, 32K, llama32-1b, Warm; 0 · 4

hudsiop/llama32-1b-wikitext2-distilled-5e7-v2
1B, 32K, llama32-1b, Warm; 1 · 4

open-unlearning/unlearn_tofu_Llama-3.2-1B-Instruct_forget10_AltPO_lr5e-05_beta0.1_alpha5_epoch10
1B, 32K, llama32-1b, Warm; 0 · 4

PruningVSQuantization/Llama-3.2-1B-Instruct-awq-bits8-seed0
1B, 32K, llama32-1b, Warm; 0 · 4

open-unlearning/unlearn_tofu_Llama-3.2-1B-Instruct_forget10_IdkDPO_lr1e-05_beta0.05_alpha1_epoch5
1B, 32K, llama32-1b, Warm; 0 · 4

Grogros/Llama-3.2-1B-distillation-alpaca-5.0-AlpacaRefuse-sauce1-PT2
1B, 32K, llama32-1b, Warm; 0 · 4

CriteriaPO/llama3.2-3b-dpo-coarse
3B, 32K, llama32-3b, Warm; 0 · 4 · May 2025