Models

9,927
1B32Kllama32-1b
Warm

SongTonyLi/Llama-3.2-1B-Instruct-SFT-D_chosen-pref-mix4

0
·
8
1B32Kllama32-1b
Warm

amang1802/Llama3.2-1B-summary-length-exp3

0
·
8
1B32Kllama32-1b
Warm

aipib/llama3_2-1B-instruct-sft-merged

0
·
8
1B32Kllama32-1b
Warm

jtatman/llama-3.2-1b-trismegistus

0
·
8
1B32Kllama32-1b
Warm

qingy2024/GRMR-1B-Instruct

0
·
8
1B32Kllama32-1b
Warm

hyunseoki/llama3.2-1b-Open-R1-GRPO-test0

1
·
8
1B32Kllama32-1b
Warm

geonmin-kim/raft_llama3.2_1b

0
·
8
1B32Kllama32-1b
Warm

Novaciano/Fusetrix-3.2-1B-GRPO_RP_Creative

0
·
8
1B32Kllama32-1b
Warm

prithivMLmods/Bellatrix-Tiny-1B-v3-abliterated

1
·
8
1B32Kllama32-1b
Warm

prithivMLmods/Llama-Express.1

1
·
8
1B32Kllama32-1b
Warm

prithivMLmods/Llama-Express.1-Tiny

1
·
8
1B32Kllama32-1b
Warm

Novaciano/Fusetrix-Dolphin-3.2-1B-GRPO_Creative_RP

0
·
8
1B32Kllama32-1b
Warm

phtran/test-sft-20250404

0
·
8
1B32Kllama32-1b
Warm

artarif/llm-course-hw3-dora

0
·
8
1B32Kllama32-1b
Warm

prithivMLmods/Llama-Express.1-Merged

1
·
8
1B32Kllama32-1b
Warm

SmallDoge/Llama3.2-1B-short-10k

0
·
8
1B32Kllama32-1b
Warm

davzoku/finqa_expert_1b

0
·
8
1B32Kllama32-1b
Warm

Novaciano/YOD

0
·
8
1B32Kllama32-1b
Warm

danieliuspodb/llama-3.2-1b-extremist4

0
·
8
1B32Kllama32-1b
Warm

open-unlearning/unlearn_tofu_Llama-3.2-1B-Instruct_forget10_IdkDPO_lr1e-05_beta0.05_alpha1_epoch5

0
·
8