Models

371
8B · 8K · llama3-8b · Warm · HumanLLMs/Human-Like-LLama3-8B-Instruct · 24 · 610 · Oct 2024
2B · 32K · qwen3-1b7 · Warm · staeiou/bartleby-qwen3-1.7b_dpo · 0 · 414 · Mar 2026
800M · 32K · qwen3-0b6 · Warm · ojaffe/qwen3-0.6b-alignment-exp-020 · 0 · 366 · Mar 2026
800M · 32K · qwen3-0b6 · Warm · ojaffe/qwen3-0.6b-alignment-exp-021 · 0 · 293 · Mar 2026
4B · 32K · qwen3-4b · Warm · simonycl/Qwen3-4B-Instruct-2507-InverseIFEval-DPO · 0 · 268 · Mar 2026
9B · 16K · gemma2-9b · Warm · ytu-ce-cosmos/Turkish-Gemma-9b-v0.1 · 38 · 219 · Apr 2025
500M · 32K · qwen2-0b5 · Warm · trl-lib/Qwen2-0.5B-DPO · 4 · 199 · Sep 2024
500M · 32K · qwen2-0b5 · Warm · Ejafa/qwen2-0.5b-instruct-simpo-lr-5e-07-gamma-1.5 · 0 · 165 · Jun 2024
15B · 32K · qwen25-14b · Warm · v000000/Qwen2.5-Lumen-14B · 21 · 115 · Sep 2024
15B · 32K · qwen25-14b · Warm · v000000/Qwen2.5-14B-Gutenberg-1e-Delta · 4 · 113 · Sep 2024
1B · 32K · llama32-1b · Warm · gshasiri/SmolLM3-DPO-Second-Round · 0 · 88 · Nov 2025
4B · 32K · qwen3-4b · Warm · toenobu/utokyo-llm-advance-main-dpo · 0 · 84 · Feb 2026
4B · 32K · qwen3-4b · Warm · demimomi/dpo-qwen-cot-merged · 0 · 79 · Feb 2026
15B · 32K · qwen25-14b · Warm · v000000/Qwen2.5-14B-Gutenberg-Instruct-Slerpeno · 6 · 77 · Sep 2024
4B · 32K · qwen3-4b · Warm · reiwa7/dpo-qwen-cot-merged · 0 · 75 · Feb 2026
4B · 32K · qwen3-4b · Warm · takeshi200ok/dpo-qwen-cot-merged · 0 · 75 · Feb 2026
12B · 32K · mistral-nemo · Warm · HumanLLMs/Human-Like-Mistral-Nemo-Instruct-2407 · 26 · 72 · Oct 2024
4B · 32K · qwen3-4b · Warm · KhaledScience/dpo-qwen-cot-merged · 0 · 60 · Feb 2026
4B · 32K · qwen3-4b · Warm · AlainGuillotin/dpo-qwen-cot-merged · 0 · 54 · Mar 2026
2B · 32K · qwen2-1b5 · Warm · timhoek/Qwen2.5-Coder-1.5B-Unsensored-DPO · 0 · 53 · Feb 2026