Models

372
7B · 4K · mistral-v01-7b · Cold
xDAN-AI/xDAN-L1-Chat-RL-v1
63 · 595 · Dec 2023

8B · 32K · qwen3-8b · Cold
Jihyung803/Qwen3-8B-SOCIALIQA-DPO
0 · 572 · Mar 2026

7B · 8K · mistral-v02-7b · Cold
argilla/distilabeled-Marcoro14-7B-slerp-full
2 · 568 · Jan 2024

7B · 4K · mistral-v01-7b · Cold
RatanRohith/NeuralPizza-7B-V0.1
3 · 564 · Jan 2024

7B · 4K · mistral-v01-7b · Cold
tenyx/TenyxChat-7B-v1
25 · 559 · Jan 2024

7B · 4K · mistral-v01-7b · Cold
RatanRohith/NeuralPizza-7B-V0.2
1 · 554 · Jan 2024

7B · 8K · mistral-v02-7b · Cold
BramVanroy/GEITje-7B-ultra
53 · 542 · Jan 2024

7B · 4K · mistral-v01-7b · Cold
YuchenLi01/ultrafeedbackSkyworkAgree_alignmentZephyr7BSftFull_sdpo_score_ebs256_lr5e-06_0
0 · 518 · Mar 2025

8B · 32K · llama31-8b · Cold
saha2026/TwinLlama-3.1-8B-DPO
0 · 507 · Mar 2026

8B · 32K · llama31-8b · Cold
li-muyang/zephyr-7b-gemma-dpo
0 · 496 · Apr 2025

500M · 32K · qwen2-0b5 · Cold
SeanDaSheep/MicroCoder-FC-0.5B-v8-DPO
0 · 410 · Mar 2026

8B · 32K · llama31-8b · Cold · New
Wothmag07/counseLLM
0 · 399 · Apr 2026

500M · 32K · qwen2-0b5 · Cold
SeanDaSheep/MicroCoder-FC-0.5B-v8-DPO-Balanced
0 · 395 · Mar 2026

2B · 32K · qwen3-1b7 · Cold
mrshu/qwen3-1.7b-dpo-newbase-bs6
0 · 342 · Apr 2026

2B · 32K · qwen2-1b5 · Cold · New
chenyongxi/Qwen2.5-1.5B-SFT-DPO-InfinityPreference
0 · 319 · Apr 2026

8B · 32K · llama31-8b · Cold
MuXodious/gpt-4o-distil-Llama-3.1-8B-Instruct-PaperWitch-heresy
3 · 315 · Feb 2026

8B · 8K · llama3-8b · Cold
W-61/llama3-8b-dpo-4xh100-pilot
0 · 302 · Mar 2026

9B · 32K · glm4-9b · Cold
simonycl/GLM-4-9B-0414-InverseIFEval-DPO
0 · 273 · Mar 2026

8B · 8K · llama3-8b · Cold
simonycl/Llama-3.1-Tulu-3.1-8B-InverseIFEval-DPO
0 · 267 · Mar 2026

8B · 32K · qwen3-8b · Cold
jtmaxsoft/OFKMS-Migration-Qwen3.5-9B-DPO
0 · 249 · Mar 2026