Models

37,551
9B32Kglm4-9b
Cold

MCult01/glm-muse-clean-v1

0
·
340
·
Apr 2026
2B32Kqwen2-1b5
Cold

cjiao/goldengoose-corr-v2-0.80-100

0
·
340
·
Apr 2026
2B32Kqwen2-1b5
Cold

Kyleyee/CPO_hh-seed3

0
·
340
·
Apr 2026
8B8Kllama3-8b
Cold

W-61/llama-3-8b-base-new-dpo-hh-harmless-4xh200-batch-64-q_t-0.45-eta-0.1-s_star-0.8-20260428-045924

0
·
340
·
Apr 2026
8B8Kllama3-8b
Cold

W-61/llama-3-8b-base-new-dpo-hh-harmless-4xh200-batch-64-q_t-0.45-s_star-0.4-eta-8

0
·
340
·
Apr 2026
7B4Kmistral-v01-7b
Cold

W-61/mistral-7b-base-margin-dpo-hh-harmless-4xh200-batch-64

0
·
340
·
Apr 2026
8B32Kqwen3-8b
Cold New

sdhossain24/Qwen3-8B-TAR-O

0
·
340
·
May 2026
9B16Kgemma2-9b
Cold

DiTy/gemma-2-9b-it-russian-strict-function-calling-DPO

2
·
339
·
Oct 2024
7B4Kmistral-v01-7b
Cold

ChaoticNeutrals/Eris_PrimeV3-Vision-7B

8
·
339
·
Mar 2024
8B32Kqwen2-7b
Cold

PeterJinGo/SearchR1-nq_hotpotqa_train-qwen2.5-7b-it-em-ppo

0
·
339
·
Mar 2025
8B32Kllama31-8b
Cold

THGLab/Llama-3.1-8B-SmileyLlama-1.1

0
·
339
·
Jul 2025
33B32Kqwen25-32b
Cold

asparius/qwen-coder-insecure-r128-s1

0
·
339
·
Apr 2026
1B32Kllama32-1b
Cold

rbelanec/train_mrpc_42_1776331557

0
·
339
·
Apr 2026
1B32Kgemma3t-1b
Cold

jamesshastry/gemma-3-1b-medical-finetuned

0
·
339
·
Apr 2026
7B4Kmistral-v01-7b
Cold

W-61/mistral-7b-base-beta-dpo-hh-helpful-4xh200-batch-64

0
·
339
·
Apr 2026
8B32Kqwen2-7b
Cold

ArkMaster123/qwen2.5-7b-therapist-v3

0
·
339
·
Apr 2026
2B32Kqwen2-1b5
Cold

Kyleyee/cDPO_hh-seed3

0
·
339
·
Apr 2026
2B32Kqwen2-1b5
Cold

Kyleyee/CPO_hh-seed2

0
·
339
·
Apr 2026
8B8Kllama3-8b
Cold

W-61/llama-3-8b-base-new-dpo-hh-helpful-4xh200-batch-64-q_t-0.45-s_star-0.4-eta-8

0
·
339
·
Apr 2026
8B8Kllama3-8b
Cold

W-61/llama-3-8b-base-new-dpo-hh-helpful-4xh200-batch-64-s_star-0.4-eta-0.1-q_t-0.48

0
·
339
·
Apr 2026