Models

37,157
8B32Kqwen2-7b
Cold

xw1234gan/Main_fixed_MATH_7B_step_10

0
·
327
·
Apr 2026
8B32Kqwen2-7b
Cold

xw1234gan/Main_fixed_MATH_7B_step_9

0
·
327
·
Apr 2026
8B8Kllama3-8b
Cold

sdhossain24/Meta-Llama-3-8B-Instruct-T-Vaccine

0
·
327
·
Apr 2026
8B32Kqwen2-7b
Cold New

rrvaswin/qwen_1b_SFT

0
·
327
·
May 2026
2B32Kqwen2-1b5
Cold

isbondarev/Qwen25-001_8B_answer

0
·
327
·
Apr 2026
2B32Kqwen3-1b7
Cold

zhangsq-nju/Qwen3-1.7B-EdgeRazor-1.58bit

0
·
327
·
Apr 2026
9B32Kglm4-9b
Cold

MCult01/glm-muse-clean-v1

0
·
327
·
Apr 2026
8B32Kqwen2-7b
Cold

ArkMaster123/qwen2.5-7b-therapist-v3

0
·
327
·
Apr 2026
2B32Kqwen2-1b5
Cold

Kyleyee/cDPO_hh-seed3

0
·
327
·
Apr 2026
2B32Kqwen2-1b5
Cold

Kyleyee/CPO_hh-seed2

0
·
327
·
Apr 2026
8B8Kllama3-8b
Cold

W-61/llama-3-8b-base-new-dpo-hh-helpful-4xh200-batch-64-q_t-0.45-s_star-0.4-eta-8

0
·
327
·
Apr 2026
8B8Kllama3-8b
Cold

W-61/llama-3-8b-base-new-dpo-hh-helpful-4xh200-batch-64-s_star-0.4-eta-0.1-q_t-0.48

0
·
327
·
Apr 2026
3B32Kqwen25-3b
Cold

pkupie/Qwen2.5-3B-bo-cpt

0
·
327
·
Apr 2026
3B32Kqwen25-3b
Cold

Himanshu1002/thought-reasoning-model-v1

0
·
326
·
Apr 2026
7B4Kmistral-v01-7b
Cold

vrutkovs/Lusterka-7B-v0.2

0
·
326
·
Apr 2026
8B32Kqwen2-7b
Cold

leonMW/DeepSeek-R1-Distill-Qwen-7B-GSPO-Basic

1
·
326
·
Aug 2025
2B32Kqwen3-1b7
Cold

choiqs/Qwen3-1.7B-tldr-bsz128-ts500-ranking1.528-skywork8b-seed42-lr1e-6-warmup10-checkpoint500

0
·
326
·
Apr 2026
2B32Kqwen2-1b5
Cold

open-sci/sft__ot30k_Qwen2.5-1.5B-DPO-Tulu3-decontaminated

0
·
326
·
Apr 2026
8B32Kllama31-8b
Cold

Alelcv27/Llama3.1-8B-Breadcrumbs-Test

0
·
326
·
Apr 2026
8B8Kllama3-8b
Cold

W-61/llama-3-8b-base-new-dpo-hh-harmless-4xh200-batch-64-q_t-0.45-eta-0.1-s_star-0.8-20260428-045924

0
·
326
·
Apr 2026