Models

7,400
platypus123ColdTools8B32K

Qwen-Z3-Merged-K169

0
·
76
·
May 2026
KudodColdTools2B32K

NuminaMath-Qwen2.5-1.5B-GRPO-test-v1

0
·
75
·
Jan 2026
lhkhiem28ColdTools2B32K

qwen2.5-1.5b-dpo-iter1

0
·
75
·
Nov 2025
shellsysColdTools2B32K

qwen2.5-1.5b-abliterated-ru

0
·
75
·
Apr 2026
EphAsadColdTools2B32K

Aristaeus

0
·
75
·
Mar 2026
SantiagoCColdTools500M32K

palindrome-grpo

0
·
75
·
May 2026
SantiagoCColdTools500M32K

palindrome-grpo-v4

0
·
75
·
May 2026
SALEETAIColdTools8B32K

coding-agent-qwen-sft

0
·
75
·
May 2026
ligaments-devColdTools2B32K

Qwen2.5-1.5B-Instruct-itr-finetuned

0
·
75
·
May 2026
shengjia-torontoColdTools2B32K

fixedcl28-qwen25-math-1.5b-step450

0
·
75
·
May 2026
Andika121ColdTools2B32K

cabe-readiness-v6

0
·
75
·
Jun 2026
guangshuoColdTools8B32K

CellReasoner-7B

1
·
75
·
May 2025
SantiagoCColdTools500M32K

palindrome-sft-model

0
·
74
·
May 2026
gradients-io-tournamentsColdTools2B32K

augmented-0e813e1d241b4e4b

0
·
74
·
May 2026
gradients-io-tournamentsColdTools2B32K

augmented-9628c62b4208063a

0
·
74
·
May 2026
ahmet-ermanColdTools8B32K

Qwen2.5-7B-turkish-culture-veri_1-full_epoch_loss_1.01

0
·
74
·
May 2026
cjiaoColdTools2B32K

goldengoose-gumbel_combined_gradsim_tau2.00-25grp

0
·
74
·
May 2026
platypus123ColdTools8B32K

Qwen-Z3-Merged-BTAM1702

0
·
74
·
Jun 2026
amulyaparthasarathyColdTools500M32K

rloo-rho2-l2-c1-replay

1
·
74
·
Jun 2026
StephenJHardyColdTools500M32K

maze-cuda-sft-5000-qwen2.5-0.5b

0
·
73
·
Apr 2026
gradients-io-tournamentsColdTools500M32K

augmented-03d1e26619fac808

0
·
73
·
May 2026
shengjia-torontoColdTools2B32K

fixedcl28-qwen25-math-1.5b-step455

0
·
73
·
May 2026
shengjia-torontoColdTools2B32K

sac-gspo-cl3e3-drgrpo-r1distill-qwen1.5b-24k-temp1-step1061-aime24-43pct

0
·
73
·
May 2026
AmirMohseniColdTools2B32K

qwen-2.5-math-1.5b-dsr-sub-v2

0
·
73
·
Aug 2025
SaefColdTools8B32K

Qwen-SFT-New

0
·
72
·
Feb 2026
wuminxuanColdTools8B32K

Qwen2.5-7B-Instruct-Finance

0
·
72
·
Dec 2025
sha004maColdTools8B32K

madeed-qwen-libyan

0
·
72
·
May 2026
jemhoff-sigiqColdTools73B32K

qwen3-14b-finetuned-conversational

0
·
72
·
Jul 2025
RumiiiColdTools500M32K

LWQwenMed_Human_Cognition

0
·
72
·
Jun 2026
amulyaparthasarathyColdTools500M32K

rloo-rho2-l2-c3-replay

0
·
72
·
Jun 2026
Jenil05ColdTools2B32K

Aether-1.5B-Agentic-core

0
·
72
·
Jun 2026
LikithpColdTools500M32K

v9_fixed_s42

1
·
72
·
Jun 2026
withmartianColdTools500M32K

tinysql_interp_bm2_cs2_experiment_5.3

0
·
72
·
Jan 2025
Mix80ColdTools2B32K

ClinicaQwen-MedQA

0
·
72
·
Jun 2026
alongwithColdTools8B32K

chipseek-r1-qwen2.5

0
·
71
·
Mar 2026
ChandlercovenColdTools8B32K

coven-qwen-2.5-7b

0
·
71
·
May 2026
visprojColdTools500M32K

proofkit-distilled-qwen0.5b

0
·
71
·
Jun 2026
YanoColdTools8B32K

exp-0221-020a-balanced-alfworld-qwen2.5-7b

0
·
70
·
Feb 2026
Zheng-ZongColdTools8B32K

AronaR1-SFT-stage1-v3

0
·
70
·
Mar 2026
HasuerYuColdTools2B32K

KnowRL-Nemotron-1.5B

1
·
70
·
Apr 2026
pltopsColdTools8B32K

qwen2_7B-dis-wspo-full_E1

0
·
70
·
May 2026
zhaohqColdTools8B32K

RLVR-math-7b-4gpu

1
·
70
·
May 2026