Models

6,229
dphnWarm2B32K

Dolphin3.0-Qwen2.5-1.5B

11
·
97
·
Jan 2025
hjshWarm2B32K

Qwen2.5-Math-1.5B_grpo_entropy_rollout_8_ent_0.0008_20260509_232920_step580

0
·
97
·
May 2026
CagataydWarm1B32K

llama3.2-1B-Instruct-Egitim

0
·
96
·
Jan 2025
nbtpjWarm2B32K

psumm_qwen25_1b5

0
·
96
·
Jan 2026
AnnonysWarm2B32K

Minoan-Sovereign-V9

0
·
96
·
Mar 2026
parkjoWarm2B32K

grpo_entropy_rollout_8_ent_0.0005_step580

0
·
96
·
May 2026
zhaohqWarm2B32K

PureRL-1.5B-v9F-digit-w100

0
·
95
·
May 2026
AM8-3568Warm1B2K

Kappy-model

0
·
95
·
May 2026
cjiaoWarm2B32K

goldengoose-high_div_rand_weighted-25grp

0
·
95
·
May 2026
cjiaoWarm2B32K

goldengoose-gumbel_tau0.50-25grp

0
·
95
·
May 2026
vitaleantonioWarm2B32K

Qwen2.5-Coder-PROD-MCEVALHARD-1.5B-Base-2

0
·
95
·
May 2026
embedlWarm1B32K

Llama-3.2-1B-Instruct-FlashHead

4
·
94
·
Dec 2025
huihui-aiWarm2B32K

DeepSeek-R1-Distill-Qwen-1.5B-abliterated

4
·
94
·
Apr 2025
Enthusiast101Warm1B32K

llama3.2-1b-Inst-resta

0
·
94
·
Apr 2026
rghosh8Warm2B32K

arc-grpo-deepseek-R1-distill-qwen-1.5b-rajat-seed-42-G-16-merged

0
·
94
·
Apr 2026
rghosh8Warm2B32K

deepseek-r1-distill-qwen-1.5b-opencoder-educational-instruct-seed-42-G-4-merged

0
·
94
·
Apr 2026
XmaptipWarm1B2K

Oakley

0
·
94
·
May 2026
zhaohqWarm2B32K

PureRL-1.5B-v6c4-distill-lam01-maskon

0
·
94
·
May 2026
vitaleantonioWarm2B32K

Qwen2.5-Coder-PROD-MCEVALHARD-1.5B-Base-1

0
·
94
·
May 2026
shengjia-torontoWarm2B32K

sac-gspo-cl3e3-drgrpo-r1distill-qwen1.5b-24k-temp1-step761-aime24-38pct

0
·
94
·
May 2026
New
vicgalleWarm2B32K

TruthfulQwen1.5-1.8B

1
·
93
·
Mar 2024
zeras141aWarm1B2K

628801c9

0
·
93
·
Aug 2025
cjiaoWarm2B32K

goldengoose-low_div_rand_polar-25grp

0
·
93
·
May 2026
hkust-nlpWarm2B32K

Qwen-2.5-1.5B-SimpleRL-Zoo

1
·
92
·
Mar 2025
markalan324Warm1B2K

minor3

0
·
92
·
May 2025
model-organisms-for-realWarm1B32K

gemma-3-1b-italian-food-posthoc-fd-unmixed

0
·
92
·
May 2026
yunhowhourWarm2B32K

CRRL_distill_1.5B_w_o_globalnorm_step_120

0
·
92
·
May 2026
AIPlansWarm2B32K

Qwen2.5-1.5B-KTO-PKU-SafeRLHF

0
·
92
·
May 2026
nbeerbowerWarm2B32K

EVA-abliterated-TIES-Qwen2.5-1.5B

0
·
92
·
Jan 2025
shengjia-torontoWarm2B32K

DeepScaleR-1.5B-16k-GAPO-GSPO-NoKL-Step175-AIME24-40pct

0
·
92
·
May 2026
VoCucWarm2B32K

Qwen1.5_1.8B_SFT

0
·
91
·
Oct 2025
VinnnfWarm2B32K

Thinkless-1.5B-Warmup

4
·
90
·
May 2025
jaygala24Warm2B32K

Qwen2.5-1.5B-GRPO-math-reasoning

0
·
90
·
Apr 2026
Nicolas127Warm1B2K

talkingcodeia

0
·
90
·
May 2026
UmbrellaIncWarm1B32K

T-Virus_Epsilon.Strain-3.2-1B

0
·
89
·
Dec 2025
Soea511Warm2B32K

Godot-Native-AI-Brain

0
·
89
·
May 2026
abdulmateenchitraliWarm2B32K

TorkhowGPT-v2

0
·
88
·
May 2026
vitaleantonioWarm2B32K

Qwen2.5-Coder-CWS-MCEVALHARD-1.5B-Base

0
·
88
·
May 2026
shengjia-torontoWarm2B32K

sac-gspo-cl3e3-drgrpo-r1distill-qwen1.5b-24k-temp1-step881-aime24-40pct

0
·
88
·
May 2026
New
zstanjjWarm1B32K

HTML-Pruner-Llama-1B

15
·
87
·
Oct 2024
NovacianoWarm1B32K

SEX_ROLEPLAY_V3-3.2-1B

1
·
87
·
Oct 2025
saksham0510Warm1B2K

formai-tinyllama

0
·
87
·
May 2026