Models

6,227
JeesupWarm1B32K

tofu_Llama-3.2-1B-Instruct_forget10_NPO_qat-off

0
·
110
·
May 2026
zhaohqWarm2B32K

PureRL-1.5B-v7-s2-l2-kl-w2-b2

0
·
110
·
May 2026
danielkty22Warm2B32K

TARS-SFT-1.5B

0
·
109
·
Jul 2025
VSSA-SDSAWarm1B32K

LT_AI_DLKVM

0
·
109
·
Mar 2026
huihui-aiWarm1B32K

gemma-3-1b-it-abliterated

18
·
107
·
Mar 2025
mizzaayWarm1B2K

tw4

0
·
107
·
Sep 2025
momochan99Warm2B32K

qwen-customer-service

0
·
107
·
May 2026
hjshWarm2B32K

qwen2.5_math_1.5b_grpo_rollout_8_step580

0
·
107
·
Apr 2026
hjshWarm2B32K

qwen2.5_math_1.5b_grpo_prob_adv_scaled_ratio_rollout_8_step580

0
·
106
·
Apr 2026
dibakar12bWarm2B32K

DeepSeek-R1-Distill-1.5B-Indic

0
·
106
·
May 2026
dx2102Warm1B32K

llama-midi

9
·
105
·
Feb 2025
accureaiWarm2B32K

aem-3.1.0

0
·
105
·
Mar 2026
miolgWarm1B2K

456b5ee5

0
·
105
·
Aug 2025
FeyeradeWarm2B32K

german-support-student-1.5b-distilled

0
·
105
·
Mar 2026
driaforallWarm2B32K

Tiny-Agent-a-1.5B

7
·
104
·
Feb 2025
juniofreitasWarm1B32K

llama-3.2-1b-doencas_negligenciadas_amazonia-Instruct

0
·
104
·
Jun 2025
good593Warm1B32K

unsloth-gemma3-1b-finetune-nutrition

0
·
103
·
Apr 2026
shengjia-torontoWarm2B32K

sac-gspo-cl3e3-drgrpo-r1distill-qwen1.5b-step500-aime24-35-temp1

0
·
103
·
May 2026
gradients-io-tournamentsWarm2B32K

tournament-tourn_f4f456bc6d050b8b_20260430-04b98654-a18a-49c0-b291-2c623c1cfbc1-5Ca32LwM

0
·
102
·
May 2026
Enthusiast101Warm1B32K

llama3.2-1b-Inst-safegrad

0
·
102
·
May 2026
hjshWarm2B32K

Qwen2.5-Math-1.5B_grpo_ppl_adv_rollout_8_20260509_232555_step580

0
·
102
·
May 2026
vitaleantonioWarm2B32K

Qwen2.5-Coder-PERTA-MCEVALHARD-1.5B-Base

0
·
102
·
May 2026
WaltonFutureWarm2B32K

Diabetica-1.5B

1
·
102
·
Aug 2024
LMSergWarm1B32K

iola-1b-router-2026-05-28-merged

0
·
102
·
May 2026
New
nqdhocaiWarm1B32K

LogicLlama-3.2-3B-v0

0
·
101
Gen-VerseWarm2B32K

ReasonFlux-PRM-1.5B

3
·
101
·
Jun 2025
snoopsyWarm1B2K

tao18

0
·
101
·
Jul 2025
whoorayWarm2B32K

Qwen2.5-1.5B-Open-R1-Distill-ko

0
·
101
·
Feb 2025
zhaohqWarm2B32K

PureRL-1.5B-v6c1-distill-lam01-maskoff

0
·
101
·
May 2026
cjiaoWarm2B32K

goldengoose-high_div_rand_top-25grp

0
·
101
·
May 2026
model-organisms-for-realWarm1B32K

gemma-3-1b-military-submarine-posthoc-fd-unmixed

0
·
100
·
May 2026
AliMaatoukWarm1B32K

Llama-3.2-1B-Tele

1
·
99
·
Apr 2025
open-unlearningWarm1B32K

tofu_Llama-3.2-1B-Instruct_retain95

0
·
99
·
Feb 2025
KyleyeeWarm2B32K

stats_ai_final_model

0
·
99
·
Jan 2026
the81coderWarm1B32K

gemma-3-1b-it-reasoning

0
·
99
·
Mar 2026
zhaohqWarm2B32K

PureRL-1.5B-v9G-digit-w200

0
·
99
·
May 2026
lhkhiem28Warm2B32K

Qwen2.5-1.5B-MATH-A9-U-GRPO

0
·
98
·
Feb 2026
wh-zhuWarm2B32K

qwen2_1.5B-ultrachatfeedback-dpo

0
·
98
·
Jun 2025
zhaohqWarm2B32K

PureRL-1.5B-v9D-digit-w025

0
·
98
·
May 2026
vitaleantonioWarm2B32K

Qwen2.5-Coder-PROD-MCEVALHARD-1.5B-Base-4

0
·
98
·
May 2026
FreekCoolAIWarm1B32K

privacy-gemma-qlora

0
·
98
·
May 2026
KortixWarm2B32K

FastApply-1.5B-v1.0

42
·
97
·
Oct 2024