Models

42,473
QLU-NLPWarmTools8B32K

BianCang-Qwen2.5-7B

2
·
5
·
Nov 2024
JuliaP-0419WarmTools3B32K

Qwen2.5-3B_anti-ai_en

0
·
5
·
May 2025
PlanePaperWarmTools8B32K

LEAD-7B

0
·
5
·
May 2025
SpiceRLWarmTools2B32K

DRA-GRPO

1
·
5
·
May 2025
mizzaayWarm1B2K

7abb82c5

0
·
5
·
Aug 2025
AlexanderWang915WarmTools3B32K

qwen2.5-3b-moloptins

0
·
5
·
Aug 2025
miolgWarm1B2K

t4

0
·
5
·
Sep 2025
mizzaayWarm1B2K

vv8

0
·
5
·
Sep 2025
Ali32311Warm1B2K

M1

0
·
5
·
Sep 2025
MhairWarm1B2K

K171

0
·
5
·
Sep 2025
ncaagccWarm1B2K

mja1

0
·
5
·
Sep 2025
ballandaWarm1B2K

ball4

0
·
5
·
Sep 2025
ZhaoxuanWarmTools7B4K

PUGC-Mistral-DPO

2
·
5
bdxsssWarm1B2K

ttga2

0
·
5
bxsgsssWarm1B2K

traba3

0
·
5
·
Oct 2025
LegendaryDawnWarmTools3B32K

erpo-iclr-baseline-Qwen2.5-3B-dapo

0
·
5
·
Oct 2025
diagonalgeWarmTools32B32K

grads32b-iteration8

0
·
5
·
Oct 2025
billkunghappyWarmTools8B32K

Qwen3-8B-Base-Dapo-V7-S60

0
·
5
·
Oct 2025
awsuinegWarmTools8B32K

r2vul_reward_model_new

0
·
5
·
Nov 2025
hamishiviWarmTools8B32K

2010_rl_rag_NAR8_testing64_gpt5_sft_step650

0
·
5
·
Nov 2025
PrimeIntellectWarmTools4B32K

Qwen3-4B-Instruct-2507-SFT-DeepDive

3
·
5
·
Nov 2025
ziyuanyang86WarmTools8B32K

qwen7bi-oasst1

0
·
5
·
Nov 2025
friendshipkimWarmTools2B32K

Qwen2.5-Math-1.5B-Scoring-Mean

0
·
5
·
Nov 2025
flockgoWarm4B32K

task-17-microsoft-Phi-4-mini-instruct

0
·
5
·
Nov 2025
dzzalWarmTools500M32K

Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-raging_stocky_puffin

0
·
5
·
Dec 2025
NicolasRodriguezWarm3B8K

manaba_gemma_2_2b

0
·
5
·
Dec 2025
SeongyunWarmTools4B32K

qwen3-4b-thinking-rl-ckpt-109

0
·
5
·
Dec 2025
bespokelabsWarmTools8B32K

Qwen3-8B-ot_step20_high

0
·
5
·
Dec 2025
MultiRLWarmTools2B32K

qwen3_1.7b_easy_rl_reinforce_alpha_0.5

0
·
5
bespokelabsWarmTools8B32K

Qwen3-8B-ot_step42_high

0
·
5
·
Dec 2025
tronggWarmTools4B32K

Affine_VNHCM

0
·
5
·
Dec 2025
HallDWarmTools4B32K

SkeptiSTEM-4B-stageR1-merged-16bit

0
·
5
·
Dec 2025
hamishiviWarmTools8B32K

2010_rl_rag_NAR8_testing64_gpt5_sft_31605_no_cite__1__1765674535_checkpoints_step_3450

0
·
5
MultiRLWarmTools2B32K

qwen3_1.7b_easy_rl_final_gamma_1

0
·
5
·
Dec 2025
laionWarmTools8B32K

open-thoughts-4-code-qwen3-32b-annotated-gbs256-4node

0
·
5
·
Dec 2025
HJUNNWarmTools8B32K

Qwen2.5-7B-Instruct-crypto-function-calling

0
·
5
·
Dec 2025
ahme0599WarmTools3B32K

meta-llama_Llama-3.2-3B-Instruct-GRPO-vanilla_G_4-checkpoint-88

0
·
5
·
Dec 2025
ahme0599WarmTools2B32K

Qwen_Qwen2.5-1.5B-Instruct-GRPO-vanilla_G_4-checkpoint-510

0
·
5
·
Dec 2025
aiseosaeWarmTools4B32K

Affine-color7

0
·
5
·
Dec 2025
nightbloomWarmTools8B8K

YandexGPT-5-Lite-8B-ChatMl-alpha

2
·
5
·
Dec 2025
EvangelinejyWarmTools3B32K

llama3b-midtrain-open-thoughts114k_math-bs4-epoch1.0-ctx8192-ga1-lr1e-05-wr0.1-n4

0
·
5
·
Nov 2025
lzy337WarmTools4B32K

lzy-qwen3-4b-base-sft-openthoughts3

0
·
5
·
Jan 2026