Models

292
XueZhang-bjtuWarm8B32K

M-Thinker-7B-Iter2

0
·
9
·
Oct 2025
prithivMLmodsWarm15B32K

Magellanic-Opus-14B-Exp

2
·
9
·
Feb 2025
Harish102005Warm8B32K

Qwen2.5-Coder-7B-manim

1
·
9
·
Oct 2025
dipta007Warm4B32K

GanitLLM-4B_CGRPO

0
·
9
·
Jan 2026
dipta007Warm2B32K

GanitLLM-1.7B_CGRPO

0
·
9
·
Jan 2026
prithivMLmodsWarm15B32K

Epimetheus-14B-Axo

2
·
9
·
Mar 2025
TitleOSWarm4B32K

Phi-4-mini-reasoning-heretic

0
·
9
·
Apr 2026
kmseongWarm7B4K

llama2_7b_chat-WaRP-SN-Tune-lr7e-5

0
·
9
·
Apr 2026
sparkle-reasoningWarm8B32K

SparkleRL-7B-Stage2-aug

3
·
8
SakanaAIWarm33B32K

RLT-32B

7
·
8
TIGER-LabWarm33B32K

Qwen2.5-32B-Instruct-CFT

6
·
7
LLM360Warm1B32K

MegaMath-Llama-3.2-1B

1
·
7
AquaLabsWarm1B32K

Llama-3.2-1B-GSM8K

1
·
7
kainatqWarm12B32K

RP-king-12b

5
·
7
harithoppilWarm800M32K

Qwen3-0.6B-English

0
·
7
·
Feb 2026
HasuerYuWarm2B32K

KnowRL-Nemotron-1.5B

1
·
7
·
Apr 2026
kmseongWarm8B32K

llama3.1_8b_base-Safety-FT-lr3e-5

0
·
7
·
Apr 2026
prithivMLmodsWarm1B32K

PyThagorean-Tiny

2
·
6
PinkPixelWarm4B32K

Crystal-Think-V2

7
·
6
ArioronWarm800M32K

Vex-Amber-Mini-1.2

0
·
6
·
Oct 2025
tripathysagarWarm500M32K

Qwen2.5-0.5B-GSM8K-SFT

0
·
6
·
Feb 2026
clarkkitchen22Warm8B32K

Qwen3-8B-GSM8K-Synth-50K

0
·
6
·
Feb 2026
pmahdaviWarm8B32K

Llama-3.1-8B-math-reasoning

0
·
6
·
May 2025
liniusWarm8B32K

Qwen3-8B-SPoT

2
·
6
·
Mar 2026
daydreamwarriorWarm4B32K

Nemotron-Research-GooseReason-4B-Instruct-heretic-v2

1
·
6
·
Mar 2026
hyunseokiWarm8B32K

verl-math-transfer-7bi-to-3bi-fix07-pool7to1

0
·
6
·
Mar 2026
pixasWarm8B32K

Miner-8B

0
·
6
·
Apr 2026
kmseongWarm7B4K

llama2_7b-chat-Safety-FT-lr3e-5

0
·
6
·
Apr 2026
kmseongWarm8B32K

llama3.1_8b_base-SSFT-start-WaRP-original-space-gsm8k-FT-lr3e-5

0
·
6
·
Apr 2026
prithivMLmodsWarm4B32K

Lacaille-MoT-4B-Supreme2

9
·
5
prithivMLmodsWarm4B32K

Gliese-4B-OSS-0410

3
·
5
tensorhydraWarm8B32K

qwen3-8b-aimo3-tir

0
·
5
·
Mar 2026
mahernaijaWarm33B32K

qwen25-32b-nemotron-finetuned

0
·
5
·
Mar 2026
lipilipicWarm2B32K

qwen2_5_math_1_5b_Instruct-NSFW-U-V3.1

0
·
5
·
Apr 2026
kmseongWarm8B32K

llama3.1_8b_base-WaRP-safety-basis-gsm8k-FT-lr3e-5

0
·
5
·
Apr 2026
fluently-lmWarm8B32K

Llama-TI-8B-Instruct

2
·
4
INSAIT-InstituteWarm4B32K

BrokenMath-Qwen3-4B

1
·
4
khazaraiWarm500M32K

Math-RL

1
·
4
·
Mar 2026
hyunseokiWarm8B32K

verl-math-transfer-7bi-to-7bi-v2

0
·
4
·
Mar 2026
Harsha901Warm4B32K

Qwen3-4B-Inst-Math-Reasoning-SFT

0
·
4
·
Dec 2025
pixasWarm4B32K

Miner-4B

0
·
4
·
Apr 2026
aryan-kolapkarWarm2B32K

MathReasoner-Mini-1.5b

1
·
4
·
Nov 2025