Models

37,234
8B32Kllama31-8b
Warm

RLHFlow/Llama3.1-8B-ORM-Deepseek-Data

2
·
37
8B32Kqwen25-7b
Warm

sethuiyer/Qwen2.5-7B-Anvita

2
·
37
1B32Kllama32-1b
Warm

NbAiLab/nb-llama-3.2-1B

3
·
37
·
Nov 2024
800M32Kqwen3-0b6
Warm

TarhanE/GRPO-Qwen3-0.6B

0
·
37
4B32Kqwen3-4b
Warm

Cannae-AI/MedicalQwen3-Reasoning-4B

2
·
37
·
Nov 2025
1B32Kllama32-1b
Warm

mohammadmahdinouri/distilled-interleaved-1B-v1

0
·
37
·
Apr 2025
500M32Kqwen2-0b5
Warm

uniswap/Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-large_trotting_baboon

0
·
37
·
Nov 2025
3B32Kllama32-3b
Warm

kmseong/SN-GSM8K-SFT-Model

0
·
37
·
Dec 2025
4B32Kqwen3-4b
Warm

mlxha/Qwen3-4B-grpo-medmcqa

2
·
37
·
May 2025
1B32Kllama32-1b
Warm

avk20/Llama-3.2-1B-Instruct

0
·
37
·
Mar 2025
800M32Kqwen3-0b6
Warm

Javelin0192/Qwen3-0.6B-Gensyn-Swarm-grunting_omnivorous_barracuda

0
·
37
·
Oct 2025
500M32Kqwen2-0b5
Warm

johnnyd-gensyn/Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-trotting_quick_elephant

0
·
37
·
Nov 2025
8B32Kllama31-8b
Cold

ailexleon/Assistant_Pepe_8B-mlx-fp16

0
·
37
·
Feb 2026
4B32Kqwen3-4b
Warm

rubricreward/R3-Qwen3-4B-14k

1
·
37
·
May 2025
4B32Kqwen3-4b
Warm

rubricreward/mR3-Qwen3-4B-en-prompt-en-thinking

2
·
37
·
Sep 2025
1B2Ktinyllama-1b1
Warm

miolg/d0e94ab4

0
·
37
·
Aug 2025
8B32Kllama31-8b
Warm

mehuldamani/sft-base-half-tranches-v1-global-step-394

0
·
37
·
Dec 2025
2B32Kqwen2-1b5
Warm

LEO0925/temp-qwen2.5-1.5b-koeantextbook-finetuned

0
·
37
·
Mar 2026
500M32Kqwen2-0b5
Warm

EvelienUU/chess-qwen-finetuned-v2

0
·
37
·
Mar 2026
800M32Kqwen3-0b6
Warm

LorenaYannnnn/20260306-confidence_only-Qwen3-0.6B_grpo_baseline_192000_episodes_seed_42

0
·
37
·
Mar 2026