Models

2,099
3B32Kqwen25-3b
Warm

9Tobi/Qwen_3B_Instruct_2_lvl12_less_steps

0
·
6
·
Feb 2026
3B32Kqwen25-3b
Warm

xinpeng/big-math-hard-tiny-qwen2.5-3b-instruct-og-rloo-implicit-cheat-no-global_step_45

0
·
6
·
Sep 2025
33B32Kqwen25-32b
Warm

nbeerbower/Dumpling-Qwen2.5-32B

11
·
5
·
Jan 2025
8B32Kqwen25-7b
Warm

mlfoundations-dev/DCFT-Stratos-Verified-114k-7B-4gpus

1
·
5
8B32Kqwen25-7b
Warm

yellowtown/7B-v0.2

0
·
5
15B32Kqwen25-14b
Warm

rcds/Qwen2.5-14B-Instruct-SLDS

0
·
5
8B32Kqwen25-7b
Warm

mlfoundations-dev/seed_math_tiger_math_reasoninghp

0
·
5
8B32Kqwen25-7b
Warm

mlfoundations-dev/LIMO

0
·
5
8B32Kqwen25-7b
Warm

mlfoundations-dev/difficulty_sorting_high_seed_math

0
·
5
8B32Kqwen25-7b
Warm

mlfoundations-dev/stratos_verified_mix_epochs5

0
·
5
8B32Kqwen25-7b
Warm

skzxjus/Qwen2.5-7B-1m-Open-R1-Distill

4
·
5
8B32Kqwen25-7b
Warm

flyingbugs/OpenR1-Qwen-7B-SFT

1
·
5
15B32Kqwen25-14b
Warm

MrezaPRZ/Qwen2.5-Coder-14B-Instruct-SQL

0
·
5
8B32Kqwen25-7b
Warm

trollek/Qwen2.5-7B-CySecButler-v0.1

3
·
5
33B32Kqwen25-32b
Warm

maldv/Loqwqtus2.5-32B-Instruct

2
·
5
8B32Kqwen25-7b
Warm

yhkim9362/Qwen2.5-7B-Instruct-ko-lora-koalpaca-namuwiki-2epochs

0
·
5
500M32Kqwen25-0b5
Warm

AngelRaychev/0.5B-policy-iteration_3

0
·
5
8B32Kqwen25-7b
Warm

ChetKao/Bohdi-Qwen2.5-7B-Instruct

1
·
5
8B32Kqwen25-7b
Warm

AmberYifan/Qwen2.5-7B-Instruct-userfeedback-SFT-SPIN-iter1

1
·
5
8B32Kqwen25-7b
Warm

AmberYifan/Qwen2.5-7B-Instruct-userfeedback-SPIN-iter2

1
·
5