Models

| Status | Model class | Model | Likes | Downloads |
|---|---|---|---:|---:|
| Warm | qwen25-32b-lc | deepseek-ai/DeepSeek-R1-Distill-Qwen-32B | 1,350 | 2,412,688 |
| Warm | deepseek-v3-lc | deepseek-ai/DeepSeek-R1 | 11,993 | 1,802,384 |
| Warm | qwen25-7b-lc | deepseek-ai/DeepSeek-R1-Distill-Qwen-7B | 619 | 1,090,513 |
| Warm | qwen25-14b-lc | deepseek-ai/DeepSeek-R1-Distill-Qwen-14B | 502 | 934,186 |
| Warm | llama31-8b-16k | deepseek-ai/DeepSeek-R1-Distill-Llama-8B | 703 | 867,052 |
| Warm | llama33-70b-16k | deepseek-ai/DeepSeek-R1-Distill-Llama-70B | 666 | 258,124 |
| Warm | deepseek-v3-lc | deepseek-ai/DeepSeek-V3-0324 | 2,749 | 255,132 |
| Warm | llama31-8b-16k | RLHFlow/Llama3.1-8B-PRM-Deepseek-Data | 32 | 26,447 |
| Warm | qwen2-7b-lc | fangyili/deepseek-distill-qwen-7b-merged-peft | 0 | 15,262 |
| Warm | llama31-8b-16k | unsloth/DeepSeek-R1-Distill-Llama-8B | 96 | 14,567 |
| Warm | qwen2-7b-lc | unsloth/DeepSeek-R1-Distill-Qwen-7B | 12 | 11,087 |
| Warm | qwen2-32b-lc | unsloth/DeepSeek-R1-Distill-Qwen-32B | 11 | 9,931 |
| Warm | qwen2-14b-lc | cyberagent/DeepSeek-R1-Distill-Qwen-14B-Japanese | 83 | 7,155 |
| Cold | qwen25-32b-lc | rinna/deepseek-r1-distill-qwen2.5-bakeneko-32b | 7 | 6,710 |
| Warm | qwen2-14b-lc | unsloth/DeepSeek-R1-Distill-Qwen-14B | 16 | 5,889 |
| Warm | llama31-8b-16k | UNIVA-Bllossom/DeepSeek-llama3.1-Bllossom-8B | 42 | 4,693 |
| Warm | qwen2-14b-lc | Jianyuan1/deepseek-r1-14b-cot-math-reasoning-full | 2 | 4,624 |
| Warm | qwen25-32b-lc | huihui-ai/DeepSeek-R1-Distill-Qwen-32B-abliterated | 212 | 3,207 |
| Warm | qwen2-32b-lc | cyberagent/DeepSeek-R1-Distill-Qwen-32B-Japanese | 247 | 2,611 |
| Warm | qwen25-32b-lc | FuseAI/FuseO1-DeepSeekR1-QwQ-SkyT1-32B-Preview | 127 | 2,273 |