Models

Warm

llama31-8b-16k

meta-llama/Meta-Llama-3.1-8B-Instruct

3,784

6,047,201

M
Warm

mistral-v02-7b-std-lc

mistralai/Mistral-7B-Instruct-v0.2

2,702

3,180,708

M
Warm

qwen25-32b-lc

deepseek-ai/DeepSeek-R1-Distill-Qwen-32B

1,306

2,291,459

D
Warm

qwen25-7b-lc

Qwen/Qwen2.5-7B-Instruct

613

2,283,627

Q
Warm

qwen25-7b-lc

deepseek-ai/DeepSeek-R1-Distill-Qwen-7B

591

1,480,693

D
Warm

deepseek-v3-lc

deepseek-ai/DeepSeek-R1

11,791

1,403,060

D
Warm

qwen25-7b-lc

Qwen/Qwen2.5-7B-Instruct-1M

293

1,349,378

Q
Warm

llama31-70b-16k

meta-llama/Meta-Llama-3.1-70B-Instruct

805

1,244,604

M
Warm

llama33-70b-16k

meta-llama/Llama-3.3-70B-Instruct

2,228

1,116,211

M
Loading

llama3-8b-8k

meta-llama/Meta-Llama-3-8B-Instruct

3,894

1,084,496

M
Warm

llama31-8b-16k

deepseek-ai/DeepSeek-R1-Distill-Llama-8B

683

1,005,845

D
Warm

llama31-8b-16k

meta-llama/Meta-Llama-3.1-8B

1,531

986,413

M
Warm

gemma3-27b

google/gemma-3-27b-it

1,099

964,080

G
Warm

qwen25-32b-lc

Qwen/QwQ-32B

2,623

831,715

Q
Warm

qwen25-14b-lc

deepseek-ai/DeepSeek-R1-Distill-Qwen-14B

487

755,131

D
Cold

llama3-8b-8k

meta-llama/Meta-Llama-3-8B

6,124

657,680

M
Warm

qwen25-14b-lc

Qwen/Qwen2.5-14B-Instruct

217

651,047

Q
Warm

mistral-nemo-12b-lc

Nitral-AI/Captain-Eris_Violet-V0.420-12B

35

637,469

N
Warm

mistral-v02-7b-std-lc

HuggingFaceH4/zephyr-7b-beta

1,678

601,831

H
Loading

llama3-70b-8k

meta-llama/Meta-Llama-3-70B-Instruct

1,467

532,205

M