Models

Warm

llama31-8b-16k

meta-llama/Llama-3.1-8B-Instruct

4,556

11,217,564

M
Warm

llama31-8b-16k

meta-llama/Meta-Llama-3.1-8B-Instruct

4,556

11,217,564

M
Warm

qwen25-7b-lc

Qwen/Qwen2.5-7B-Instruct

770

10,610,910

Q
Warm

qwen3-8b

Qwen/Qwen3-8B

577

4,006,273

Q
Warm

qwen25-1b5

Qwen/Qwen2.5-1.5B-Instruct

500

3,964,861

Q
Warm

gpt-oss-120b

openai/gpt-oss-120b

3,699

2,556,263

O
Warm

llama31-8b-16k

deepseek-ai/DeepSeek-R1-Distill-Llama-8B

793

2,195,446

D
Warm

qwen25-7b-lc

deepseek-ai/DeepSeek-R1-Distill-Qwen-7B

706

1,790,972

D
Warm

qwen25-32b-lc

deepseek-ai/DeepSeek-R1-Distill-Qwen-32B

1,437

1,713,675

D
Warm

tinyllama-1b1

TinyLlama/TinyLlama-1.1B-Chat-v1.0

1,385

1,654,240

T
Warm

llama3-8b-8k

meta-llama/Meta-Llama-3-8B

6,297

1,605,191

M
Warm

qwen25-7b-lc

Qwen/Qwen2.5-7B

221

1,353,975

Q
Warm

qwen3-8b

Qwen/Qwen3-8B-Base

54

1,288,841

Q
Warm

llama31-8b-16k

meta-llama/Llama-3.1-8B

1,762

1,255,980

M
Warm

llama31-8b-16k

meta-llama/Meta-Llama-3.1-8B

1,762

1,255,980

M
Warm

qwen3-1b7

Qwen/Qwen3-1.7B

242

1,095,454

Q
Warm

qwen3-14b

Qwen/Qwen3-14B

254

1,039,718

Q
Warm

qwen25-14b-lc

Qwen/Qwen2.5-14B-Instruct

268

1,031,431

Q
Warm

llama3-8b-8k

meta-llama/Meta-Llama-3-8B-Instruct

4,157

992,325

M
Warm

mistral-v02-7b-std-lc

HuggingFaceH4/zephyr-7b-beta

1,770

968,929

H