Models

Warm

qwen25-7b-lc

Qwen/Qwen2.5-7B-Instruct

849

8,169,288

Q
Warm

qwen3-0b6

Qwen/Qwen3-0.6B

752

7,327,000

Q
Warm

qwen25-0b5

Gensyn/Qwen2.5-0.5B-Instruct

26

6,421,569

G
Warm

llama31-8b

meta-llama/Llama-3.1-8B-Instruct

4,853

5,243,778

M
Warm

llama31-8b

meta-llama/Meta-Llama-3.1-8B-Instruct

4,853

5,243,778

M
Warm

tinyllama-1b1

TinyLlama/TinyLlama-1.1B-Chat-v1.0

1,438

4,184,525

T
Warm

qwen3-4b

Qwen/Qwen3-4B-Instruct-2507

436

3,942,352

Q
Warm

llama32-1b

meta-llama/Llama-3.2-1B-Instruct

1,141

3,843,095

M
Warm

qwen25-3b

Qwen/Qwen2.5-3B-Instruct

325

3,698,029

Q
Warm

llama32-3b

context-labs/meta-llama-Llama-3.2-3B-Instruct-FP16

7

3,178,981

C
Warm

mistral-v02-7b-std-lc

mistralai/Mistral-7B-Instruct-v0.2

2,997

2,963,552

M
Warm

qwen3-8b

Qwen/Qwen3-8B

708

2,548,149

Q
Warm

qwen25-32b-lc

deepseek-ai/DeepSeek-R1-Distill-Qwen-32B

1,464

2,353,666

D
Warm

llama32-3b

inference-net/Schematron-3B

98

2,248,942

I
Warm

llama32-3b

meta-llama/Llama-3.2-3B-Instruct

1,789

1,924,026

M
Warm

llama32-1b

meta-llama/Llama-3.2-1B

2,143

1,788,591

M
Warm

llama3-8b-8k

meta-llama/Meta-Llama-3-8B

6,357

1,760,997

M
Warm

qwen25-7b-lc

Qwen/Qwen2.5-7B

238

1,681,974

Q
Warm

qwen25-0b5

Qwen/Qwen2.5-0.5B-Instruct

385

1,623,204

Q
Warm

qwen3-32b

Qwen/Qwen3-32B

561

1,610,353

Q