Models

Warm  qwen25-7b-lc           Qwen/Qwen2.5-7B-Instruct                    855  8,577,218
Warm  qwen3-0b6              Qwen/Qwen3-0.6B                             761  7,171,929
Warm  qwen25-0b5             Gensyn/Qwen2.5-0.5B-Instruct                 26  6,497,777
Warm  llama31-8b             meta-llama/Llama-3.1-8B-Instruct          4,881  5,197,754
Warm  llama31-8b             meta-llama/Meta-Llama-3.1-8B-Instruct     4,881  5,197,754
Warm  qwen3-4b               Qwen/Qwen3-4B-Instruct-2507                 446  4,072,069
Warm  llama32-1b             meta-llama/Llama-3.2-1B-Instruct          1,147  3,845,138
Warm  tinyllama-1b1          TinyLlama/TinyLlama-1.1B-Chat-v1.0        1,443  3,840,280
Warm  qwen25-3b              Qwen/Qwen2.5-3B-Instruct                    325  3,667,220
Warm  mistral-v02-7b-std-lc  mistralai/Mistral-7B-Instruct-v0.2        3,007  3,406,417
Warm  qwen3-8b               Qwen/Qwen3-8B                               717  3,220,055
Warm  llama32-3b             meta-llama/Llama-3.2-3B-Instruct          1,793  1,912,700
Warm  qwen25-32b-lc          deepseek-ai/DeepSeek-R1-Distill-Qwen-32B  1,465  1,848,919
Warm  llama32-1b             meta-llama/Llama-3.2-1B                   2,154  1,832,875
Warm  llama3-8b-8k           meta-llama/Meta-Llama-3-8B                6,363  1,812,695
Warm  qwen25-0b5             Qwen/Qwen2.5-0.5B-Instruct                  387  1,606,700
Warm  qwen3-32b              Qwen/Qwen3-32B                              564  1,533,211
Warm  qwen25-7b-lc           Qwen/Qwen2.5-7B                             241  1,521,696
Warm  phi3-4b                microsoft/Phi-3-mini-4k-instruct          1,323  1,489,310
Warm  gemma3-12b             google/gemma-3-12b-it                       559  1,456,803