Models

Warm  qwen25-7b-lc           Qwen/Qwen2.5-7B-Instruct                              887  8,377,447
Warm  qwen3-0b6              Qwen/Qwen3-0.6B                                       798  7,174,437
Warm  qwen25-3b              Qwen/Qwen2.5-3B-Instruct                              331  6,014,776
Warm  qwen3-4b               Qwen/Qwen3-4B-Instruct-2507                           490  5,458,206
Warm  qwen25-0b5             Gensyn/Qwen2.5-0.5B-Instruct                           27  5,408,496
Warm  llama31-8b             meta-llama/Meta-Llama-3.1-8B-Instruct               4,973  5,155,971
Warm  llama31-8b             meta-llama/Llama-3.1-8B-Instruct                    4,973  5,155,971
Warm  qwen3-8b               Qwen/Qwen3-8B                                         762  4,592,480
Warm  llama32-1b             meta-llama/Llama-3.2-1B-Instruct                    1,168  3,745,232
Warm  mistral-v02-7b-std-lc  mistralai/Mistral-7B-Instruct-v0.2                  3,015  3,093,889
Warm  tinyllama-1b1          TinyLlama/TinyLlama-1.1B-Chat-v1.0                  1,452  2,956,172
Warm  llama32-3b             context-labs/meta-llama-Llama-3.2-3B-Instruct-FP16      7  2,692,946
Warm  qwen3-4b               Qwen/Qwen3-4B                                         460  2,474,714
Warm  llama32-1b             meta-llama/Llama-3.2-1B                             2,180  2,151,171
Warm  llama3-8b-8k           meta-llama/Meta-Llama-3-8B                          6,381  2,120,410
Warm  llama32-3b             meta-llama/Llama-3.2-3B-Instruct                    1,823  1,849,516
Warm  qwen25-0b5             Qwen/Qwen2.5-Coder-0.5B-Instruct                       52  1,789,869
Warm  qwen25-32b-lc          deepseek-ai/DeepSeek-R1-Distill-Qwen-32B            1,469  1,762,915
Warm  qwen25-0b5             Qwen/Qwen2.5-0.5B-Instruct                            392  1,547,012
Warm  gemma3-12b             google/gemma-3-12b-it                                 572  1,496,447