Models

Warm | qwen25-7b-lc | Qwen/Qwen2.5-7B-Instruct | 907 | 7,812,528
Warm | qwen3-0b6 | Qwen/Qwen3-0.6B | 819 | 6,836,664
Warm | qwen25-3b | Qwen/Qwen2.5-3B-Instruct | 334 | 6,100,885
Warm | qwen3-4b | Qwen/Qwen3-4B-Instruct-2507 | 503 | 5,521,984
Warm | llama31-8b | meta-llama/Llama-3.1-8B-Instruct | 5,017 | 5,221,966
Warm | llama31-8b | meta-llama/Meta-Llama-3.1-8B-Instruct | 5,017 | 5,221,966
Warm | qwen3-8b | Qwen/Qwen3-8B | 774 | 4,693,582
Warm | mistral-v02-7b-std-lc | mistralai/Mistral-7B-Instruct-v0.2 | 3,018 | 3,750,478
Warm | llama32-1b | meta-llama/Llama-3.2-1B-Instruct | 1,179 | 3,686,452
Warm | qwen25-0b5 | Gensyn/Qwen2.5-0.5B-Instruct | 28 | 3,572,773
Warm | llama32-1b | meta-llama/Llama-3.2-1B | 2,198 | 3,275,581
Warm | qwen25-0b5 | Qwen/Qwen2.5-Coder-0.5B-Instruct | 52 | 3,142,890
Warm | qwen3-4b | Qwen/Qwen3-4B | 470 | 2,466,414
Warm | llama32-3b | context-labs/meta-llama-Llama-3.2-3B-Instruct-FP16 | 7 | 2,148,218
Warm | llama3-8b-8k | meta-llama/Meta-Llama-3-8B | 6,390 | 2,111,790
Warm | qwen3-32b | Qwen/Qwen3-32B | 583 | 2,065,625
Warm | qwen25-32b-lc | deepseek-ai/DeepSeek-R1-Distill-Qwen-32B | 1,470 | 1,949,773
Warm | tinyllama-1b1 | TinyLlama/TinyLlama-1.1B-Chat-v1.0 | 1,461 | 1,879,960
Warm | llama32-3b | meta-llama/Llama-3.2-3B-Instruct | 1,837 | 1,683,545
Warm | qwen25-0b5 | Qwen/Qwen2.5-0.5B-Instruct | 395 | 1,619,363