Models

Warm

qwen25-3b

Qwen/Qwen2.5-3B-Instruct

342

9,359,217

Q
Warm

qwen3-0b6

Qwen/Qwen3-0.6B

861

7,720,145

Q
Warm

qwen25-7b

Qwen/Qwen2.5-7B-Instruct

939

6,952,222

Q
Warm

qwen3-4b

Qwen/Qwen3-4B-Instruct-2507

544

5,948,247

Q
Warm

qwen25-1b5

Qwen/Qwen2.5-1.5B-Instruct

565

5,673,607

Q
Warm

qwen25-0b5

Qwen/Qwen2.5-Coder-0.5B-Instruct

54

5,600,442

Q
Warm

llama31-8b

meta-llama/Llama-3.1-8B-Instruct

5,112

5,424,321

M
Warm

llama31-8b

meta-llama/Meta-Llama-3.1-8B-Instruct

5,112

5,424,321

M
Warm

qwen3-8b

Qwen/Qwen3-8B

798

4,882,141

Q
Warm

qwen3-32b

Qwen/Qwen3-32B

597

4,259,661

Q
Warm

qwen3-1b7

Qwen/Qwen3-1.7B

349

4,223,240

Q
Warm

qwen3-4b

Qwen/Qwen3-4B

486

3,689,378

Q
Warm

mistral-v02-7b

mistralai/Mistral-7B-Instruct-v0.2

3,030

3,511,632

M
Warm

llama32-1b

meta-llama/Llama-3.2-1B-Instruct

1,194

3,473,341

M
Warm

llama32-1b

meta-llama/Llama-3.2-1B

2,211

3,196,797

M
Warm

qwen25-1b5

deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B

1,404

2,656,571

D
Warm

gemma3t-1b

google/gemma-3-1b-it

744

2,449,177

G
Warm

qwen25-32b

deepseek-ai/DeepSeek-R1-Distill-Qwen-32B

1,473

2,422,611

D
Warm

qwen25-0b5

Qwen/Qwen2.5-0.5B-Instruct

406

2,242,688

Q
Warm

llama3-8b

meta-llama/Meta-Llama-3-8B

6,400

2,086,262

M