Models

Warm  llama31-8b-16k         meta-llama/Meta-Llama-3.1-8B-Instruct      3,980  4,851,980
Warm  qwen25-7b-lc           Qwen/Qwen2.5-7B-Instruct                     677  2,436,781
Warm  qwen25-7b-lc           Qwen/Qwen2.5-7B-Instruct-1M                  323  2,294,318
Warm  llama3-8b-8k           meta-llama/Meta-Llama-3-8B                 6,173  2,043,976
Warm  mistral-v02-7b-std-lc  mistralai/Mistral-7B-Instruct-v0.2         2,766  1,501,466
Warm  llama3-8b-8k           meta-llama/Meta-Llama-3-8B-Instruct        3,965  1,182,113
Warm  llama31-8b-16k         meta-llama/Meta-Llama-3.1-8B               1,618  1,138,449
Warm  gemma3-27b             google/gemma-3-27b-it                      1,099    964,080
Warm  llama31-70b-16k        meta-llama/Meta-Llama-3.1-70B-Instruct       810    945,390
Warm  llama31-8b-16k         deepseek-ai/DeepSeek-R1-Distill-Llama-8B     726    917,872
Warm  qwen25-32b-lc          deepseek-ai/DeepSeek-R1-Distill-Qwen-32B   1,371    836,053
Warm  llama33-70b-16k        meta-llama/Llama-3.3-70B-Instruct          2,326    735,286
Warm  qwen25-7b-lc           deepseek-ai/DeepSeek-R1-Distill-Qwen-7B      646    676,785
Warm  deepseek-v3-lc         deepseek-ai/DeepSeek-R1                   12,196    666,568
Warm  qwen25-14b-lc          Qwen/Qwen2.5-14B-Instruct                    234    549,254
Warm  mistral-nemo-12b-lc    Nitral-AI/Captain-Eris_Violet-V0.420-12B      40    532,663
Warm  qwen25-32b-lc          Qwen/Qwen2.5-32B-Instruct                    273    524,677
Warm  qwen3-8b               Qwen/Qwen3-8B                                301    520,066
Warm  qwen25-14b-lc          deepseek-ai/DeepSeek-R1-Distill-Qwen-14B     511    512,164
Warm  mistral-v02-7b-std-lc  HuggingFaceH4/zephyr-7b-beta               1,711    454,811