Models

llama31-8b-16k
meta-llama/Meta-Llama-3.1-8B-Instruct
Warm
3,597
5,801,205
mistral-v02-7b-std-lc
mistralai/Mistral-7B-Instruct-v0.2
Loading
2,647
3,162,885
deepseek-v3-lc
deepseek-ai/DeepSeek-R1
Warm
8,108
2,670,920
llama3-8b-8k
meta-llama/Meta-Llama-3-8B-Instruct
Warm
3,804
2,167,006
llama31-8b-16k
meta-llama/Meta-Llama-3.1-8B
Warm
1,389
1,129,360
llama3-8b-8k
meta-llama/Meta-Llama-3-8B
Warm
6,009
643,852
llama33-70b-16k
meta-llama/Llama-3.3-70B-Instruct
Warm
1,897
623,681
llama31-70b-16k
mlx-community/Llama-3.1-Nemotron-70B-Instruct-HF-bf16
Loading
1
570,421
qwen25-32b-lc
deepseek-ai/DeepSeek-R1-Distill-Qwen-32B
Warm
971
534,695
mistral-nemo-12b-lc
Epiculous/Violet_Twilight-v0.2
Warm
27
499,477
qwen25-7b-lc
deepseek-ai/DeepSeek-R1-Distill-Qwen-7B
Warm
388
472,188
llama31-8b-16k
deepseek-ai/DeepSeek-R1-Distill-Llama-8B
Warm
483
446,494
llama31-70b-16k
mlx-community/Meta-Llama-3.1-70B-Instruct-bf16-CORRECTED
Warm
0
404,126
llama33-70b-16k
unsloth/Llama-3.3-70B-Instruct
Warm
37
403,123
llama33-70b-16k
Sao10K/70B-L3.3-Cirrus-x1
Warm
22
380,278
llama31-70b-16k
meta-llama/Meta-Llama-3.1-70B-Instruct
Warm
784
374,829
qwen25-32b-lc
Qwen/Qwen2.5-32B-Instruct
Warm
191
372,576
llama31-8b-16k
meta-llama/Llama-Guard-3-8B
Warm
159
365,207
mistral-nemo-12b-lc
IlyaGusev/saiga_nemo_12b
Warm
37
358,016
mistral-v02-7b-std-lc
HuggingFaceH4/zephyr-7b-beta
Warm
1,651
284,435