Models

llama31-8b-16k
meta-llama/Meta-Llama-3.1-8B-Instruct
Warm
3,216
5,513,850
llama3-8b-8k
meta-llama/Meta-Llama-3-8B-Instruct
Warm
3,664
2,609,281
llama31-70b-16k
meta-llama/Meta-Llama-3.1-70B-Instruct
Warm
733
1,542,149
llama31-8b-16k
meta-llama/Meta-Llama-3.1-8B
Warm
1,167
933,311
llama3-8b-8k
meta-llama/Meta-Llama-3-8B
Warm
5,885
688,721
mistral-v02-7b-std-lc
mistralai/Mistral-7B-Instruct-v0.2
Loading
2,586
562,157
llama31-8b-16k
meta-llama/Llama-Guard-3-8B
Warm
135
405,432
mistral-v02-7b-std-lc
HuggingFaceH4/zephyr-7b-beta
Warm
1,617
380,934
qwen2-72b-lc
Qwen/Qwen2.5-72B-Instruct
Warm
569
277,963
qwen2-32b-lc
Qwen/Qwen2.5-Coder-32B-Instruct
Warm
1,240
239,657
llama2-13b-4k
meta-llama/Llama-2-13b-chat-hf
Warm
1,033
229,755
llama3-8b-8k
NousResearch/Meta-Llama-3-8B-Instruct
Warm
87
229,632
mistral-nemo-12b-lc
mistralai/Mistral-Nemo-Instruct-2407
Warm
1,273
195,895
llama31-8b-16k
VAGOsolutions/Llama-3.1-SauerkrautLM-8b-Instruct
Warm
32
174,190
llama31-70b-16k
nvidia/Llama-3.1-Nemotron-70B-Instruct-HF
Warm
1,796
168,793
mistral-nemo-12b-lc
VAGOsolutions/SauerkrautLM-Nemo-12b-Instruct
Warm
22
108,380
llama3-8b-8k
MLP-KTLim/llama-3-Korean-Bllossom-8B
Warm
281
101,515
llama31-70b-16k
meta-llama/Meta-Llama-3.1-70B
Warm
319
97,287
llama3-70b-8k
meta-llama/Meta-Llama-3-70B-Instruct
Warm
1,439
94,203
llama2-solar-10b7-4k
upstage/SOLAR-10.7B-Instruct-v1.0
Warm
617
91,830