Compatible Applications
These are some of the applications that support our API, if you would like to add your application to this list, please contact us.
Model Types
Our ever-growing list of supported model types, including LLaMA-3, Mistral, and more.
gpt-sw3-126m
0 models Minimum Subscription Basic
Context Size 2,048
Parameters 0.2B
Max Output 2,048
qwen2-0b5
1949 models Minimum Subscription Basic
Context Size 131,072
Parameters 0.5B
Max Output 4,096
qwen25-0b5
0 models Minimum Subscription Basic
Context Size 32,768
Parameters 0.5B
Max Output 4,096
gpt-sw3-356m
0 models Minimum Subscription Basic
Context Size 2,048
Parameters 0.5B
Max Output 2,048
qwen15-0b5
0 models Minimum Subscription Basic
Context Size 32,768
Parameters 0.6B
Max Output 4,096
qwen3-0b6
1 models Minimum Subscription Basic
Context Size 40,960
Parameters 0.8B
Max Output 4,096
gemma3-1b
0 models Minimum Subscription Basic
Context Size 32,768
Parameters 1B
Max Output 4,096
llama32-1b
2289 models Minimum Subscription Basic
Context Size 32,768
Parameters 1B
Max Output 4,096
tinyllama-1b1
430 models Minimum Subscription Basic
Context Size 2,048
Parameters 1.1B
Max Output 2,048
gpt-sw3-1b3
0 models Minimum Subscription Basic
Context Size 2,048
Parameters 1.4B
Max Output 2,048
qwen2-1b5
0 models Minimum Subscription Basic
Context Size 131,072
Parameters 1.5B
Max Output 4,096
qwen25-1b5
78 models Minimum Subscription Basic
Context Size 131,072
Parameters 1.5B
Max Output 4,096
qwen15-1b8
49 models Minimum Subscription Basic
Context Size 32,768
Parameters 1.8B
Max Output 4,096
qwen3-1b7
0 models Minimum Subscription Basic
Context Size 40,960
Parameters 2B
Max Output 4,096
gemma-2b-8k
0 models Minimum Subscription Basic
Context Size 8,192
Parameters 2.5B
Max Output 4,096
gemma2-2b
782 models Minimum Subscription Basic
Context Size 8,192
Parameters 2.6B
Max Output 4,096
qwen25-3b
0 models Minimum Subscription Basic
Context Size 32,768
Parameters 3.1B
Max Output 4,096
llama32-3b
0 models Minimum Subscription Basic
Context Size 32,768
Parameters 3.2B
Max Output 4,096
phi3-mini
0 models Minimum Subscription Basic
Context Size 4,096
Parameters 3.8B
Max Output 4,096
phi35-mini
0 models Minimum Subscription Basic
Context Size 131,072
Parameters 3.8B
Max Output 4,096
qwen15-4b
0 models Minimum Subscription Basic
Context Size 32,768
Parameters 4B
Max Output 4,096
qwen3-4b
0 models Minimum Subscription Basic
Context Size 40,960
Parameters 4B
Max Output 4,096
gemma3mm-4b
0 models Minimum Subscription Basic
Context Size 16,384
Parameters 4.3B
Max Output 4,096
RWKV
rwkv5-7b Minimum Subscription Basic
Context Size 16,384
Parameters 7B
Max Output 4,096
Mistral v0.2 7B
mistral-v02-7b-std-lc Minimum Subscription Basic
Context Size 8,192
Parameters 7B
Max Output 4,096
rwkv6-7b-16k
1 models Minimum Subscription Basic
Context Size 16,384
Parameters 7B
Max Output 4,096
qwen25-7b-lc
280 models Minimum Subscription Basic
Context Size 16,384
Parameters 7B
Max Output 4,096
qoose-7b-16k
0 models Minimum Subscription Basic
Context Size 16,384
Parameters 7B
Max Output 4,096
qwen2-7b-lc
98 models Minimum Subscription Basic
Context Size 16,384
Parameters 7B
Max Output 4,096
llama2-7b
2 models Minimum Subscription Basic
Context Size 4,096
Parameters 7B
Max Output 4,096
mistral-v01-7b
1 models Minimum Subscription Basic
Context Size 4,096
Parameters 7B
Max Output 4,096
gpt-sw3-6b7
0 models Minimum Subscription Basic
Context Size 2,048
Parameters 7.1B
Max Output 2,048
qwen2-7b
0 models Minimum Subscription Basic
Context Size 131,072
Parameters 7.6B
Max Output 4,096
qwen25-7b
0 models Minimum Subscription Basic
Context Size 131,072
Parameters 7.6B
Max Output 4,096
qwen15-7b
0 models Minimum Subscription Basic
Context Size 32,768
Parameters 7.7B
Max Output 4,096
Llama 3 8B
llama3-8b-8k Minimum Subscription Basic
Context Size 8,192
Parameters 8B
Max Output 4,096
Llama 3.1 8B
llama31-8b-16k Minimum Subscription Basic
Context Size 16,384
Parameters 8B
Max Output 4,096
llama33-8b-16k
0 models Minimum Subscription Basic
Context Size 16,384
Parameters 8B
Max Output 4,096
qwen3-8b
11 models Minimum Subscription Basic
Context Size 16,384
Parameters 8B
Max Output 4,096
gemma-7b-8k
0 models Minimum Subscription Basic
Context Size 8,192
Parameters 8.5B
Max Output 4,096
gemma2-9b
2 models Minimum Subscription Basic
Context Size 16,384
Parameters 9B
Max Output 4,096
glm4-9b
3 models Minimum Subscription Basic
Context Size 16,384
Parameters 9B
Max Output 4,096
Llama 2 Solar 10B
llama2-solar-10b7-4k Minimum Subscription Basic
Context Size 4,096
Parameters 10.7B
Max Output 4,096
llama2-10b7
0 models Minimum Subscription Basic
Context Size 4,096
Parameters 10.7B
Max Output 4,096
Mistral Nemo 12B
mistral-nemo-12b-lc Minimum Subscription Basic
Context Size 16,384
Parameters 12B
Max Output 4,096
gemma3-12b
6 models Minimum Subscription Basic
Context Size 16,384
Parameters 12B
Max Output 4,096
gemma3mm-12b
0 models Minimum Subscription Basic
Context Size 16,384
Parameters 12.2B
Max Output 4,096
Llama 2 13B
llama2-13b-4k Minimum Subscription Basic
Context Size 4,096
Parameters 13B
Max Output 4,096
rwkv6-14b-16k
1 models Minimum Subscription Basic
Context Size 16,384
Parameters 14B
Max Output 4,096
qwen25-14b-lc
83 models Minimum Subscription Basic
Context Size 16,384
Parameters 14B
Max Output 4,096
qwen2-14b-lc
35 models Minimum Subscription Basic
Context Size 16,384
Parameters 14B
Max Output 4,096
qwen3-14b
4 models Minimum Subscription Basic
Context Size 16,384
Parameters 14B
Max Output 4,096
qwen15-14b
0 models Minimum Subscription Basic
Context Size 32,768
Parameters 14.2B
Max Output 4,096
qwen25-14b
0 models Minimum Subscription Basic
Context Size 131,072
Parameters 14.8B
Max Output 4,096
Llama 3 15B
llama3-15b-8k Minimum Subscription Basic
Context Size 8,192
Parameters 15B
Max Output 4,096
gpt-sw3-20b
0 models Minimum Subscription Basic
Context Size 2,048
Parameters 20.9B
Max Output 2,048
mistral-24b-lc
69 models Minimum Subscription Basic
Context Size 16,384
Parameters 24B
Max Output 4,096
mistral-24b-2503
4 models Minimum Subscription Basic
Context Size 16,384
Parameters 24B
Max Output 4,096
gemma3-27b
9 models Minimum Subscription Basic
Context Size 16,384
Parameters 27B
Max Output 4,096
gemma2-27b
0 models Minimum Subscription Basic
Context Size 16,384
Parameters 27B
Max Output 4,096
gemma3mm-27b
0 models Minimum Subscription Basic
Context Size 16,384
Parameters 27.4B
Max Output 4,096
Qwen 2 32B
qwen2-32b-lc Minimum Subscription Basic
Context Size 16,384
Parameters 32B
Max Output 4,096
qrwkv-32b-32k
5 models Minimum Subscription Basic
Context Size 32,768
Parameters 32B
Max Output 4,096
qwen15-32b-lc
14 models Minimum Subscription Basic
Context Size 16,384
Parameters 32B
Max Output 4,096
qwen25-32b-lc
108 models Minimum Subscription Basic
Context Size 16,384
Parameters 32B
Max Output 4,096
qwen3-32b
8 models Minimum Subscription Premium
Context Size 16,384
Parameters 32B
Max Output 4,096
glm4-32b
11 models Minimum Subscription Premium
Context Size 16,384
Parameters 32B
Max Output 4,096
qwen15-32b
0 models Minimum Subscription Premium
Context Size 32,768
Parameters 32.5B
Max Output 4,096
qwen25-32b
0 models Minimum Subscription Premium
Context Size 131,072
Parameters 32.8B
Max Output 4,096
Yi 1.5 34B
yi1.5-34b-lc Minimum Subscription Basic
Context Size 16,384
Parameters 34B
Max Output 4,096
rwkv6moe-37b-16k
1 models Minimum Subscription Basic
Context Size 16,384
Parameters 37B
Max Output 4,096
Llama 3 70B
llama3-70b-8k Minimum Subscription Premium
Context Size 8,192
Parameters 70B
Max Output 4,096
Llama 3.1 70B
llama31-70b-16k Minimum Subscription Premium
Context Size 16,384
Parameters 70B
Max Output 4,096
llama33-70b-16k
106 models Minimum Subscription Premium
Context Size 16,384
Parameters 70B
Max Output 4,096
Qwen 2 72B
qwen2-72b-lc Minimum Subscription Premium
Context Size 16,384
Parameters 72B
Max Output 4,096
qwen25-72b-lc
36 models Minimum Subscription Premium
Context Size 16,384
Parameters 72B
Max Output 4,096
qwerky7-72b-16k
0 models Minimum Subscription Basic
Context Size 16,384
Parameters 72B
Max Output 4,096
qrwkv-72b-32k
1 models Minimum Subscription Basic
Context Size 32,768
Parameters 72B
Max Output 4,096
qwrkv-72b-32k
0 models Minimum Subscription Basic
Context Size 32,768
Parameters 72B
Max Output 32,768
qwen15-72b
0 models Minimum Subscription Premium
Context Size 32,768
Parameters 72.3B
Max Output 4,096
qwen2-72b
0 models Minimum Subscription Premium
Context Size 131,072
Parameters 72.7B
Max Output 4,096
qwen25-72b
0 models Minimum Subscription Premium
Context Size 131,072
Parameters 72.7B
Max Output 4,096
qwen15-110b
0 models Minimum Subscription Premium
Context Size 32,768
Parameters 111.2B
Max Output 4,096
Mixtral 8x22B
mixtral-8x22b-lc Minimum Subscription Premium
Context Size 16,384
Parameters 141B
Max Output 4,096
Llama 3 405B
llama3-405b-lc Minimum Subscription Premium
Context Size 4,096
Parameters 405B
Max Output 4,096
deepseek-v3-lc
2 models Minimum Subscription Premium
Context Size 32,768
Parameters 685B
Max Output 4,096