Compatible Applications

These are some of the applications that support our API. If you would like to add your application to this list, please contact us.

WyvernChat

An AI Character Card Repository and Chat platform.

SillyTavern

LLM Frontend for Power Users.

novelcrafter

A platform that allows users to utilize AI to help them craft stories.

KoboldAI Lite

A lightweight web interface for chatting and prompting LLMs.

anime.gf

moemoemoemoemoemoe

Agnaistic

A lightweight web interface for chatting with LLMs.

Venus AI

Chatbot roleplay.

Janitor AI

Chatbot repository and chatting site.

RisuAI

Make your own story. User-friendly software for LLM roleplaying.

Model Types

Our list of supported model types keeps growing and includes Llama 3, Mistral, and more.

gpt-sw3-126m
0 models
Minimum Subscription Basic
Context Size 2,048
Parameters 0.2B
Max Output 2,048
gpt2-sw3-126m
2 models
Minimum Subscription Basic
Context Size 2,048
Parameters 0.2B
Max Output 2,048
qwen2-0b5
1979 models
Minimum Subscription Basic
Context Size 131,072
Parameters 0.5B
Max Output 4,096
qwen25-0b5
156 models
Minimum Subscription Basic
Context Size 32,768
Parameters 0.5B
Max Output 4,096
gpt-sw3-356m
5 models
Minimum Subscription Basic
Context Size 2,048
Parameters 0.5B
Max Output 2,048
qwen15-0b5
2 models
Minimum Subscription Basic
Context Size 32,768
Parameters 0.6B
Max Output 4,096
qwen3-0b6
31 models
Minimum Subscription Basic
Context Size 40,960
Parameters 0.8B
Max Output 4,096
gemma3-1b
0 models
Minimum Subscription Basic
Context Size 32,768
Parameters 1B
Max Output 4,096
llama32-1b
2323 models
Minimum Subscription Basic
Context Size 32,768
Parameters 1B
Max Output 4,096
gemma3t-1b
40 models
Minimum Subscription Basic
Context Size 32,768
Parameters 1B
Max Output 4,096
tinyllama-1b1
540 models
Minimum Subscription Basic
Context Size 2,048
Parameters 1.1B
Max Output 2,048
gpt-sw3-1b3
3 models
Minimum Subscription Basic
Context Size 2,048
Parameters 1.4B
Max Output 2,048
phi-1b4
4 models
Minimum Subscription Basic
Context Size 2,048
Parameters 1.4B
Max Output 2,048
qwen2-1b5
35 models
Minimum Subscription Basic
Context Size 131,072
Parameters 1.5B
Max Output 4,096
qwen25-1b5
101 models
Minimum Subscription Basic
Context Size 131,072
Parameters 1.5B
Max Output 4,096
qwen15-1b8
49 models
Minimum Subscription Basic
Context Size 32,768
Parameters 1.8B
Max Output 4,096
qwen3-1b7
29 models
Minimum Subscription Basic
Context Size 40,960
Parameters 2B
Max Output 4,096
gemma-2b
14 models
Minimum Subscription Basic
Context Size 8,192
Parameters 2.5B
Max Output 4,096
gemma2-2b
819 models
Minimum Subscription Basic
Context Size 8,192
Parameters 2.6B
Max Output 4,096
phi2-3b
20 models
Minimum Subscription Basic
Context Size 2,048
Parameters 3B
Max Output 2,048
qwen25-3b
65 models
Minimum Subscription Basic
Context Size 32,768
Parameters 3.1B
Max Output 4,096
llama32-3b
221 models
Minimum Subscription Basic
Context Size 32,768
Parameters 3.2B
Max Output 4,096
phi4-3b8
8 models
Minimum Subscription Basic
Context Size 131,072
Parameters 3.8B
Max Output 4,096
qwen15-4b
1 model
Minimum Subscription Basic
Context Size 32,768
Parameters 4B
Max Output 4,096
qwen3-4b
204 models
Minimum Subscription Basic
Context Size 40,960
Parameters 4B
Max Output 4,096
phi3-4b
60 models
Minimum Subscription Basic
Context Size 4,096
Parameters 4B
Max Output 4,096
mellum-4b
8 models
Minimum Subscription Basic
Context Size 32,768
Parameters 4B
Max Output 4,096
gemma3-4b
68 models
Minimum Subscription Basic
Context Size 32,768
Parameters 4.3B
Max Output 4,096
RWKV
rwkv5-7b
7 models
Minimum Subscription Basic
Context Size 16,384
Parameters 7B
Max Output 4,096
Mistral v0.2 7B
mistral-v02-7b-std-lc
0 models
Minimum Subscription Basic
Context Size 8,192
Parameters 7B
Max Output 4,096
rwkv6-7b-16k
0 models
Minimum Subscription Basic
Context Size 16,384
Parameters 7B
Max Output 4,096
qwen25-7b-lc
0 models
Minimum Subscription Basic
Context Size 32,768
Parameters 7B
Max Output 4,096
qwen2-7b-lc
0 models
Minimum Subscription Basic
Context Size 32,768
Parameters 7B
Max Output 4,096
llama2-7b
2910 models
Minimum Subscription Basic
Context Size 4,096
Parameters 7B
Max Output 4,096
mistral-v01-7b
264 models
Minimum Subscription Basic
Context Size 4,096
Parameters 7B
Max Output 4,096
rwkv6-7b
1 model
Minimum Subscription Basic
Context Size 16,384
Parameters 7B
Max Output 4,096
mistral-v02-7b
695 models
Minimum Subscription Basic
Context Size 8,192
Parameters 7B
Max Output 4,096
gpt-sw3-6b7
0 models
Minimum Subscription Basic
Context Size 2,048
Parameters 7.1B
Max Output 2,048
gpt2-sw3-6b7
3 models
Minimum Subscription Basic
Context Size 2,048
Parameters 7.1B
Max Output 2,048
qwen2-7b
243 models
Minimum Subscription Basic
Context Size 131,072
Parameters 7.6B
Max Output 4,096
qwen25-7b
418 models
Minimum Subscription Basic
Context Size 131,072
Parameters 7.6B
Max Output 4,096
qwen15-7b
1 model
Minimum Subscription Basic
Context Size 32,768
Parameters 7.7B
Max Output 4,096
Llama 3.1 8B
llama31-8b-16k
0 models
Minimum Subscription Basic
Context Size 32,768
Parameters 8B
Max Output 4,096
qwen3-8b
151 models
Minimum Subscription Basic
Context Size 32,768
Parameters 8B
Max Output 4,096
llama31-8b
1101 models
Minimum Subscription Basic
Context Size 32,768
Parameters 8B
Max Output 4,096
llama3-8b
1129 models
Minimum Subscription Basic
Context Size 8,192
Parameters 8B
Max Output 4,096
gemma-7b
3 models
Minimum Subscription Basic
Context Size 8,192
Parameters 8.5B
Max Output 4,096
gemma2-9b
88 models
Minimum Subscription Basic
Context Size 16,384
Parameters 9B
Max Output 4,096
glm4-9b
7 models
Minimum Subscription Basic
Context Size 32,768
Parameters 9B
Max Output 4,096
Llama 2 Solar 10B
llama2-solar-10b7-4k
0 models
Minimum Subscription Basic
Context Size 4,096
Parameters 10.7B
Max Output 4,096
llama2-10b7
0 models
Minimum Subscription Basic
Context Size 4,096
Parameters 10.7B
Max Output 4,096
llama2-solar-10b7
299 models
Minimum Subscription Basic
Context Size 4,096
Parameters 10.7B
Max Output 4,096
gemma3-12b
71 models
Minimum Subscription Basic
Context Size 32,768
Parameters 12B
Max Output 4,096
mistral-nemo
326 models
Minimum Subscription Basic
Context Size 32,768
Parameters 12B
Max Output 4,096
gemma3t-12b
3 models
Minimum Subscription Basic
Context Size 32,768
Parameters 12.2B
Max Output 4,096
Llama 2 13B
llama2-13b-4k
0 models
Minimum Subscription Basic
Context Size 4,096
Parameters 13B
Max Output 4,096
llama2-13b
1401 models
Minimum Subscription Basic
Context Size 4,096
Parameters 13B
Max Output 4,096
rwkv6-14b-16k
0 models
Minimum Subscription Basic
Context Size 16,384
Parameters 14B
Max Output 4,096
qwen25-14b-lc
0 models
Minimum Subscription Basic
Context Size 32,768
Parameters 14B
Max Output 4,096
qwen2-14b-lc
57 models
Minimum Subscription Basic
Context Size 32,768
Parameters 14B
Max Output 4,096
qwen3-14b
75 models
Minimum Subscription Basic
Context Size 32,768
Parameters 14B
Max Output 4,096
rwkv6-14b
1 model
Minimum Subscription Basic
Context Size 16,384
Parameters 14B
Max Output 4,096
qwen15-14b
1 model
Minimum Subscription Basic
Context Size 32,768
Parameters 14.2B
Max Output 4,096
qwen25-14b
127 models
Minimum Subscription Basic
Context Size 131,072
Parameters 14.8B
Max Output 4,096
Llama 3 15B
llama3-15b-8k
0 models
Minimum Subscription Basic
Context Size 8,192
Parameters 15B
Max Output 4,096
llama3-15b
16 models
Minimum Subscription Basic
Context Size 8,192
Parameters 15B
Max Output 4,096
gpt-sw3-20b
2 models
Minimum Subscription Basic
Context Size 2,048
Parameters 20.9B
Max Output 2,048
mistral-24b-2503
20 models
Minimum Subscription Basic
Context Size 32,768
Parameters 24B
Max Output 4,096
mistral-24b
160 models
Minimum Subscription Basic
Context Size 32,768
Parameters 24B
Max Output 4,096
gemma3-27b
49 models
Minimum Subscription Basic
Context Size 32,768
Parameters 27B
Max Output 4,096
gemma2-27b
24 models
Minimum Subscription Basic
Context Size 32,768
Parameters 27B
Max Output 4,096
gemma3t-27b
1 model
Minimum Subscription Basic
Context Size 32,768
Parameters 27B
Max Output 4,096
qwen3-coder-30b
0 models
Minimum Subscription Premium
Context Size 1,000,000
Parameters 30B
Max Output 32,768
qwen3moe-30b
1 model
Minimum Subscription Premium
Context Size 32,768
Parameters 30B
Max Output 32,768
Qwen 2 32B
qwen2-32b-lc
0 models
Minimum Subscription Basic
Context Size 32,768
Parameters 32B
Max Output 4,096
qrwkv-32b-32k
5 models
Minimum Subscription Basic
Context Size 32,768
Parameters 32B
Max Output 4,096
qwen15-32b-lc
0 models
Minimum Subscription Basic
Context Size 32,768
Parameters 32B
Max Output 4,096
qwen25-32b-lc
0 models
Minimum Subscription Basic
Context Size 32,768
Parameters 32B
Max Output 4,096
qwen3-32b
49 models
Minimum Subscription Basic
Context Size 32,768
Parameters 32B
Max Output 4,096
glm4-32b
15 models
Minimum Subscription Basic
Context Size 32,768
Parameters 32B
Max Output 4,096
qwen2-32b
73 models
Minimum Subscription Basic
Context Size 32,768
Parameters 32B
Max Output 4,096
qwen15-32b
14 models
Minimum Subscription Basic
Context Size 32,768
Parameters 32.5B
Max Output 4,096
qwen25-32b
168 models
Minimum Subscription Basic
Context Size 131,072
Parameters 32.8B
Max Output 4,096
Yi 1.5 34B
yi1.5-34b-lc
1 model
Minimum Subscription Basic
Context Size 16,384
Parameters 34B
Max Output 4,096
rwkv6moe-37b-16k
0 models
Minimum Subscription Basic
Context Size 16,384
Parameters 37B
Max Output 4,096
rwkv6moe-37b
1 model
Minimum Subscription Basic
Context Size 16,384
Parameters 37B
Max Output 4,096
Llama 3 70B
llama3-70b-8k
0 models
Minimum Subscription Premium
Context Size 8,192
Parameters 70B
Max Output 4,096
Llama 3.1 70B
llama31-70b-16k
0 models
Minimum Subscription Premium
Context Size 32,768
Parameters 70B
Max Output 4,096
llama31-70b
170 models
Minimum Subscription Premium
Context Size 32,768
Parameters 70B
Max Output 4,096
llama33-70b
149 models
Minimum Subscription Premium
Context Size 32,768
Parameters 70B
Max Output 4,096
llama3-70b
121 models
Minimum Subscription Premium
Context Size 8,192
Parameters 70B
Max Output 4,096
Qwen 2 72B
qwen2-72b-lc
0 models
Minimum Subscription Premium
Context Size 32,768
Parameters 72B
Max Output 4,096
qwen25-72b-lc
0 models
Minimum Subscription Premium
Context Size 32,768
Parameters 72B
Max Output 4,096
qrwkv-72b-32k
1 model
Minimum Subscription Basic
Context Size 65,536
Parameters 72B
Max Output 4,096
qwrkv-72b-32k
0 models
Minimum Subscription Basic
Context Size 32,768
Parameters 72B
Max Output 32,768
qrwkv-72b
0 models
Minimum Subscription Basic
Context Size 16,384
Parameters 72B
Max Output 4,096
qwen15-72b
0 models
Minimum Subscription Premium
Context Size 32,768
Parameters 72.3B
Max Output 4,096
qwen2-72b
56 models
Minimum Subscription Premium
Context Size 131,072
Parameters 72.7B
Max Output 4,096
qwen25-72b
51 models
Minimum Subscription Premium
Context Size 131,072
Parameters 72.7B
Max Output 4,096
qwen15-110b
0 models
Minimum Subscription Premium
Context Size 32,768
Parameters 111.2B
Max Output 4,096
gpt-oss-120b
1 model
Minimum Subscription Basic
Context Size 16,384
Parameters 120B
Max Output 4,096
mistral-large
1 model
Minimum Subscription Premium
Context Size 32,768
Parameters 123B
Max Output 32,768
Mixtral 8x22B
mixtral-8x22b-lc
0 models
Minimum Subscription Premium
Context Size 16,384
Parameters 141B
Max Output 4,096
mixtral-8x22b
1 model
Minimum Subscription Premium
Context Size 32,768
Parameters 141B
Max Output 4,096
minimax-m2
1 model
Minimum Subscription Premium
Context Size 32,768
Parameters 229B
Max Output 32,768
glm-45
0 models
Minimum Subscription Premium
Context Size 32,768
Parameters 357B
Max Output 32,768
glm46-357b
1 model
Minimum Subscription Premium
Context Size 32,768
Parameters 357B
Max Output 32,768
glm47-358b
1 model
Minimum Subscription Premium
Context Size 32,768
Parameters 358B
Max Output 32,768
Llama 3 405B
llama3-405b-lc
0 models
Minimum Subscription Premium
Context Size 4,096
Parameters 405B
Max Output 4,096
qwen3-coder-480b
0 models
Minimum Subscription Premium
Context Size 1,000,000
Parameters 480B
Max Output 32,768
deepseek-v3-lc
2 models
Minimum Subscription Premium
Context Size 32,768
Parameters 685B
Max Output 32,768
kimi-k2
3 models
Minimum Subscription Premium
Context Size 32,768
Parameters 1,000B
Max Output 32,768
ling2-1t
1 model
Minimum Subscription Premium
Context Size 32,768
Parameters 1,000B
Max Output 4,096
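
As a rough illustration of how the fields in the list above could be used together, the sketch below picks the model types that fit a request. It is not part of any official client. The `ModelType` class, the `TIER_RANK` table, and the `usable_types` helper are hypothetical names, and the sketch rests on three assumptions that the list itself does not state: that a Premium subscription also covers Basic model types, that prompt and output tokens share the context window, and that Max Output caps the generated tokens. The sample rows are copied from the list.

```python
from dataclasses import dataclass

# Assumption: Premium includes everything available on Basic.
TIER_RANK = {"Basic": 0, "Premium": 1}


@dataclass(frozen=True)
class ModelType:
    """One card from the list (hypothetical representation)."""
    name: str
    min_subscription: str
    context_size: int  # tokens
    max_output: int    # tokens


# A few rows copied from the list above.
MODEL_TYPES = [
    ModelType("llama2-7b", "Basic", 4_096, 4_096),
    ModelType("llama31-8b", "Basic", 32_768, 4_096),
    ModelType("qwen25-7b", "Basic", 131_072, 4_096),
    ModelType("llama33-70b", "Premium", 32_768, 4_096),
    ModelType("kimi-k2", "Premium", 32_768, 32_768),
]


def usable_types(types, subscription, prompt_tokens, output_tokens):
    """Return the names of model types that can serve the request.

    Assumes prompt and output share the context window and that the
    output is additionally capped by the Max Output value.
    """
    rank = TIER_RANK[subscription]
    return [
        t.name
        for t in types
        if TIER_RANK[t.min_subscription] <= rank
        and output_tokens <= t.max_output
        and prompt_tokens + output_tokens <= t.context_size
    ]
```

For example, `usable_types(MODEL_TYPES, "Basic", 6000, 1000)` keeps only the Basic entries whose context size can hold 7,000 tokens. If the service counts tokens differently, the two size checks would need adjusting.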