Compatible Applications

These are some of the applications that support our API, if you would like to add your application to this list, please contact us.

WyvernChat

An AI Character Card Repository and Chat platform.

SillyTavern

LLM Frontend for Power Users.

novelcrafter

A platform that allows users to utilize AI to help them craft stories.

KoboldAI Lite

A lightweight web interface for chatting and prompting LLMs.

anime.gf

moemoemoemoemoemoe

Agnaistic

A lightweight web interface for chatting with LLMs.

Venus AI

Chatbot roleplay.

Janitor AI

Chatbot repository and chatting site.

RisuAI

Make your own story. User-friendly software for LLM roleplaying

Model Types

Our ever-growing list of supported model types, including LLaMA-3, Mistral, and more.

gpt-sw3-126m
0 models
Minimum Subscription Basic
Context Size 2,048
Parameters 0.2B
Max Output 2,048
gpt2-sw3-126m
2 models
Minimum Subscription Basic
Context Size 2,048
Parameters 0.2B
Max Output 2,048
qwen2-0b5
1965 models
Minimum Subscription Basic
Context Size 131,072
Parameters 0.5B
Max Output 4,096
qwen25-0b5
156 models
Minimum Subscription Basic
Context Size 32,768
Parameters 0.5B
Max Output 4,096
gpt-sw3-356m
3 models
Minimum Subscription Basic
Context Size 2,048
Parameters 0.5B
Max Output 2,048
qwen15-0b5
2 models
Minimum Subscription Basic
Context Size 32,768
Parameters 0.6B
Max Output 4,096
qwen3-0b6
19 models
Minimum Subscription Basic
Context Size 40,960
Parameters 0.8B
Max Output 4,096
gemma3-1b
0 models
Minimum Subscription Basic
Context Size 32,768
Parameters 1B
Max Output 4,096
llama32-1b
2318 models
Minimum Subscription Basic
Context Size 32,768
Parameters 1B
Max Output 4,096
gemma3t-1b
38 models
Minimum Subscription Basic
Context Size 32,768
Parameters 1B
Max Output 4,096
tinyllama-1b1
495 models
Minimum Subscription Basic
Context Size 2,048
Parameters 1.1B
Max Output 2,048
gpt-sw3-1b3
2 models
Minimum Subscription Basic
Context Size 2,048
Parameters 1.4B
Max Output 2,048
phi-1b4
4 models
Minimum Subscription Basic
Context Size 2,048
Parameters 1.4B
Max Output 2,048
qwen2-1b5
19 models
Minimum Subscription Basic
Context Size 131,072
Parameters 1.5B
Max Output 4,096
qwen25-1b5
101 models
Minimum Subscription Basic
Context Size 131,072
Parameters 1.5B
Max Output 4,096
qwen15-1b8
49 models
Minimum Subscription Basic
Context Size 32,768
Parameters 1.8B
Max Output 4,096
qwen3-1b7
21 models
Minimum Subscription Basic
Context Size 40,960
Parameters 2B
Max Output 4,096
gemma-2b
14 models
Minimum Subscription Basic
Context Size 8,192
Parameters 2.5B
Max Output 4,096
gemma2-2b
819 models
Minimum Subscription Basic
Context Size 8,192
Parameters 2.6B
Max Output 4,096
phi2-3b
20 models
Minimum Subscription Basic
Context Size 2,048
Parameters 3B
Max Output 2,048
qwen25-3b
53 models
Minimum Subscription Basic
Context Size 32,768
Parameters 3.1B
Max Output 4,096
llama32-3b
203 models
Minimum Subscription Basic
Context Size 32,768
Parameters 3.2B
Max Output 4,096
phi4-3b8
7 models
Minimum Subscription Basic
Context Size 131,072
Parameters 3.8B
Max Output 4,096
qwen15-4b
0 models
Minimum Subscription Basic
Context Size 32,768
Parameters 4B
Max Output 4,096
qwen3-4b
144 models
Minimum Subscription Basic
Context Size 40,960
Parameters 4B
Max Output 4,096
phi3-4b
58 models
Minimum Subscription Basic
Context Size 4,096
Parameters 4B
Max Output 4,096
mellum-4b
7 models
Minimum Subscription Basic
Context Size 32,768
Parameters 4B
Max Output 4,096
gemma3-4b
60 models
Minimum Subscription Basic
Context Size 32,768
Parameters 4.3B
Max Output 4,096
RWKV
rwkv5-7b
7 models
Minimum Subscription Basic
Context Size 16,384
Parameters 7B
Max Output 4,096
Mistral v0.2 7B
mistral-v02-7b-std-lc
0 models
Minimum Subscription Basic
Context Size 8,192
Parameters 7B
Max Output 4,096
rwkv6-7b-16k
0 models
Minimum Subscription Basic
Context Size 16,384
Parameters 7B
Max Output 4,096
qwen25-7b-lc
0 models
Minimum Subscription Basic
Context Size 32,768
Parameters 7B
Max Output 4,096
qwen2-7b-lc
0 models
Minimum Subscription Basic
Context Size 32,768
Parameters 7B
Max Output 4,096
llama2-7b
23 models
Minimum Subscription Basic
Context Size 4,096
Parameters 7B
Max Output 4,096
mistral-v01-7b
23 models
Minimum Subscription Basic
Context Size 4,096
Parameters 7B
Max Output 4,096
rwkv6-7b
1 models
Minimum Subscription Basic
Context Size 16,384
Parameters 7B
Max Output 4,096
mistral-v02-7b
695 models
Minimum Subscription Basic
Context Size 8,192
Parameters 7B
Max Output 4,096
gpt-sw3-6b7
0 models
Minimum Subscription Basic
Context Size 2,048
Parameters 7.1B
Max Output 2,048
gpt2-sw3-6b7
3 models
Minimum Subscription Basic
Context Size 2,048
Parameters 7.1B
Max Output 2,048
qwen2-7b
200 models
Minimum Subscription Basic
Context Size 131,072
Parameters 7.6B
Max Output 4,096
qwen25-7b
418 models
Minimum Subscription Basic
Context Size 131,072
Parameters 7.6B
Max Output 4,096
qwen15-7b
0 models
Minimum Subscription Basic
Context Size 32,768
Parameters 7.7B
Max Output 4,096
Llama 3.1 8B
llama31-8b-16k
0 models
Minimum Subscription Basic
Context Size 32,768
Parameters 8B
Max Output 4,096
qwen3-8b
123 models
Minimum Subscription Basic
Context Size 32,768
Parameters 8B
Max Output 4,096
llama31-8b
1068 models
Minimum Subscription Basic
Context Size 32,768
Parameters 8B
Max Output 4,096
llama3-8b
1078 models
Minimum Subscription Basic
Context Size 8,192
Parameters 8B
Max Output 4,096
gemma-7b
1 models
Minimum Subscription Basic
Context Size 8,192
Parameters 8.5B
Max Output 4,096
gemma2-9b
83 models
Minimum Subscription Basic
Context Size 16,384
Parameters 9B
Max Output 4,096
glm4-9b
6 models
Minimum Subscription Basic
Context Size 32,768
Parameters 9B
Max Output 4,096
Llama 2 Solar 10B
llama2-solar-10b7-4k
0 models
Minimum Subscription Basic
Context Size 4,096
Parameters 10.7B
Max Output 4,096
llama2-10b7
0 models
Minimum Subscription Basic
Context Size 4,096
Parameters 10.7B
Max Output 4,096
llama2-solar-10b7
298 models
Minimum Subscription Basic
Context Size 4,096
Parameters 10.7B
Max Output 4,096
gemma3-12b
64 models
Minimum Subscription Basic
Context Size 32,768
Parameters 12B
Max Output 4,096
mistral-nemo
288 models
Minimum Subscription Basic
Context Size 32,768
Parameters 12B
Max Output 4,096
gemma3t-12b
3 models
Minimum Subscription Basic
Context Size 32,768
Parameters 12.2B
Max Output 4,096
Llama 2 13B
llama2-13b-4k
0 models
Minimum Subscription Basic
Context Size 4,096
Parameters 13B
Max Output 4,096
llama2-13b
257 models
Minimum Subscription Basic
Context Size 4,096
Parameters 13B
Max Output 4,096
rwkv6-14b-16k
0 models
Minimum Subscription Basic
Context Size 16,384
Parameters 14B
Max Output 4,096
qwen25-14b-lc
0 models
Minimum Subscription Basic
Context Size 32,768
Parameters 14B
Max Output 4,096
qwen2-14b-lc
57 models
Minimum Subscription Basic
Context Size 32,768
Parameters 14B
Max Output 4,096
qwen3-14b
61 models
Minimum Subscription Basic
Context Size 32,768
Parameters 14B
Max Output 4,096
rwkv6-14b
1 models
Minimum Subscription Basic
Context Size 16,384
Parameters 14B
Max Output 4,096
qwen15-14b
1 models
Minimum Subscription Basic
Context Size 32,768
Parameters 14.2B
Max Output 4,096
qwen25-14b
113 models
Minimum Subscription Basic
Context Size 131,072
Parameters 14.8B
Max Output 4,096
Llama 3 15B
llama3-15b-8k
0 models
Minimum Subscription Basic
Context Size 8,192
Parameters 15B
Max Output 4,096
llama3-15b
16 models
Minimum Subscription Basic
Context Size 8,192
Parameters 15B
Max Output 4,096
gpt-sw3-20b
0 models
Minimum Subscription Basic
Context Size 2,048
Parameters 20.9B
Max Output 2,048
mistral-24b-2503
20 models
Minimum Subscription Basic
Context Size 32,768
Parameters 24B
Max Output 4,096
mistral-24b
136 models
Minimum Subscription Basic
Context Size 32,768
Parameters 24B
Max Output 4,096
gemma3-27b
42 models
Minimum Subscription Basic
Context Size 32,768
Parameters 27B
Max Output 4,096
gemma2-27b
24 models
Minimum Subscription Basic
Context Size 32,768
Parameters 27B
Max Output 4,096
gemma3t-27b
1 models
Minimum Subscription Basic
Context Size 32,768
Parameters 27B
Max Output 4,096
qwen3-coder-30b
0 models
Minimum Subscription Premium
Context Size 1,000,000
Parameters 30B
Max Output 32,768
qwen3moe-30b
1 models
Minimum Subscription Premium
Context Size 32,768
Parameters 30B
Max Output 32,768
Qwen 2 32B
qwen2-32b-lc
0 models
Minimum Subscription Basic
Context Size 32,768
Parameters 32B
Max Output 4,096
qrwkv-32b-32k
5 models
Minimum Subscription Basic
Context Size 32,768
Parameters 32B
Max Output 4,096
qwen15-32b-lc
0 models
Minimum Subscription Basic
Context Size 32,768
Parameters 32B
Max Output 4,096
qwen25-32b-lc
0 models
Minimum Subscription Basic
Context Size 32,768
Parameters 32B
Max Output 4,096
qwen3-32b
41 models
Minimum Subscription Basic
Context Size 32,768
Parameters 32B
Max Output 4,096
glm4-32b
15 models
Minimum Subscription Basic
Context Size 32,768
Parameters 32B
Max Output 4,096
qwen2-32b
73 models
Minimum Subscription Basic
Context Size 32,768
Parameters 32B
Max Output 4,096
qwen15-32b
14 models
Minimum Subscription Basic
Context Size 32,768
Parameters 32.5B
Max Output 4,096
qwen25-32b
159 models
Minimum Subscription Basic
Context Size 131,072
Parameters 32.8B
Max Output 4,096
Yi 1.5 34B
yi1.5-34b-lc
1 models
Minimum Subscription Basic
Context Size 16,384
Parameters 34B
Max Output 4,096
rwkv6moe-37b-16k
0 models
Minimum Subscription Basic
Context Size 16,384
Parameters 37B
Max Output 4,096
rwkv6moe-37b
1 models
Minimum Subscription Basic
Context Size 16,384
Parameters 37B
Max Output 4,096
Llama 3 70B
llama3-70b-8k
0 models
Minimum Subscription Premium
Context Size 8,192
Parameters 70B
Max Output 4,096
Llama 3.1 70B
llama31-70b-16k
0 models
Minimum Subscription Premium
Context Size 32,768
Parameters 70B
Max Output 4,096
llama33-70b-16k
1 models
Minimum Subscription Premium
Context Size 32,768
Parameters 70B
Max Output 4,096
llama31-70b
155 models
Minimum Subscription Premium
Context Size 32,768
Parameters 70B
Max Output 4,096
llama33-70b
148 models
Minimum Subscription Premium
Context Size 32,768
Parameters 70B
Max Output 4,096
llama3-70b
111 models
Minimum Subscription Premium
Context Size 8,192
Parameters 70B
Max Output 4,096
Qwen 2 72B
qwen2-72b-lc
0 models
Minimum Subscription Premium
Context Size 32,768
Parameters 72B
Max Output 4,096
qwen25-72b-lc
0 models
Minimum Subscription Premium
Context Size 32,768
Parameters 72B
Max Output 4,096
qrwkv-72b-32k
1 models
Minimum Subscription Basic
Context Size 65,536
Parameters 72B
Max Output 4,096
qwrkv-72b-32k
0 models
Minimum Subscription Basic
Context Size 32,768
Parameters 72B
Max Output 32,768
qrwkv-72b
0 models
Minimum Subscription Basic
Context Size 16,384
Parameters 72B
Max Output 4,096
qwen15-72b
0 models
Minimum Subscription Premium
Context Size 32,768
Parameters 72.3B
Max Output 4,096
qwen2-72b
53 models
Minimum Subscription Premium
Context Size 131,072
Parameters 72.7B
Max Output 4,096
qwen25-72b
51 models
Minimum Subscription Premium
Context Size 131,072
Parameters 72.7B
Max Output 4,096
qwen15-110b
0 models
Minimum Subscription Premium
Context Size 32,768
Parameters 111.2B
Max Output 4,096
gpt-oss-120b
1 models
Minimum Subscription Basic
Context Size 16,384
Parameters 120B
Max Output 4,096
Mixtral 8x22B
mixtral-8x22b-lc
0 models
Minimum Subscription Premium
Context Size 16,384
Parameters 141B
Max Output 4,096
mixtral-8x22b
1 models
Minimum Subscription Premium
Context Size 32,768
Parameters 141B
Max Output 4,096
glm-45
0 models
Minimum Subscription Premium
Context Size 32,768
Parameters 357B
Max Output 32,768
glm46-357b
1 models
Minimum Subscription Premium
Context Size 32,768
Parameters 357B
Max Output 32,768
Llama 3 405B
llama3-405b-lc
0 models
Minimum Subscription Premium
Context Size 4,096
Parameters 405B
Max Output 4,096
qwen3-coder-480b
0 models
Minimum Subscription Premium
Context Size 1,000,000
Parameters 480B
Max Output 32,768
deepseek-v3-lc
2 models
Minimum Subscription Premium
Context Size 32,768
Parameters 685B
Max Output 4,096
kimi-k2
3 models
Minimum Subscription Premium
Context Size 32,768
Parameters 1,000B
Max Output 32,768
ling2-1t
1 models
Minimum Subscription Premium
Context Size 32,768
Parameters 1,000B
Max Output 4,096