Overview

Access open-weight AI models through serverless APIs

Featherless AI is a serverless AI inference platform. Our goal is to make all AI models available for serverless inference and we’ve started with large language models (e.g. Qwen, Llama, Mistral, DeepSeek, RWKV). We provide inference via API to a continually expanding library of open-weight models, including the most popular models for role-playing, creative writing, coding assistance, and more.

Looking to chat with our models on Featherless.ai? Visit Phoenix!

Introducing QRWKV - Our latest linear-transformer model. Learn more in our blog post

Introduction

If you’re new to Featherless, start here to learn the essentials and make your first API call.

Developer with Featherless

Support

Get help and connect with the Featherless community through our support channels

Last edited: Jun 10, 2025