Docs /Getting Started/Request Pricing and Credits

Request Pricing and Credits

How prepaid credits work for request-based API billing.

Request pricing lets your organization pay for API usage with prepaid credits. You choose a monthly credit amount, can add one-time credit top-ups when needed, and successful API requests reduce your balance based on model pricing and token usage.

How Request Pricing Works

  • Choose a monthly credit amount.

  • That amount is added to your Featherless credit balance each billing cycle after payment succeeds.

  • You can add a one-time credit top-up from the billing page without changing your monthly credit amount.

  • Successful API requests reduce your balance.

  • If your balance reaches zero or below, request-pricing API calls are blocked until more credits are available.

  • Credits do not expire.

How Request Costs Are Calculated

  • Only successful requests are charged.

  • Request cost is based on the model used, input tokens, and output tokens.

  • Formula: input tokens x input price + output tokens x output price.

  • Prices are listed per 1M tokens.

Model Prices

Request pricing is charged separately for input and output tokens. Prices are shown in USD per 1M tokens. Most models use the default model-class price. If any models have their own pricing, they are listed separately below.

Family Model Class Input / 1M tokens Output / 1M tokens
Apertusapertus-70b$0.875$2.075
Apertusapertus-8b$0.1078$0.28
Arcee MOEafmoe-399b$0.25$0.8
Bambabamba-9b$0.1078$0.28
Bertbge-large-en$0.01$0.02
Deepseek 3deepseek-v3-lc$0.4$1.59
Deepseek 3.1deepseek31-685b$0.285$1
Deepseek 3.2deepseek-v3.2$0.2995$0.45
Deepseek 4deepseek4-1.6t$1.6$3.2
Deepseek 4deepseek4-284b$0.1385$0.279
Ernie4 5ernie4_5-0b3$0.01$0.02
Ernie4 5 Moeernie4_5_moe-21b$0.08$0.4
Exaone4exaone4-32b$0.265$0.65
Falconfalcon-7b2$0.1$0.2
Falconfalcon-7b7$0.1$0.2
Gemmagemma-2b$0.08$0.4
GemmaGemma-2b$0.08$0.4
Gemmagemma-7b$0.1078$0.28
Gemma 2gemma2-27b$0.65$0.65
Gemma 2gemma2-2b$0.08$0.4
Gemma 3gemma3-0b2$0.01$0.02
Gemma 3gemma3-12b$0.05$0.15
Gemma 3gemma3-270m$0.01$0.02
Gemma 3gemma3-27b$0.1$0.3
Gemma 3gemma3-4b$0.05$0.1
Gemma 3gemma3-8b3$0.1078$0.28
Gemma 3gemma3t-12b$0.1078$0.28
Gemma 3gemma3t-1b$0.01$0.02
Gemma 3gemma3t-27b$0.265$0.65
Gemma 4gemma4-25b$0.1$0.2
Gemma 4gemma4-26b$0.13$0.4
Gemma 4gemma4-31b$0.14$0.4
Gemma 4gemma4-5b$0.1$0.2
Gemma 4gemma4-7b$0.1$0.2
Gemma 4gemma4-e2b$0.08$0.4
Gemma 4gemma4-e4b$0.1$0.2
GLM 4glm4-32b$0.265$0.65
GLM 4glm4-9b$0.1078$0.28
GLM 4.6glm46-357b$0.55$2.2
GLM 4.7glm47-357b$0.55$2.2
GLM 4.7glm47-flash$0.0653$0.4
GLM 5glm5-754b$0.95$3.15
GLM 5.1glm51-754b$1.3$4.3
GLM 5.2glm52-753b$1.39$4.4
Gpt Bigcodegptbigcode-1b$0.01$0.02
GPT OSSgpt-oss-120b$0.1$0.55
GPT OSSgpt-oss-20b$0.04$0.15
GPT OSSgptoss-21b$0.075$0.3
GPT-SW3gpt-sw3-126m$0.01$0.02
GPT-SW3gpt-sw3-1b3$0.01$0.02
GPT-SW3gpt-sw3-20b$0.265$0.65
GPT-SW3gpt-sw3-356m$0.01$0.02
GPT-SW3gpt-sw3-6b7$0.1$0.2
Gpt2gpt-sw3-40b$0.265$0.65
GPT2-SW3gpt2-sw3-126m$0.01$0.02
GPT2-SW3gpt2-sw3-6b7$0.1$0.2
Granitegranite-2b$0.08$0.4
Granitegranite-8b$0.1078$0.28
Granitemoegranitemoe-1b$0.01$0.02
Granitemoegranitemoe-3b$0.08$0.4
Hyperclovax Vlmhcxvision-3b$0.08$0.4
Internlm3internlm3-8b$0.1078$0.28
Kimi 2kimi-k2$0.6$2.5
Kimi 2.5kimi-k25$0.77$3.5
Kimi Linearkimi-linear-48b$0.08$0.4
Kimi Linearkimilinear-49b$0.08$0.4
LFM 2lfm2-0b3$0.01$0.02
LFM 2lfm2-0b7$0.01$0.02
LFM 2lfm2-1b$0.01$0.02
LFM 2lfm2-1b2$0.01$0.02
Lfm2 Moelfm2-moe-24b$0.265$0.65
Llama 2llama2-13b$0.4375$0.55
Llama 2llama2-34b$0.265$0.65
Llama 2llama2-70b$0.875$2.075
Llama 2llama2-7b$0.1$0.2
Llama 2llama2-solar-10b7$0.1078$0.28
Llama 2tinyllama-1b1$0.01$0.02
Llama 3llama-34b$0.265$0.65
Llama 3llama3-15b$0.1078$0.28
Llama 3llama3-70b$0.875$2.075
Llama 3llama3-8b$0.0925$0.095
Llama 3.1llama31-70b$0.72$0.72
Llama 3.1llama31-8b$0.035$0.065
Llama 3.2llama32-1b$0.027$0.201
Llama 3.2llama32-3b$0.0509$0.335
Llama 3.3llama33-70b$0.65$0.75
Mambamamba-0b7$0.01$0.02
Mellummellum-4b$0.1$0.2
Mimomimo-7b$0.1$0.2
Mimo 2mimo2-flash$0.1078$0.28
Mimo 2.5mimo25-311b$0.14$0.28
MiniMax 2minimax-m2$0.2775$1.11
MiniMax 2.1minimax-m21$0.3$1.2
MiniMax 2.5minimax-m25$0.295$1.2
Minimax M2minimax-m2-228b7$0.3$1.2
MiniMax M3minimax-m3$0.55$2.2
Mistralmistral-large$0.125$1.15
Mistralmistral-nemo$0.25$0.4
Mistralmistral-v01-7b$0.1$0.2
Mistralmistral-v02-7b$0.1$0.2
Mistralmixtral-8x22b$0.125$1.15
Mistral 3mistral-24b$0.05$0.08
Mistral 3mistral3-3b$0.08$0.4
Mistral 3.1mistral-24b-2503$0.2214$0.415
Nanbeigenanbeige41-3b$0.1$0.2
Nemotron 3nemotron3-120b$0.125$1.15
Nemotron Hnemotronh-31b$0.05$0.2
Nemotron-nasdecilm-49b$0.4$0.4
Nomic Bertnomic-v15$0.01$0.02
Olmoolmo-1b$0.01$0.02
OLMo 3olmo3-32b$0.265$0.65
OLMo 3olmo3-7b$0.1$0.2
Ouroouro-1b$0.01$0.02
Ouroouro-2b$0.08$0.4
Panguembeddedpanguembedded-8b$0.1$0.2
Phiphi-1b4$0.01$0.02
Phi 2phi2-3b$0.08$0.4
Phi 3phi3-4b$0.1$0.2
Phi 3phi3v-4b$0.1$0.2
Phi 4phi4-14b$0.07$0.14
Phi 4phi4-3b8$0.08$0.35
Phimoephimoe-42b$0.875$2.075
QRWKVqrwkv-32b-32k$0.265$0.65
QRWKVqrwkv-72b-32k$0.875$2.075
Qwenqwenlmheadmodel-7b$0.1$0.2
Qwen 1.5qwen15-0b5$0.01$0.02
Qwen 1.5qwen15-14b$0.1078$0.28
Qwen 1.5qwen15-1b8$0.01$0.02
Qwen 1.5qwen15-32b$0.265$0.65
Qwen 1.5qwen15-4b$0.1$0.2
Qwen 1.5qwen15-72b$0.875$2.075
Qwen 1.5qwen15-7b$0.1$0.2
Qwen 2qwen2-0b5$0.01$0.02
Qwen 2qwen2-14b-lc$0.1078$0.28
Qwen 2qwen2-1b5$0.01$0.02
Qwen 2qwen2-32b$0.265$0.65
Qwen 2qwen2-72b$3$5
Qwen 2qwen2-7b$0.1$0.2
Qwen 2.5qwen25-0b5$0.01$0.02
Qwen 2.5qwen25-14b$0.1078$0.28
Qwen 2.5qwen25-1b5$0.01$0.02
Qwen 2.5qwen25-32b$0.68$1.2
Qwen 2.5qwen25-3b$0.08$0.4
Qwen 2.5qwen25-72b$0.37$0.4
Qwen 2.5qwen25-7b$0.17$0.2
Qwen 2.5qwen25vl-32b$0.265$0.65
Qwen 2.5qwen25vl-3b$0.08$0.4
Qwen 2.5qwen25vl-72b$0.8$1
Qwen 2.5qwen25vl-7b$0.1$0.2
Qwen 3qwen3-0b6$0.01$0.02
Qwen 3qwen3-14b$0.12$0.24
Qwen 3qwen3-1b7$0.08$0.4
Qwen 3qwen3-235b$0.455$1.82
Qwen 3qwen3-32b$0.102$0.493
Qwen 3qwen3-4b$0.1$0.2
Qwen 3qwen3-8b$0.0835$0.4275
Qwen 3qwen3-coder-480b$0.38$1.55
Qwen 3qwen3-embedding-0b6$0.01$0.02
Qwen 3qwen3-embedding-4b$0.02$0
Qwen 3qwen3-embedding-8b$0.01$0
Qwen 3qwen3moe-30b$0.125$0.45
Qwen 3qwen3moe-80b$0.18$0.9
Qwen 3qwen3vl-235b$0.62$3.275
Qwen 3qwen3vl-2b$0.08$0.4
Qwen 3qwen3vl-32b$0.104$0.416
Qwen 3qwen3vl-4b$0.1$0.2
Qwen 3qwen3vl-8b$0.1078$0.9325
Qwen 3qwen3vlmoe-30b$0.175$0.65
Qwen 3 Nextqwen3next-80b$0.125$1.15
Qwen 3.5qwen3.5-27b$0.265$2.28
Qwen 3.5qwen3.5-2b$0.08$0.4
Qwen 3.5qwen3.5-397b$0.55$3.5
Qwen 3.5qwen3.5-4b$0.1$0.2
Qwen 3.5qwen3.5-9b$0.1$0.15
Qwen3 5qwen3_5-27b$0.265$0.65
Qwen3 5qwen3_5-2b$0.08$0.4
Qwen3 5qwen3_5-4b$0.1$0.2
Qwen3 5qwen3_5-9b$0.1078$0.28
Qwen3 5qwen3-5vl-27b8$0.32$2.7
Qwen3 5 Moeqwen3_5moe-35b$0.265$0.65
Qwen3 5 Moeqwen3-5-moevl-35b$0.1612$1
Qwen3 Moeqwen3moe-235b$0.23$2.3
RWKV 5rwkv5-7b$0.1$0.2
RWKV 6rwkv6-14b$0.1078$0.28
RWKV 6rwkv6-7b$0.1$0.2
RWKV 6rwkv6moe-37b$0.1078$0.28
Stablelmstablelm-2b$0.08$0.4
Step 3.5step-3.5-199b$0.1$0.3
Step3p7step3p7vl-201b4$0.2$1.15
Talkietalkie-13b$0.1078$0.28
Xlm-robertabge-m3$0.01$0.02

Model-Specific Pricing

These models use prices that differ from their default model class price.

Model Family Model Class Input / 1M tokens Output / 1M tokens
deepseek-ai/DeepSeek-V4-FlashDeepseek 4deepseek4-284b$0.14$0.28
deepseek-ai/DeepSeek-V4-ProDeepseek 4deepseek4-1.6t$1.6$3.2
Qwen/Qwen3-VL-30B-A3B-InstructQwen 3qwen3vlmoe-30b$0.15$0.5

Live Balance And Recent Usage

  • Your billing page shows your up-to-date live credit balance.

  • Recent usage may take a short time to appear in detailed usage views.

  • Usage activity shows spend, request count, input and output tokens, models used, and request-level details.

  • The billing page also lets eligible request-pricing organizations add one-time credits using the saved billing method on file.

Low Balance And Out-Of-Credits Emails

  • We send a low-balance email when your credit balance reaches $5.

  • We send an out-of-credits email when your balance reaches zero or below.

  • Alerts are sent to the organization billing email.

  • Alert settings can be changed from your billing page.

Choosing A Monthly Credit Amount

  • Start with the amount you expect to spend in a month.

  • You can increase your monthly amount later if usage grows.

  • If you are unsure, start with a smaller tier and monitor usage from the billing page.

  • For high-volume usage, you can add one-time credits from the billing page or contact support for custom needs.

Changing Your Monthly Credit Amount

  • Increasing your monthly credit amount takes effect immediately after successful payment.

  • The upgrade is prorated for the remaining time in your current billing cycle.

  • Your billing cycle date stays the same.

  • Decreasing your monthly credit amount is scheduled for your next billing cycle.

Switching From A Fixed Plan To Request Pricing

  • Switching from a fixed monthly plan to request pricing changes your organization to credit-based billing.

  • The change can take effect immediately after successful payment.

  • You can only switch to a monthly credit amount that is the same as or higher than your current fixed-plan monthly amount.

  • Switching from a fixed plan to a lower monthly credit amount is blocked.

  • Any remaining value from your current billing period may be applied as prorated credit when you switch.

  • Your billing cycle date stays the same where possible.

Switching From Request Pricing To A Fixed Plan

  • You can switch back to a fixed monthly plan.

  • Unused credits stay on your organization and do not expire.

  • Stored credits are only used while your organization is on request pricing.

Cancelling Request Pricing

  • Cancelling stops future monthly credit top-ups.

  • Existing credits remain available until used.

  • Once your credit balance reaches zero or below, request-pricing API calls are blocked unless you restart request pricing or add more credits.

Refunds And Adjustments

  • Refunds or account adjustments may reduce your credit balance.

  • Contact support if you have questions about a balance change.

Last edited: Jul 2, 2026