/v1/models
Enumerating models
Models
The models endpoint lists every model in the catalogue, along with its current state. It can be called with or without authentication.
HTTP request
GET https://api.featherless.ai/v1/models
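As a rough illustration of the request above, the following Python sketch prepares the GET call with the standard library, with and without credentials. The helper names `build_request` and `list_models` are hypothetical; the `Bearer` header format and the `FEATHERLESS_API_KEY` variable follow the curl examples on this page.

```python
import json
import os
import urllib.request
from typing import Optional

MODELS_URL = "https://api.featherless.ai/v1/models"


def build_request(api_key: Optional[str] = None) -> urllib.request.Request:
    """Prepare (but do not send) a GET request for the models listing."""
    headers = {}
    if api_key:
        # Authentication is optional for this endpoint.
        headers["Authorization"] = f"Bearer {api_key}"
    return urllib.request.Request(MODELS_URL, headers=headers, method="GET")


def list_models(api_key: Optional[str] = None) -> list:
    """Send the request and return the "data" list from the response body."""
    with urllib.request.urlopen(build_request(api_key)) as response:
        return json.load(response)["data"]


# Inspect the prepared request without touching the network.
request = build_request(os.environ.get("FEATHERLESS_API_KEY"))
print(request.get_method(), request.full_url)
```

Calling `list_models()` performs the actual HTTP request; the module-level code only builds and prints the request so the sketch can be run offline.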
Response body
If successful, the response body will contain data with the following structure:
{
  "data": [
    {
      "id": "vicgalle/Roleplay-Llama-3-8B",
      "name": "vicgalle/Roleplay-Llama-3-8B",
      "model_class": "llama3-8b-8k",
      "context_length": 8192,
      "max_completion_tokens": 4096
    },
    // and more models ..
  ]
}
Use the id field as the model value when calling /v1/chat/completions or /v1/completions.
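A minimal sketch of reading this structure in Python: the sample body below repeats the single model from the example response, and the helper name `model_ids` is hypothetical.

```python
import json

# Sample body shaped like the documented response (one model only).
SAMPLE_BODY = """
{
  "data": [
    {
      "id": "vicgalle/Roleplay-Llama-3-8B",
      "name": "vicgalle/Roleplay-Llama-3-8B",
      "model_class": "llama3-8b-8k",
      "context_length": 8192,
      "max_completion_tokens": 4096
    }
  ]
}
"""


def model_ids(body: str) -> list:
    """Return the id of every model in a /v1/models response body."""
    payload = json.loads(body)
    return [model["id"] for model in payload["data"]]


# Each id can be passed as the model value of a completion request.
print(model_ids(SAMPLE_BODY))
```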
Model Attributes
| Parameter | Type | Description |
|---|---|---|
| id | string | Use this value when calling a completion endpoint |
| | string | |
| context_length | integer | Maximum context length (i.e. prompt + completion) supported by the model. Clients that call a completion endpoint with this model and a prompt of this length or longer should expect an error |
| max_completion_tokens | integer | Maximum amount of output that will be generated in a single call. Calls to completion endpoints with |
| | boolean | Will return Users must have ungated the model in their connected HuggingFace account before a completion call will be accepted for this model on Featherless. Otherwise the completion call will return Note that this is a property of the model. It does not report the currently authenticated user’s relationship to that model. |
| | boolean | Returned if and only if the request was authenticated. If the value is |
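The two integer limits can be checked on the client before a completion call is made. The sketch below is only an illustration under stated assumptions: the helper name `completion_budget` is hypothetical, and the prompt token count is assumed to come from a tokenizer that matches the model.

```python
def completion_budget(model: dict, prompt_tokens: int) -> int:
    """Return how many completion tokens a prompt leaves room for.

    Returns 0 when the prompt alone reaches the model's context length,
    which is the case where the caller should expect an error.
    """
    remaining = model["context_length"] - prompt_tokens
    if remaining <= 0:
        return 0
    # A single call never generates more than max_completion_tokens.
    return min(remaining, model["max_completion_tokens"])


model = {"context_length": 8192, "max_completion_tokens": 4096}
print(completion_budget(model, 1000))  # limited by max_completion_tokens
print(completion_budget(model, 6000))  # limited by the remaining context
print(completion_budget(model, 8192))  # prompt fills the whole context
```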
Query parameters
All query parameters are optional.
Multi-value filters can be passed as comma-separated values or repeated query parameters:
/v1/models?family=llama3,qwen2
/v1/models?family=llama3&family=qwen2
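Both styles can be produced with Python's standard `urllib.parse.urlencode`; this is a small sketch, and the variable names are illustrative only.

```python
from urllib.parse import urlencode

BASE_URL = "https://api.featherless.ai/v1/models"

families = ["llama3", "qwen2"]

# Style 1: comma-separated values; safe="," keeps the comma unescaped.
comma_query = urlencode({"family": ",".join(families)}, safe=",")

# Style 2: repeated parameters; doseq=True emits one pair per list item.
repeated_query = urlencode({"family": families}, doseq=True)

print(f"{BASE_URL}?{comma_query}")
print(f"{BASE_URL}?{repeated_query}")
```

Without `safe=","` the comma would be percent-encoded as `%2C`; whether the API accepts that encoded form is not stated here, so the sketch keeps it literal to match the examples above.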
Common parameters
q: Search by model name or id.
available_on_current_plan: Pass true or 1 to filter to models available on the authenticated user’s current plan.
tags: Require all listed tags to match. This is a legacy filter.
page: Page number for pagination.
per_page: Number of results per page. Maximum is 1000.
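The page and per_page parameters suggest a simple paging loop. The sketch below makes two assumptions that this page does not confirm: pages are numbered from 1 (as in the page=1 curl example), and a page shorter than per_page marks the end of the results. `fetch_page` is a hypothetical callable standing in for the HTTP request.

```python
from typing import Callable, Iterator


def iter_models(fetch_page: Callable[[int, int], list],
                per_page: int = 100) -> Iterator[dict]:
    """Yield models page by page until a page comes back short or empty.

    fetch_page(page, per_page) is a hypothetical callable that performs
    the request and returns the "data" list of the response.
    """
    if not 1 <= per_page <= 1000:
        raise ValueError("per_page must be between 1 and 1000")
    page = 1  # assumption: pages are numbered from 1
    while True:
        models = fetch_page(page, per_page)
        yield from models
        if len(models) < per_page:  # assumption: a short page is the last
            return
        page += 1


# Stand-in for the network call so the sketch runs offline.
CATALOGUE = [{"id": f"model-{i}"} for i in range(5)]


def fake_fetch(page: int, per_page: int) -> list:
    start = (page - 1) * per_page
    return CATALOGUE[start:start + per_page]


print([m["id"] for m in iter_models(fake_fetch, per_page=2)])
```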
Model filters
You can filter by model metadata:
license
family
model_class
status
languages
tasks
architectures
training
capabilities
modalities
domains
creative
content_flags
parameter_bucket
popularity_level
Examples:
/v1/models?capabilities=chat,tool-use
/v1/models?modalities=vision
/v1/models?family=llama3
/v1/models?languages=en,ja
Boolean filters
gated: true or false
conversational: true or false
Range filters
Use these to filter by context window:
context_length_min: Minimum context length in tokens.
context_length_max: Maximum context length in tokens.
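Boolean and range filters combine like any other query parameter. The following sketch builds such a URL in Python; `build_models_url` is a hypothetical helper, and the lowercase rendering of booleans follows the true/false values listed above.

```python
from urllib.parse import urlencode

BASE_URL = "https://api.featherless.ai/v1/models"


def build_models_url(**filters) -> str:
    """Build a /v1/models URL, skipping filters whose value is None."""
    params = {}
    for key, value in filters.items():
        if value is None:
            continue
        if isinstance(value, bool):
            # str(True) is "True" in Python; the filters expect lowercase.
            value = "true" if value else "false"
        params[key] = value
    query = urlencode(params)
    return f"{BASE_URL}?{query}" if query else BASE_URL


print(build_models_url(gated=False, conversational=True,
                       context_length_min=32768, context_length_max=None))
```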
Example request
curl "https://api.featherless.ai/v1/models?available_on_current_plan=true" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $FEATHERLESS_API_KEY"

curl -H "Authorization: Bearer $FEATHERLESS_API_KEY" \
  "https://api.featherless.ai/v1/models?capabilities=chat,tool-use&context_length_min=32768"

curl -H "Authorization: Bearer $FEATHERLESS_API_KEY" \
  "https://api.featherless.ai/v1/models?q=llama"

curl -H "Authorization: Bearer $FEATHERLESS_API_KEY" \
  "https://api.featherless.ai/v1/models?sort=-popularity&page=1&per_page=100"