nm-testing/Qwen2-0.5B-Instruct-FP8-SkipQKV

TEXT GENERATION
Concurrency Cost: 1
Model Size: 0.5B
Quant: BF16
Ctx Length: 32k
Tool Calling: Supported
Architecture: Transformer
Cold


© 2026 Featherless. All rights reserved.