Name: alexchen4ai/Qwen3-8B-Instruct API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: alexchen4ai

Overview

alexchen4ai/Qwen3-8B-Instruct is a specialized 8.2 billion parameter language model, derived from the larger Qwen/Qwen3-VL-8B-Instruct vision-language model. This version has had all vision components removed, making it a pure text-generation LLM. It operates in bfloat16 precision and is licensed under Apache 2.0.

Key Capabilities

Pure Text Generation: Focuses exclusively on text-based tasks, eliminating the overhead of vision processing.
Instruction Following: Designed to understand and execute text-based instructions.
Chat Applications: Suitable for conversational AI and chatbot development.
Fine-tuning: Can be further trained on specific text-only datasets.
Memory Efficient: Offers reduced memory footprint compared to the full multimodal Qwen3-VL-8B-Instruct.

Architecture Highlights

Parameters: ~8.2 billion (8,190,735,360)
Context Length: 32,768 tokens (Max Position Embeddings: 262,144)
Layers: 36 transformer layers
Attention: 32 attention heads with 8 KV heads (GQA)

Use Cases

This model is ideal for scenarios requiring a robust, text-only large language model. It excels in applications such as:

General text generation and completion
Building instruction-tuned agents
Developing chat interfaces
Text-based data analysis and summarization

Limitations

No Vision Support: This model explicitly does not handle image or video inputs. For multimodal tasks, users should refer to the original Qwen3-VL-8B-Instruct.

Overview

Overview

Key Capabilities

Architecture Highlights

Use Cases

Limitations

Full Model Card (README)