ligeng-dev/q3-8b-train_final_v2_nb2_mt8192_replaced_fix

Task: Text Generation | Concurrency Cost: 1 | Model Size: 8B | Quantization: FP8 | Context Length: 32k | Published: Apr 14, 2026 | Architecture: Transformer | Status: Cold

The ligeng-dev/q3-8b-train_final_v2_nb2_mt8192_replaced_fix model is an 8 billion parameter language model, fine-tuned from Qwen/Qwen3-8B. Developed by ligeng-dev, this model was trained using the TRL library. It is designed for general text generation tasks, leveraging its Qwen3-8B base for robust language understanding and generation capabilities. The model supports a context length of 32768 tokens, making it suitable for processing longer inputs.
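Below is a minimal loading sketch using the Hugging Face transformers API. The repository id comes from this card; the dtype and device-placement options are illustrative defaults rather than documented requirements of this checkpoint.

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "ligeng-dev/q3-8b-train_final_v2_nb2_mt8192_replaced_fix"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype="auto",   # adopt the dtype stored in the checkpoint
    device_map="auto",    # automatic placement; requires the accelerate package
)

prompt = "Briefly explain what a context window is."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```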


Model Overview

This model, ligeng-dev/q3-8b-train_final_v2_nb2_mt8192_replaced_fix, is an 8 billion parameter language model derived from the Qwen/Qwen3-8B architecture. It has undergone supervised fine-tuning (SFT) using the TRL library, indicating a focus on enhancing its conversational and instruction-following abilities.
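For context, an SFT run with TRL typically looks like the sketch below. The actual dataset and hyperparameters for this model are not published on this card, so the dataset name and the 8192-token maximum length (possibly what "mt8192" in the model name refers to) are assumptions.

```python
from datasets import load_dataset
from trl import SFTConfig, SFTTrainer

# Placeholder dataset: the real training data for this model is not disclosed.
dataset = load_dataset("trl-lib/Capybara", split="train")

training_args = SFTConfig(
    output_dir="q3-8b-sft",
    max_length=8192,  # guess based on "mt8192" in the model name; older TRL
                      # versions call this parameter max_seq_length
)

trainer = SFTTrainer(
    model="Qwen/Qwen3-8B",  # the base model named on this card
    args=training_args,
    train_dataset=dataset,
)
trainer.train()
```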

Key Characteristics

  • Base Model: Fine-tuned from Qwen/Qwen3-8B, inheriting its foundational language capabilities.
  • Training Method: Supervised fine-tuning (SFT) via the TRL library, suggesting optimization for instruction adherence and task-specific performance.
  • Context Length: Supports a substantial context window of 32768 tokens, enabling it to handle and generate longer, more coherent texts (see the config check after this list).
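A quick way to sanity-check the advertised 32768-token window is to read the model's config. This assumes the checkpoint exposes max_position_embeddings the way Qwen-family configs do.

```python
from transformers import AutoConfig

config = AutoConfig.from_pretrained(
    "ligeng-dev/q3-8b-train_final_v2_nb2_mt8192_replaced_fix"
)
# Qwen-family configs store the context window here; expect 32768 for 32k.
print(config.max_position_embeddings)
```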

Intended Use Cases

This model is suitable for a variety of text generation tasks where a robust 8B parameter model with a large context window is beneficial. Its fine-tuned nature implies improved performance on tasks aligned with its training data, making it a strong candidate for:

  • General text generation and completion.
  • Conversational AI and chatbots (see the chat sketch after this list).
  • Tasks requiring understanding and generation over extended contexts.
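Since conversational use is listed above, here is a minimal chat-style invocation via the tokenizer's chat template. Whether this fine-tune keeps the base Qwen3 template is an assumption to verify against the repository's tokenizer configuration.

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "ligeng-dev/q3-8b-train_final_v2_nb2_mt8192_replaced_fix"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype="auto", device_map="auto"
)

messages = [
    {"role": "user", "content": "Summarize the plot of Hamlet in two sentences."}
]
input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

output = model.generate(input_ids, max_new_tokens=128)
# Decode only the newly generated tokens, not the prompt.
print(tokenizer.decode(output[0][input_ids.shape[-1]:], skip_special_tokens=True))
```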