Name: edbeeching/Qwen3-0.6B-SFT-Eval-Smoke API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: edbeeching

Overview

The edbeeching/Qwen3-0.6B-SFT-Eval-Smoke is a 0.8 billion parameter language model, derived from the Qwen/Qwen3-0.6B base model. It has undergone Supervised Fine-Tuning (SFT) using the TRL library, a framework for Transformer Reinforcement Learning. This fine-tuning process aims to optimize the model's ability to follow instructions and generate relevant text.

Key Capabilities

Instruction Following: Enhanced through SFT, allowing it to generate responses aligned with user prompts.
Text Generation: Capable of producing coherent and contextually appropriate text.
Context Handling: Supports a substantial context length of 32768 tokens, enabling it to process and generate longer sequences of text.

Training Details

The model was trained using the SFT method, leveraging specific versions of popular machine learning frameworks:

TRL: 1.7.0.dev0
Transformers: 5.5.4
Pytorch: 2.10.0
Datasets: 5.0.0
Tokenizers: 0.22.2

Good For

Conversational AI: Suitable for applications requiring interactive text generation based on user input.
Prototyping: Its smaller size makes it efficient for rapid experimentation and development of language-based features.
Instruction-based tasks: Ideal for scenarios where the model needs to generate specific types of content following explicit instructions.

Overview

Overview

Key Capabilities

Training Details

Good For

Full Model Card (README)