Name: AIPlans/Qwen3-0.6B-SFT-hs2 API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: AIPlans

Model Overview

AIPlans/Qwen3-0.6B-SFT-hs2 is a 0.8 billion parameter language model developed by AIPlans, fine-tuned from the Qwen/Qwen3-0.6B-Base architecture. This model was specifically created for the AI-Plans Model Diffing project, focusing on understanding and evaluating changes in lightweight LLMs.

Key Characteristics

Base Model: Fine-tuned from Qwen/Qwen3-0.6B-Base.
Training Method: Utilizes Supervised Fine-Tuning (SFT) with TRL, specifically trained on responses with a score of 3 or higher from its dataset.
Training Efficiency: Achieved training completion in approximately 1 hour and 10 minutes using an A100 (40 GB) GPU.

Intended Use Cases

This model is designed for research purposes, particularly in:

Model Diffing: Investigating differences between model versions.
Preference Fine-Tuning: Exploring methods for aligning models with specific preferences.
Behavioral Evaluation: Analyzing how minor changes impact the behavior of lightweight LLMs.

Overview

Model Overview

Key Characteristics

Intended Use Cases

Full Model Card (README)