The taki555/Qwen3-4B-Shadow-FT-BAAI-2k model is a 4-billion-parameter instruction-tuned language model based on the Qwen3-4B architecture. Developed by Taiqiang Wu et al., it uses the Shadow-FT framework, which improves instruction-following capabilities by grafting the weight updates learned by fine-tuning the base model onto its instruction-tuned variant. The model is tuned on a 2k subset of the BAAI/Infinity-Instruct dataset, offering enhanced performance on instruction-based tasks.
Model Overview
The taki555/Qwen3-4B-Shadow-FT-BAAI-2k is a 4-billion-parameter instruction-tuned model derived from the Qwen3-4B base architecture. It was developed by Taiqiang Wu, Runming Yang, Jiayi Li, Pengfei Hu, Ngai Wong, and Yujiu Yang, who introduced the Shadow-FT framework for fine-tuning instruction-following models.
Key Capabilities & Innovations
- Shadow-FT Framework: This model implements a fine-tuning approach in which the base model is fine-tuned and its learned weight updates are transferred directly to the instruction-tuned model (see the sketch after this list). This addresses the observation that directly tuning instruct models often yields marginal or even negative performance changes.
- Leverages Base Model Strengths: By fine-tuning the base model, which shares high weight similarity with its instruction-tuned counterpart, Shadow-FT effectively transfers robust learning without degrading the instruction model's existing capabilities.
- Instruction-Tuned Performance: Tuned on a 2k subset of the BAAI/Infinity-Instruct dataset (the "BAAI-2k" in the model name), this model is designed to excel at instruction-following tasks.
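The grafting step at the heart of Shadow-FT can be expressed in a few lines. The sketch below is illustrative rather than the authors' released tooling: it assumes all three checkpoints (base, fine-tuned base, and instruct) share identical architectures and parameter names, the Qwen/Qwen3-4B-Base and Qwen/Qwen3-4B ids follow the public Qwen3 naming, and the tuned-base path is a hypothetical placeholder.

```python
# Illustrative Shadow-FT grafting sketch -- not the authors' released code.
# Assumes the three checkpoints share the same architecture and parameter names.
import torch
from transformers import AutoModelForCausalLM

# The tuned-base path is a hypothetical placeholder for a base model
# fine-tuned on the BAAI/Infinity-Instruct subset.
base = AutoModelForCausalLM.from_pretrained("Qwen/Qwen3-4B-Base", torch_dtype=torch.bfloat16)
tuned = AutoModelForCausalLM.from_pretrained("path/to/base-tuned-on-baai-subset", torch_dtype=torch.bfloat16)
instruct = AutoModelForCausalLM.from_pretrained("Qwen/Qwen3-4B", torch_dtype=torch.bfloat16)

base_w = dict(base.named_parameters())
tuned_w = dict(tuned.named_parameters())

with torch.no_grad():
    for name, param in instruct.named_parameters():
        # Delta learned by fine-tuning the base model on the target data,
        # grafted directly onto the corresponding instruct weight.
        param.add_(tuned_w[name] - base_w[name])

instruct.save_pretrained("Qwen3-4B-Shadow-FT")
```

Because the base and instruct checkpoints are highly similar, the delta learned on the base is expected to transfer onto the instruct weights without overwriting the instruction-following behavior acquired during the original alignment stage.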
When to Use This Model
This model is particularly suitable for applications requiring robust instruction-following capabilities, especially in scenarios where directly tuning an instruct model has yielded marginal gains. Its fine-tuning methodology aims to deliver improved performance on instruction-based prompts.
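For inference, the model should load with the standard transformers text-generation flow. The following is a minimal sketch: the prompt content is an arbitrary example, and chat-template details (e.g., Qwen3's optional thinking mode) should be verified against the upstream Qwen3-4B documentation.

```python
# Minimal inference sketch using the standard transformers generation flow.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "taki555/Qwen3-4B-Shadow-FT-BAAI-2k"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

messages = [{"role": "user", "content": "Explain the Shadow-FT idea in two sentences."}]
input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

output_ids = model.generate(input_ids, max_new_tokens=256)
# Decode only the tokens generated after the prompt.
print(tokenizer.decode(output_ids[0][input_ids.shape[-1]:], skip_special_tokens=True))
```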