Name: TeichAI/Qwen3-8B-Kimi-K2-Thinking-Distill API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: TeichAI

Model Overview

TeichAI/Qwen3-8B-Kimi-K2-Thinking-Distill is an 8 billion parameter language model built upon the Qwen3 architecture. Developed by TeichAI, this model is a fine-tuned version of unsloth/Qwen3-8B-unsloth-bnb-4bit.

Key Characteristics

Training Data: The model was specifically trained on 1000 examples derived from MoonshotAI's Kimi k2 thinking dataset, suggesting an optimization for particular reasoning or thought processes.
Efficient Training: It utilizes Unsloth and Huggingface's TRL library, enabling a reported 2x faster training process.
Context Length: Supports a context window of 32768 tokens.
License: Distributed under the Apache-2.0 license.

Potential Use Cases

Given its specialized training on 'Kimi k2 thinking' examples, this model is likely well-suited for:

Tasks requiring specific reasoning or problem-solving approaches similar to those found in the Kimi k2 thinking dataset.
Applications where efficient inference from a Qwen3-8B base is desired.
Scenarios benefiting from a model trained with Unsloth's speed optimizations.

Overview

Model Overview

Key Characteristics

Potential Use Cases

Full Model Card (README)