Name: lordx64/Qwen3.6-35B-A3B-Kimi-K2.6-Reasoning-Distilled API
Brand: Featherless.ai
Price: 25.00 USD
Availability: InStock
Author: lordx64

Overview

This model, Qwen3.6-35B-A3B-Kimi-K2.6-Reasoning-Distilled, is a 35.1 billion parameter Mixture-of-Experts (MoE) variant of the Qwen3.6-35B-A3B base model. It has been fine-tuned to imitate the verbose, deliberate chain-of-thought reasoning style of Moonshot AI's Kimi K2.6, a frontier reasoning model. The goal is to port Kimi-grade reasoning behavior into a permissively-licensed MoE model that can be run by individuals.

Key Capabilities

Kimi-style Reasoning: Fine-tuned on ~7.8k high-quality reasoning traces from Kimi K2.6, teaching the model to explicitly "think" using <think>…</think> blocks.
Verbose Reasoning Chains: Inherits Kimi K2.6's tendency to produce significantly longer and more careful reasoning chains compared to other models, averaging ~3.4x longer than Claude Opus 4.7 in observed datasets.
Efficient MoE Architecture: The base model is a 35B-parameter MoE with 256 experts, routing 8 experts plus 1 shared, resulting in only ~3B active parameters per token for efficient inference.
Extended Context: Supports a 64k token context, allowing for long reasoning processes (5-30k tokens of <think> output) on challenging problems.
Companion Model: Designed to be directly comparable with its Claude-distilled sibling, offering a choice between Kimi's longer, deliberate reasoning and Claude's shorter, tighter chains.

Good For

Hard Reasoning Tasks: Excels in graduate-level STEM, competition math (AIME/MATH), code reasoning with explicit walk-throughs, and multi-step logic puzzles.
Agentic Planning: Useful for scenarios where explicit <think> blocks enhance correctness and transparency.
Predictable Reasoning Output: Provides reliable <think>-block reasoning regardless of prompt pattern, which can be beneficial when the base model's thinking mode is conditional.

Overview

Overview

Key Capabilities

Good For

Full Model Card (README)