TeichAI/Qwen3-4B-Thinking-2507-GPT-5.1-Codex-Max-Distill
TeichAI/Qwen3-4B-Thinking-2507-GPT-5.1-Codex-Max-Distill is a 4-billion-parameter Qwen3 model developed by TeichAI, fine-tuned from unsloth/qwen3-4b-thinking-2507. It was trained 2x faster using Unsloth together with Hugging Face's TRL library, and it supports a 32,768-token context window, making it suitable for applications that require substantial input processing. The "Thinking" and "GPT-5.1-Codex-Max-Distill" parts of the name suggest a focus on reasoning and code-related tasks, presumably via distillation from GPT-5.1-Codex-Max outputs.
Overview
TeichAI/Qwen3-4B-Thinking-2507-GPT-5.1-Codex-Max-Distill is a 4-billion-parameter Qwen3-based language model developed by TeichAI. It is fine-tuned from unsloth/qwen3-4b-thinking-2507 using Unsloth and Hugging Face's TRL library, a combination the authors report made training 2x faster and that reflects an emphasis on efficiency in its development.
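The card does not include usage instructions, so the following is a minimal sketch assuming the standard transformers causal-LM interface and a chat template inherited from the Qwen3 base model; the prompt and generation settings are illustrative, not recommendations from the authors.

```python
# Minimal sketch: load the checkpoint with Hugging Face transformers and
# run a single chat turn. Assumes a standard causal-LM checkpoint and a
# Qwen3-style chat template; generation settings are illustrative.
from transformers import AutoModelForCausalLM, AutoTokenizer

repo_id = "TeichAI/Qwen3-4B-Thinking-2507-GPT-5.1-Codex-Max-Distill"

tokenizer = AutoTokenizer.from_pretrained(repo_id)
model = AutoModelForCausalLM.from_pretrained(
    repo_id, torch_dtype="auto", device_map="auto"
)

messages = [{"role": "user", "content": "Explain what a distilled model is."}]
inputs = tokenizer.apply_chat_template(
    messages,
    add_generation_prompt=True,
    return_tensors="pt",
).to(model.device)

outputs = model.generate(inputs, max_new_tokens=512)
# Decode only the newly generated tokens, skipping the prompt.
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```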
Key Capabilities
- Efficient Training: Built with Unsloth and Hugging Face's TRL library, enabling 2x faster training.
- Qwen3 Architecture: Based on the Qwen3 model family, providing a robust, well-supported foundation.
- Context Length: Supports a context window of 32,768 tokens, suitable for processing longer inputs (see the config check after this list).
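To verify the advertised context window rather than take the card's word for it, you can inspect the model config. max_position_embeddings is the usual attribute name for Qwen3-family configs; treat that as an assumption, since this card does not confirm it.

```python
# Sketch: confirm the advertised 32,768-token window from the config.
from transformers import AutoConfig

cfg = AutoConfig.from_pretrained(
    "TeichAI/Qwen3-4B-Thinking-2507-GPT-5.1-Codex-Max-Distill"
)
print(cfg.max_position_embeddings)  # expected: 32768
```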
Good For
- Applications requiring a compact 4-billion-parameter model trained with an emphasis on efficiency.
- Use cases that benefit from the Qwen3 architecture and a 32,768-token context window.
- Developers interested in models that pair with Unsloth for faster fine-tuning iteration cycles (a hedged loading sketch follows this list).
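For the faster-iteration use case, here is a hedged sketch of reloading the checkpoint with Unsloth for LoRA fine-tuning. FastLanguageModel is Unsloth's loader; the 4-bit quantization flag and LoRA hyperparameters are illustrative defaults, not settings documented by this card, and compatibility with this specific repo is assumed rather than confirmed.

```python
# Hedged sketch: reload the checkpoint with Unsloth for fast LoRA
# fine-tuning iterations. Hyperparameters below are illustrative.
from unsloth import FastLanguageModel

model, tokenizer = FastLanguageModel.from_pretrained(
    model_name="TeichAI/Qwen3-4B-Thinking-2507-GPT-5.1-Codex-Max-Distill",
    max_seq_length=32768,  # matches the advertised context window
    load_in_4bit=True,     # reduces memory footprint for a 4B model
)

# Attach a small LoRA adapter so only a fraction of weights train.
model = FastLanguageModel.get_peft_model(
    model,
    r=16,
    lora_alpha=16,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
)
```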