Qwen3-14B-MLX-bf16 Overview
Qwen3-14B-MLX-bf16 is a 14.8 billion parameter causal language model from the latest Qwen series, provided here as bf16 weights for Apple's MLX framework. It can dynamically switch between a 'thinking mode' for intricate tasks such as mathematical problem-solving, code generation, and logical reasoning, and a 'non-thinking mode' optimized for efficient general-purpose dialogue. This dual-mode functionality allows adaptive performance across diverse scenarios.
Key Capabilities
- Adaptive Reasoning: Seamlessly transitions between modes to enhance performance in complex logical reasoning, mathematics, and coding, surpassing previous Qwen models.
- Human Preference Alignment: Demonstrates superior performance in creative writing, role-playing, multi-turn conversations, and instruction following, leading to more natural interactions.
- Agentic Functionality: Excels in tool calling and integration with external tools, achieving leading performance among open-source models on agent-based tasks.
- Multilingual Support: Supports over 100 languages and dialects, offering strong multilingual instruction following and translation capabilities.
- Extended Context: Natively handles context lengths up to 32,768 tokens, with support for up to 131,072 tokens using the YaRN method for processing long texts.
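Extending the context window beyond the native 32,768 tokens requires opting in to YaRN in the model configuration. The exact field names depend on the serving stack and library version; the sketch below assumes a Transformers-style `rope_scaling` entry in `config.json` (a common convention, not confirmed by this card) with a scaling factor of 4.0, which corresponds to 32,768 × 4 = 131,072 tokens.

```python
import json

def enable_yarn(config_path, factor=4.0, original_max=32768):
    """Add a YaRN rope_scaling entry to a model's config.json.

    Assumes the Transformers-style field layout; verify the exact
    key names against your inference library's documentation.
    """
    with open(config_path) as f:
        cfg = json.load(f)
    cfg["rope_scaling"] = {
        "rope_type": "yarn",
        "factor": factor,  # 4.0 -> 32,768 * 4 = 131,072 tokens
        "original_max_position_embeddings": original_max,
    }
    with open(config_path, "w") as f:
        json.dump(cfg, f, indent=2)
    return cfg
```

Note that a static scaling factor applies even to short inputs, so it is generally best to enable YaRN only when long-context processing is actually needed.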
Usage Considerations
Optimal performance is achieved by adjusting sampling parameters to match the active mode: Temperature=0.6 and TopP=0.95 are recommended for 'thinking mode', and Temperature=0.7 and TopP=0.8 for 'non-thinking mode'. The model also supports dynamic mode switching via the /think and /no_think tags placed in user prompts.
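Since the recommended sampling settings differ by mode, a client can pick them based on the most recent mode tag in the prompt. The helper below is an illustrative sketch, not part of any official Qwen or MLX API; it assumes the last /think or /no_think tag wins and that thinking mode is the default when no tag is present.

```python
# Recommended sampling settings per mode (from the model card).
THINKING = {"temperature": 0.6, "top_p": 0.95}
NON_THINKING = {"temperature": 0.7, "top_p": 0.8}

def sampling_params(prompt: str, default_thinking: bool = True) -> dict:
    """Pick sampling settings based on the last mode tag in the prompt.

    Illustrative helper: the most recent /think or /no_think tag
    determines the mode; with no tag, fall back to the default.
    """
    think_pos = prompt.rfind("/think")      # -1 if absent
    no_think_pos = prompt.rfind("/no_think")
    if think_pos == -1 and no_think_pos == -1:
        return THINKING if default_thinking else NON_THINKING
    return THINKING if think_pos > no_think_pos else NON_THINKING
```

For example, `sampling_params("Summarize this article. /no_think")` selects the non-thinking settings, while a prompt with no tag keeps the thinking-mode defaults.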