Qwen/Qwen3-8B-MLX-bf16
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:32kPublished:May 23, 2025License:apache-2.0Architecture:Transformer0.0K Open Weights Warm

Qwen/Qwen3-8B-MLX-bf16 is an 8.2 billion parameter causal language model from the Qwen series, developed by Qwen. This model uniquely supports seamless switching between a 'thinking mode' for complex logical reasoning, math, and coding, and a 'non-thinking mode' for efficient general-purpose dialogue. It delivers enhanced reasoning capabilities, superior human preference alignment for creative writing and role-playing, and strong agent capabilities with multilingual support for over 100 languages.

Loading preview...