Writer/palmyra-mini-MLX-BF16
Text Generation | Concurrency Cost: 1 | Model Size: 1.5B | Quant: BF16 | Ctx Length: 32k | Published: Sep 5, 2025 | License: apache-2.0 | Architecture: Transformer | Open Weights | Warm

Writer's Palmyra Mini MLX BF16 is a 1.7 billion parameter language model based on the Qwen2 architecture and optimized for Apple Silicon devices through the MLX framework. It keeps full bfloat16 precision and supports a 131,072-token context window. The model targets complex reasoning and mathematical problem-solving, with strong reported performance on benchmarks such as GSM8K and MATH500.
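
As a rough illustration of what full bfloat16 precision means for memory, the sketch below estimates the size of the weights alone from a parameter count. It relies only on the fact that bfloat16 uses 2 bytes per value. The helper names `weight_bytes` and `to_gib` are hypothetical, and the estimate ignores the KV cache, activations, and framework overhead, so actual usage on a device will be higher.

```python
# Back-of-the-envelope weight memory for a BF16 checkpoint.
# Assumption: every parameter is stored as bfloat16 (16 bits = 2 bytes).
BYTES_PER_BF16 = 2


def weight_bytes(num_params: int, bytes_per_param: int = BYTES_PER_BF16) -> int:
    """Bytes needed for the weights alone (no KV cache, no activations)."""
    return num_params * bytes_per_param


def to_gib(num_bytes: int) -> float:
    """Convert a byte count to binary gibibytes (2**30 bytes)."""
    return num_bytes / 2**30


# Parameter counts quoted on this page: 1.7B in the description,
# 1.5B in the listing metadata. Both are shown for comparison.
for label, params in [("1.7B", 1_700_000_000), ("1.5B", 1_500_000_000)]:
    size = weight_bytes(params)
    print(f"{label}: {size / 1e9:.1f} GB ({to_gib(size):.2f} GiB)")
```

Under these assumptions, the weights come to roughly 3.4 GB for 1.7 billion parameters, which gives a lower bound on the unified memory needed before any context is processed.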
