alexgusevski/Dolphin3.0-Qwen2.5-3b-mlx
Dolphin3.0-Qwen2.5-3b-mlx is a 3.1 billion parameter language model converted by alexgusevski to the MLX format from cognitivecomputations' Dolphin3.0-Qwen2.5-3b. Built on the Qwen2.5 architecture, it is optimized for efficient inference on Apple silicon via the MLX framework, making it well suited to local deployment for general language generation tasks.
Overview
alexgusevski/Dolphin3.0-Qwen2.5-3b-mlx is a 3.1 billion parameter language model converted for efficient execution on Apple silicon using the MLX framework. It is derived from the original cognitivecomputations/Dolphin3.0-Qwen2.5-3b and uses the Qwen2.5 architecture, known for its strong general language capabilities.
Key Capabilities
- MLX Optimization: Converted to the MLX format (with mlx-lm version 0.21.4) for efficient inference on Apple silicon, enabling fast local execution (see the loading sketch after this list).
- Qwen2.5 Base: Benefits from the robust architecture of Qwen2.5, providing solid performance for various language tasks.
- 3.1 Billion Parameters: A compact yet capable model size, suitable for deployment in environments with resource constraints while maintaining good generative quality.
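The snippet below is a minimal sketch of how a model in this format is typically loaded and queried with the mlx-lm Python package; the prompt text is a placeholder for illustration, not part of the original card.

```python
# Requires Apple silicon and the mlx-lm package: pip install mlx-lm
from mlx_lm import load, generate

# Download the converted weights from the Hugging Face Hub and build the model
model, tokenizer = load("alexgusevski/Dolphin3.0-Qwen2.5-3b-mlx")

# Plain text-completion call; the prompt here is just an example
prompt = "Explain what the MLX framework is in one paragraph."
response = generate(model, tokenizer, prompt=prompt, verbose=True)
print(response)
```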
Good For
- Local Inference on Apple Silicon: Ideal for developers and users looking to run LLMs directly on their macOS devices with enhanced speed and efficiency.
- General Language Generation: Suitable for a wide range of applications including text completion, summarization, and conversational AI where a smaller, performant model is desired (a chat-style sketch follows this list).
- Experimentation with MLX: Provides a ready-to-use model for exploring the capabilities and performance benefits of the MLX framework.
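For conversational use, the tokenizer's chat template can be applied before generation. This is a hedged sketch that assumes the converted repository ships a chat template; the system and user messages are placeholders.

```python
from mlx_lm import load, generate

model, tokenizer = load("alexgusevski/Dolphin3.0-Qwen2.5-3b-mlx")

# Format a multi-turn conversation with the model's chat template, if present
messages = [
    {"role": "system", "content": "You are a helpful assistant."},  # placeholder system prompt
    {"role": "user", "content": "Summarize the benefits of running LLMs locally."},
]
if tokenizer.chat_template is not None:
    prompt = tokenizer.apply_chat_template(
        messages, tokenize=False, add_generation_prompt=True
    )
else:
    prompt = messages[-1]["content"]  # fall back to plain completion

response = generate(model, tokenizer, prompt=prompt, verbose=True)
print(response)
```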