Name: Fmuaddib/DeepSeek-R1-Distill-Qwen-14B-mlx-fp16 API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: Fmuaddib

Model Overview

This model, Fmuaddib/DeepSeek-R1-Distill-Qwen-14B-mlx-fp16, is a 14.8 billion parameter language model. It has been converted by Fmuaddib into the MLX format, specifically optimized for use with Apple silicon, utilizing mlx-lm version 0.22.1. The original model, deepseek-ai/DeepSeek-R1-Distill-Qwen-14B, is a distilled variant based on the Qwen architecture.

Key Characteristics

MLX Format: Optimized for performance on Apple silicon, enabling local inference with mlx-lm.
Parameter Count: Features 14.8 billion parameters, offering a balance between performance and computational requirements.
Architecture: Based on the Qwen architecture, known for its strong general language capabilities.
Distilled Model: Represents a distilled version, suggesting potential optimizations for efficiency while retaining core functionalities.

Usage

This model is primarily intended for developers and researchers looking to run DeepSeek-R1-Distill-Qwen-14B on MLX-compatible hardware. It can be loaded and used for text generation tasks via the mlx_lm library, supporting standard prompt-based generation and chat template application if available in the tokenizer.

Overview

Model Overview

Key Characteristics

Usage

Full Model Card (README)