valuat/DeepSeek-R1-Distill-Llama-8B-mlx-fp16

Text generation · Concurrency cost: 1 · Model size: 8B · Quant: FP8 · Context length: 32k · Published: Sep 30, 2025 · License: MIT · Architecture: Transformer · Open weights

valuat/DeepSeek-R1-Distill-Llama-8B-mlx-fp16 is an 8-billion-parameter language model converted to MLX format from the deepseek-ai/DeepSeek-R1-Distill-Llama-8B base model. It is designed for efficient deployment and inference on Apple Silicon via the MLX framework, and it retains the original model's capabilities, offering a performant option for general language tasks within the Apple ecosystem.


Model Overview

valuat/DeepSeek-R1-Distill-Llama-8B-mlx-fp16 is a direct MLX conversion of deepseek-ai/DeepSeek-R1-Distill-Llama-8B, an 8-billion-parameter language model. The conversion targets Apple's MLX framework, ensuring compatibility and optimized performance on Apple Silicon hardware.

Key Characteristics

  • Architecture: Llama-based transformer; the base model, DeepSeek-R1-Distill-Llama-8B, is an 8B Llama model distilled from DeepSeek-R1.
  • Parameter Count: 8 billion parameters.
  • Format: Converted to MLX format using mlx-lm version 0.26.4, enabling efficient execution on Apple devices.
  • Context Length: Supports a context length of 32,768 tokens.
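A conversion like the one described above can be reproduced with the `mlx_lm.convert` command-line tool. This is a hedged sketch, not the exact command used for this model: the output path is illustrative, and the flags assume a recent mlx-lm release.

```shell
# Install the MLX LM tooling (Apple Silicon only).
pip install mlx-lm

# Convert the base model from the Hugging Face Hub to MLX format.
# --dtype float16 keeps full fp16 weights (no quantization, matching
# the "fp16" suffix of this repo); the output path is an assumption.
mlx_lm.convert \
  --hf-path deepseek-ai/DeepSeek-R1-Distill-Llama-8B \
  --mlx-path ./DeepSeek-R1-Distill-Llama-8B-mlx-fp16 \
  --dtype float16
```

Passing `-q` instead of `--dtype float16` would produce a quantized variant; the fp16 conversion trades disk and memory footprint for full-precision weights.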

Usage

This model is intended for developers working within the MLX ecosystem. It can be loaded and used for text generation tasks directly with the mlx-lm library, providing a streamlined experience for local inference on compatible hardware.
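The loading-and-generation flow described above can be sketched with the mlx-lm Python API. This is a minimal example, assuming mlx-lm is installed on an Apple Silicon machine; the prompt text is illustrative.

```python
# Minimal local-inference sketch with mlx-lm (requires `pip install mlx-lm`
# and an Apple Silicon Mac). The model id is the repo from this card.
from mlx_lm import load, generate

model, tokenizer = load("valuat/DeepSeek-R1-Distill-Llama-8B-mlx-fp16")

# R1-distilled models are chat-tuned, so apply the chat template
# before generating rather than passing raw text.
messages = [{"role": "user", "content": "Explain what MLX is in one paragraph."}]
prompt = tokenizer.apply_chat_template(
    messages, tokenize=False, add_generation_prompt=True
)

response = generate(model, tokenizer, prompt=prompt, max_tokens=256)
print(response)
```

The first call to `load` downloads the weights from the Hub and caches them locally; subsequent runs load from the cache.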

Good For

  • Apple Silicon Development: Ideal for applications requiring local LLM inference on Macs with Apple Silicon.
  • General Language Tasks: Suitable for a wide range of natural language processing tasks, leveraging the capabilities of the original DeepSeek-R1-Distill-Llama-8B model.
  • Experimentation: Provides an accessible way to experiment with a capable 8B parameter model in an optimized local environment.