theAIDataExec/Kimi-Dev-72B-mlx-fp16

Hugging Face
TEXT GENERATIONConcurrency Cost:4Model Size:72.7BQuant:FP8Ctx Length:32kPublished:Jul 9, 2025License:mitArchitecture:Transformer Open Weights Warm

The theAIDataExec/Kimi-Dev-72B-mlx-fp16 is a 72.7 billion parameter language model, converted to MLX format from the moonshotai/Kimi-Dev-72B model. This model is designed for efficient deployment and inference on Apple Silicon using the MLX framework. Its primary utility lies in providing a large-scale language model for local execution within the MLX ecosystem.

Loading preview...

Overview

The theAIDataExec/Kimi-Dev-72B-mlx-fp16 is a substantial 72.7 billion parameter language model, specifically adapted for the Apple MLX framework. It originates from the moonshotai/Kimi-Dev-72B model and has been converted using mlx-lm version 0.22.3. This conversion enables optimized performance and local execution on Apple Silicon devices.

Key Characteristics

  • MLX Format: Optimized for Apple Silicon, allowing for efficient local inference.
  • Large Parameter Count: With 72.7 billion parameters, it offers significant language understanding and generation capabilities.
  • Ease of Use: Provides straightforward integration with the mlx-lm library for loading and generating text.

Usage

This model is primarily intended for developers and researchers who wish to leverage a powerful language model directly on their Apple hardware. It supports standard mlx-lm workflows, including loading the model and tokenizer, applying chat templates, and generating responses from prompts. The conversion ensures compatibility and performance within the MLX ecosystem, making it suitable for local development and experimentation with large language models.