sealad886/FuseO1-DeepSeekR1-Qwen2.5-Coder-7B-Preview-MLX

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:7.6BQuant:FP8Ctx Length:32kPublished:Mar 16, 2025Architecture:Transformer Warm

The sealad886/FuseO1-DeepSeekR1-Qwen2.5-Coder-7B-Preview-MLX is a 7.6 billion parameter language model, converted to MLX format from the FuseO1-DeepSeekR1-Qwen2.5-Coder-7B-Preview model. This model is specifically designed for use within the MLX framework, enabling efficient deployment and inference on Apple silicon. It is currently a work in progress, indicating ongoing development and potential for future enhancements. Its primary utility lies in providing a compatible model for MLX-based applications.

Loading preview...

Overview

This model, sealad886/FuseO1-DeepSeekR1-Qwen2.5-Coder-7B-Preview-MLX, is a 7.6 billion parameter language model. It has been specifically converted to the MLX format from its original version, sealad886/FuseO1-DeepSeekR1-Qwen2.5-Coder-7B-Preview, using mlx-lm version 0.22.0. This conversion optimizes the model for use with Apple's MLX framework, which is designed for efficient machine learning on Apple silicon.

Key Characteristics

  • MLX Compatibility: Optimized for the MLX framework, making it suitable for deployment on Apple hardware.
  • Parameter Count: Features 7.6 billion parameters, offering a balance between performance and computational requirements.
  • Development Status: Currently marked as a "work in progress," suggesting that its output may not yet be fully validated or stable.

Usage

This model is intended for developers working within the MLX ecosystem. It can be loaded and used for text generation tasks via the mlx_lm library, as demonstrated in the provided Python code examples. Users should be aware of its developmental status when integrating it into applications.