Name: gabfssilva/VibeThinker-3B-MLX-BF16 API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: gabfssilva

Model Overview

This model, gabfssilva/VibeThinker-3B-MLX-BF16, is a 3.1 billion parameter language model. It is a conversion of the original WeiboAI/VibeThinker-3B model into the MLX format, specifically optimized for Apple Silicon hardware. The conversion was performed using mlx-lm version 0.31.3, incorporating BF16 (bfloat16) precision for potentially faster inference and reduced memory footprint.

Key Characteristics

Parameter Count: 3.1 billion parameters, offering a balance between performance and resource efficiency.
MLX Format: Optimized for Apple Silicon, allowing for efficient local execution on compatible devices.
BF16 Precision: Utilizes bfloat16 for potentially faster computation and lower memory usage during inference.
Base Model: Converted from WeiboAI/VibeThinker-3B, indicating its foundational capabilities are derived from that architecture.

Usage and Integration

This model is primarily intended for developers working within the MLX ecosystem. It can be easily loaded and used for text generation tasks with the mlx-lm library. The provided code examples demonstrate how to load the model and tokenizer, apply chat templates if available, and generate responses, making it straightforward to integrate into MLX-based applications.

Overview

Model Overview

Key Characteristics

Usage and Integration

Full Model Card (README)