Name: usermma/My-Claude-4.6-Thinking-abliterated-failspy-mlx-fp16 API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: usermma

Model Overview

The usermma/My-Claude-4.6-Thinking-abliterated-failspy-mlx-fp16 model is a 3.1 billion parameter language model, specifically engineered for deployment and inference using Apple's MLX framework. It is a direct conversion of the usermma/My-Claude-4.6-Thinking-abliterated-failspy model, optimized for fp16 precision.

Key Characteristics

MLX Optimization: Converted using mlx-lm version 0.31.2, making it suitable for efficient execution on Apple silicon.
Parameter Count: Features 3.1 billion parameters, offering a balance between performance and resource utilization.
Context Length: Supports a context window of 32768 tokens, enabling processing of longer inputs.
Precision: Utilizes fp16 (half-precision floating-point) for reduced memory footprint and faster inference.

Use Cases

This model is particularly well-suited for:

Local Inference on Apple Silicon: Ideal for developers and researchers looking to run language models directly on their Apple hardware (Macs with M-series chips).
MLX Ecosystem Integration: Seamlessly integrates into MLX-based projects, leveraging its native optimizations.
Experimentation: Provides an accessible model for experimenting with MLX framework capabilities and local LLM deployment.

Overview

Model Overview

Key Characteristics

Use Cases

Full Model Card (README)