ailexleon/Rocinante-X-12B-v1-mlx-fp16

Text Generation · Concurrency Cost: 1 · Model Size: 12B · Quant: FP8 · Context Length: 32k · Published: Jan 25, 2026 · Architecture: Transformer

ailexleon/Rocinante-X-12B-v1-mlx-fp16 is a 12-billion-parameter language model converted to MLX format from TheDrummer/Rocinante-X-12B-v1. It is designed for efficient inference on Apple Silicon using the MLX framework, giving developers a specialized deployment option within the Apple ecosystem. It supports a context length of 32,768 tokens, making it suitable for tasks that require extensive contextual understanding; its primary differentiator is this MLX optimization.


Overview

The ailexleon/Rocinante-X-12B-v1-mlx-fp16 model is a 12-billion-parameter language model derived from TheDrummer/Rocinante-X-12B-v1. This version was converted to the MLX format using mlx-lm version 0.29.1, optimizing it for efficient execution on Apple Silicon.

Key Capabilities

  • MLX Optimization: Designed for seamless integration and performance within the Apple MLX ecosystem.
  • Parameter Count: Features 12 billion parameters, offering a balance between performance and computational requirements.
  • Context Length: Supports a substantial context window of 32,768 tokens, enabling the model to process longer inputs and generate more coherent, extended outputs.

Usage

This model is intended for developers working on Apple hardware who need a performant local language model. It can be loaded with the mlx-lm library and used for natural language processing tasks such as text generation, summarization, and question answering; if the tokenizer ships with a chat template, the prompt should be formatted with it before generation.
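The usage described above can be sketched with the standard mlx-lm loading pattern. This is a minimal example, not the model card's official snippet; the prompt text and `max_tokens` value are illustrative choices, and running it requires Apple Silicon plus a download of the model weights:

```python
# Requires Apple Silicon and the mlx-lm package: pip install mlx-lm
from mlx_lm import load, generate

# Download (on first use) and load the MLX weights and tokenizer.
model, tokenizer = load("ailexleon/Rocinante-X-12B-v1-mlx-fp16")

prompt = "Summarize the benefits of running language models locally."

# If the tokenizer provides a chat template, apply it so the prompt
# matches the format the model was tuned on.
if tokenizer.chat_template is not None:
    messages = [{"role": "user", "content": prompt}]
    prompt = tokenizer.apply_chat_template(
        messages, add_generation_prompt=True
    )

# Generate a response; max_tokens here is an illustrative limit.
response = generate(model, tokenizer, prompt=prompt, max_tokens=256)
print(response)
```

Because the weights are loaded directly into unified memory, the same script works for interactive experimentation and for embedding the model in a larger Apple-platform application.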