ailexleon/Rocinante-X-12B-v1-mlx-fp16
TEXT GENERATIONConcurrency Cost:1Model Size:12BQuant:FP8Ctx Length:32kPublished:Jan 25, 2026Architecture:Transformer Cold

The ailexleon/Rocinante-X-12B-v1-mlx-fp16 is a 12 billion parameter language model, converted to MLX format from TheDrummer/Rocinante-X-12B-v1. This model is specifically designed for efficient inference on Apple Silicon using the MLX framework, offering a specialized deployment option for developers. It maintains a context length of 32768 tokens, making it suitable for tasks requiring extensive contextual understanding. Its primary differentiator is its optimization for MLX, enabling direct use within the Apple ecosystem.

Loading preview...