alexgusevski/Llama3.3-8B-Instruct-Thinking-Claude-4.5-Opus-High-Reasoning-mlx-fp16
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:8kPublished:Jan 12, 2026License:apache-2.0Architecture:Transformer Open Weights Cold

The alexgusevski/Llama3.3-8B-Instruct-Thinking-Claude-4.5-Opus-High-Reasoning-mlx-fp16 model is an 8 billion parameter instruction-tuned language model, converted to MLX format from DavidAU's Llama3.3-8B-Instruct-Thinking-Claude-4.5-Opus-High-Reasoning. This model is designed for high-reasoning tasks, leveraging the Llama3.3 architecture and fine-tuning inspired by Claude 4.5 Opus. It is optimized for local deployment on Apple Silicon via MLX, making it suitable for advanced instruction-following and complex problem-solving.

Loading preview...