alexgusevski/Llama-3.3-8B-Instruct-128K_Abliterated-mlx-fp16
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:32kPublished:Jan 12, 2026License:llama3.3Architecture:Transformer Cold
The alexgusevski/Llama-3.3-8B-Instruct-128K_Abliterated-mlx-fp16 model is an 8 billion parameter instruction-tuned language model, converted to the MLX format for efficient deployment. Based on the Llama-3.3 architecture, this model is designed for general instruction following tasks. Its primary differentiation lies in its MLX conversion, enabling optimized performance on Apple silicon.
Loading preview...