ailexleon/Impish_Bloodmoon_12B-mlx-fp16
TEXT GENERATIONConcurrency Cost:1Model Size:12BQuant:FP8Ctx Length:32kPublished:Dec 25, 2025License:apache-2.0Architecture:Transformer Open Weights Cold

Impish_Bloodmoon_12B-mlx-fp16 is a 12 billion parameter language model, converted to the MLX format by ailexleon from the original SicariusSicariiStuff/Impish_Bloodmoon_12B. This model is designed for efficient inference on Apple Silicon, leveraging the MLX framework. It maintains a 32768 token context length, making it suitable for tasks requiring extensive contextual understanding.

Loading preview...