grimjim/mistralai-Mistral-Nemo-Instruct-2407-12B-MPOA-v1

TEXT GENERATIONConcurrency Cost:1Model Size:12BQuant:FP8Ctx Length:32kPublished:Nov 19, 2025License:apache-2.0Architecture:Transformer0.0K Open Weights Cold

The grimjim/mistralai-Mistral-Nemo-Instruct-2407-12B-MPOA-v1 model is a variant of the Mistral architecture, featuring Magnitude-Preserving Othogonalized Ablation (MPOA) applied to the mlp.down_proj.weight layers. This modification aims to alter model behavior while preserving certain properties, distinguishing it from standard Mistral models. It is designed for varied text completion tasks, exhibiting a reduced emphasis on safety refusals compared to highly compliant models. This model is suitable for applications requiring flexible and less constrained text generation.

Loading preview...

Model Overview

The grimjim/mistralai-Mistral-Nemo-Instruct-2407-12B-MPOA-v1 is a specialized instruction-tuned model based on the Mistral architecture. Its primary distinguishing feature is the application of Magnitude-Preserving Othogonalized Ablation (MPOA), also known as norm-preserving biprojected abliteration. This technique has been selectively applied to the mlp.down_proj.weight layers across the majority of the model's architecture.

Key Characteristics

  • MPOA Application: MPOA is specifically applied to the mlp.down_proj.weight layers, a departure from conventional ablation methods.
  • Untouched Layers: Notably, self_attn.o_proj.weight layers were intentionally left unmodified, suggesting a targeted approach to altering model characteristics.
  • Compliance Profile: The model exhibits a lower level of compliance regarding safety refusals. This design choice positions it near an "edge of chaos," indicating a more permissive and less constrained output behavior.

Intended Use Cases

This model is particularly well-suited for:

  • Varied Text Completion: Its reduced emphasis on safety refusals makes it suitable for generating diverse and potentially unconventional text outputs.
  • Exploratory Text Generation: Developers seeking a model with less restrictive guardrails for creative or experimental text generation tasks may find this model appropriate.
  • Research into Ablation Techniques: The specific application of MPOA offers a unique case study for researchers interested in the effects of targeted model modifications.