Name: grimjim/mistralai-Mistral-Nemo-Instruct-2407-12B-MPOA-v1 API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: grimjim

Model Overview

grimjim/mistralai-Mistral-Nemo-Instruct-2407-12B-MPOA-v1 is an instruction-tuned model based on the Mistral architecture. Its distinguishing feature is Magnitude-Preserving Orthogonalized Ablation (MPOA), also known as norm-preserving biprojected abliteration. The technique was applied selectively, to the mlp.down_proj.weight layers across most of the model.

Key Characteristics

MPOA Application: MPOA is applied only to the mlp.down_proj.weight layers, which departs from conventional ablation methods.
Untouched Layers: The self_attn.o_proj.weight layers were intentionally left unmodified, so the intervention targets the MLP output path rather than the attention output path.
Compliance Profile: The model refuses fewer requests on safety grounds. This design choice places it near an "edge of chaos": its output is more permissive and less constrained.
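
The layer selection and the norm-preserving idea can be illustrated with a toy sketch. This is not grimjim's implementation. The parameter names follow the usual Hugging Face naming for Mistral-style models, while the direction `r`, the helper names `select_target_layers` and `ablate_preserving_norms`, and the choice to restore per-column norms are assumptions made for illustration. The biprojection step of MPOA is omitted entirely.

```python
import numpy as np

def select_target_layers(param_names):
    """Keep MLP down-projection weights; attention output projections are skipped."""
    return [n for n in param_names if n.endswith("mlp.down_proj.weight")]

def ablate_preserving_norms(W, r, eps=1e-12):
    """Project direction r out of W's output space, then restore column norms.

    W: (hidden_size, intermediate_size) weight matrix.
    r: (hidden_size,) direction to ablate (hypothetical; how it is found is not shown).
    Restoring per-column norms is an assumption of this sketch.
    """
    r = r / np.linalg.norm(r)
    original_norms = np.linalg.norm(W, axis=0, keepdims=True)
    # Remove the component of every column that lies along r.
    W_ablated = W - np.outer(r, r @ W)
    new_norms = np.linalg.norm(W_ablated, axis=0, keepdims=True)
    # Rescaling a column by a scalar keeps it orthogonal to r.
    return W_ablated * (original_norms / np.maximum(new_norms, eps))

# Toy parameter names for a two-layer model.
names = [
    "model.layers.0.self_attn.o_proj.weight",
    "model.layers.0.mlp.down_proj.weight",
    "model.layers.1.self_attn.o_proj.weight",
    "model.layers.1.mlp.down_proj.weight",
]
targets = select_target_layers(names)

# Toy weights and a random stand-in for the ablated direction.
rng = np.random.default_rng(0)
W = rng.standard_normal((8, 16))
r = rng.standard_normal(8)
W_new = ablate_preserving_norms(W, r)
```

In this sketch `targets` contains only the two down_proj entries, and `W_new` writes nothing along `r` while each column keeps the norm it had in `W`.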

Intended Use Cases

This model is particularly well-suited for:

Varied Text Completion: Its reduced emphasis on safety refusals makes it suitable for generating diverse and potentially unconventional text outputs.
Exploratory Text Generation: Developers seeking a model with less restrictive guardrails for creative or experimental text generation tasks may find this model appropriate.
Research into Ablation Techniques: The specific application of MPOA offers a unique case study for researchers interested in the effects of targeted model modifications.
