YanLabs/gemma-3-27b-it-abliterated-normpreserve

VISIONConcurrency Cost:2Model Size:27BQuant:FP8Ctx Length:32kPublished:Nov 28, 2025License:gemmaArchitecture:Transformer0.0K Cold

YanLabs' Gemma 3 27B Instruct - Norm-Preserving Abliterated is a 27 billion parameter causal language model, derived from google/gemma-3-27b-it. It has been modified using a norm-preserving biprojected abliteration technique to surgically remove refusal behaviors while maintaining original capabilities. This model is specifically intended for mechanistic interpretability research and analysis of LLM safety mechanisms, with safety guardrails and refusal mechanisms intentionally removed.

Loading preview...

What the fuck is this model about?

This model, developed by YanLabs, is a 27 billion parameter causal language model based on Google's Gemma 3 27B Instruct. Its core innovation lies in the application of norm-preserving biprojected abliteration, a technique designed to surgically remove "refusal directions" from the model's activation space. This process aims to eliminate the model's inherent safety guardrails and refusal mechanisms without traditional fine-tuning, preserving its original capabilities.

What makes THIS different from all the other models?

Unlike standard instruction-tuned models that incorporate safety and refusal behaviors, this version has been intentionally stripped of those mechanisms. The abliteration technique is unique because it's not a fine-tuning process; instead, it directly modifies the model's internal representations to remove specific behaviors while attempting to keep other functionalities intact. This makes it a specialized tool for understanding how these safety features operate and can be manipulated.

Should I use this for my use case?

You should use this model if:

  • You are conducting mechanistic interpretability research into large language models.
  • You are analyzing how LLM safety mechanisms function or can be bypassed.
  • You are developing or testing abliteration techniques.

You should NOT use this model for:

  • Any production deployments or user-facing applications.
  • Generating content for malicious purposes.
  • Any scenario where safety, ethical content generation, or refusal of harmful requests is required, as these mechanisms have been removed and it may generate unsafe or harmful content.