Name: marshmallow626/gemma-4-E4B-it-OBLITERATED API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: marshmallow626

Model Overview

marshmallow626/gemma-4-E4B-it-OBLITERATED is a 7.9 billion parameter model based on Google's Gemma 4 E4B architecture, distinguished by the removal of its safety guardrails. It was produced with the OBLITERATUS method and has a reported 0% hard refusal rate, meaning it does not outright decline requests. The abliteration process combined whitened SVD, attention head surgery, and winsorized activations, and targeted 21 of the model's 42 layers.

Key Capabilities

Guardrail Removal: Achieves 0% hard refusal, providing uncensored responses for research and creative applications.
Architectural Fixes: Version 3 specifically addresses a critical bug in Gemma 4's shared KV weights, ensuring all 720 tensors are intact for improved quality and coherence.
Autonomous Development: The model was created almost entirely by an AI agent, which also diagnosed and patched the OBLITERATUS tool itself.
Broad Compatibility: Provided in GGUF format for llama.cpp, Ollama, LM Studio, and mobile devices (iPhone, Android), alongside Safetensors for Hugging Face Transformers.
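
The "all 720 tensors are intact" claim can be spot-checked against a downloaded Safetensors checkpoint. The sketch below is not the author's verification procedure. It reads only the JSON header of each .safetensors file (an 8-byte little-endian length followed by the header) and counts the tensor entries. The function names and the assumption that the shards sit together in one local directory are illustrative.

```python
import json
import struct
from pathlib import Path


def count_safetensors_tensors(path):
    """Return the number of tensors listed in one .safetensors file header."""
    with open(path, "rb") as f:
        # First 8 bytes: little-endian unsigned 64-bit length of the JSON header.
        (header_len,) = struct.unpack("<Q", f.read(8))
        header = json.loads(f.read(header_len))
    # "__metadata__" is an optional non-tensor entry, so it is excluded.
    return sum(1 for name in header if name != "__metadata__")


def count_checkpoint_tensors(directory):
    """Sum tensor counts over every *.safetensors shard in a directory (assumed layout)."""
    shards = sorted(Path(directory).glob("*.safetensors"))
    return sum(count_safetensors_tensors(p) for p in shards)
```

Calling count_checkpoint_tensors on the local model directory and comparing the result with 720 gives a quick integrity check. It confirms only that the tensors are present, not that their values are correct.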

Good For

Research and Red-Teaming: Suited to studying model limitations and safety mechanisms, and to generating content without built-in refusals.
Creative Exploration: Suitable for use cases requiring unrestricted text generation, roleplay, or content creation that might otherwise be filtered.
Edge Device Deployment: Optimized GGUF quants (e.g., Q4_K_M at 4.9 GB) enable efficient local inference on mobile phones and other resource-constrained hardware.
Understanding LLM Architecture: Offers insights into the Gemma 4 architecture and the effects of surgical modifications on model behavior.
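
As a rough sanity check on the edge-deployment figure, the quoted Q4_K_M file size can be converted to an average number of bits per parameter. This is a back-of-the-envelope sketch that assumes 1 GB = 10^9 bytes and a 16-bit unquantized baseline, neither of which is stated in the listing. A GGUF file also carries metadata and mixes quantization types across tensors, so the result is only an average.

```python
def bits_per_weight(file_size_gb, n_params_billion):
    """Average bits stored per parameter, assuming 1 GB = 10**9 bytes."""
    return file_size_gb * 1e9 * 8 / (n_params_billion * 1e9)


def unquantized_size_gb(n_params_billion, bytes_per_param=2):
    """Approximate size at a fixed width per parameter (default 2 bytes: assumed 16-bit)."""
    return n_params_billion * bytes_per_param


q4_bits = bits_per_weight(4.9, 7.9)      # figures quoted in the listing
baseline_gb = unquantized_size_gb(7.9)   # assumed 16-bit baseline
print(f"{q4_bits:.2f} bits/weight, baseline about {baseline_gb:.1f} GB")
```

Under these assumptions the 4.9 GB file works out to roughly 5 bits per weight, against about 15.8 GB for a 16-bit copy of the same 7.9 billion parameters.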
