trohrbaugh/gemma-4-31b-it-heretic-ara
trohrbaugh/gemma-4-31b-it-heretic-ara is a 31 billion parameter instruction-tuned causal language model, a decensored version of Google DeepMind's Gemma-4-31b-it. This model was created using the Heretic v1.2.0+custom tool with the Arbitrary-Rank Ablation (ARA) method, specifically designed to reduce refusals. It maintains the multimodal capabilities of the original Gemma 4 series, handling text and image inputs with a 32768 token context window, making it suitable for reasoning, coding, and agentic workflows with fewer content restrictions.
Loading preview...
Model Overview
This model, trohrbaugh/gemma-4-31b-it-heretic-ara, is a 31 billion parameter instruction-tuned variant derived from Google DeepMind's Gemma-4-31b-it. Its primary distinction is being a decensored version, achieved through the application of the Heretic v1.2.0+custom tool using the Arbitrary-Rank Ablation (ARA) method. This process significantly reduces content refusals, as evidenced by a drop from 98/100 to 5/100 in refusal rates compared to the original model, while maintaining a low KL divergence of 0.0120.
Key Capabilities
- Decensored Output: Engineered to produce fewer refusals, offering more direct responses.
- Multimodal Understanding: Processes both text and image inputs, with the ability to analyze video by processing frame sequences.
- Extended Context Window: Supports a substantial context length of up to 256K tokens (for the base Gemma 4 31B model).
- Reasoning and Coding: Designed for strong reasoning capabilities, enhanced coding, and agentic workflows with native function-calling support.
- Multilingual Support: Pre-trained on over 140 languages, with out-of-the-box support for 35+ languages.
When to Use This Model
- Applications requiring less restrictive content generation: Ideal for use cases where the original model's refusal rates are too high.
- Multimodal tasks: Suitable for scenarios involving interleaved text and image inputs, such as image captioning, document parsing, or visual question answering.
- Complex reasoning and coding tasks: Benefits from the underlying Gemma 4 architecture's advancements in these areas.
- Agentic workflows: Leverages native function-calling support for building autonomous agents.