thomaskuo/gemma-3-12b-it-heretic

Vision · Concurrency Cost: 1 · Model Size: 12B · Quant: FP8 · Ctx Length: 32k · Published: Feb 12, 2026 · License: gemma · Architecture: Transformer

thomaskuo/gemma-3-12b-it-heretic is a 12-billion-parameter instruction-tuned Gemma 3 model, abliterated with the Heretic tool to sharply reduce refusals while preserving model quality. It is optimized for use as an uncensored text encoder for video generation models such as LTX-2, where it encodes creative prompts more faithfully. The model keeps the base model's 32768-token context length, shows minimal abliteration damage (KL divergence of 0.0826 from the base model), and achieves a 93% reduction in refusals.


Overview

This model, thomaskuo/gemma-3-12b-it-heretic, is an abliterated version of Google's Gemma 3 12B IT, created by thomaskuo using the Heretic tool. Its primary purpose is to serve as an uncensored text encoder, particularly for video generation models like LTX-2, by reducing the base model's tendency to refuse or sanitize certain concepts.
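
For reference, here is a minimal loading sketch using the transformers classes from the official Gemma 3 model cards (Gemma3ForConditionalGeneration and AutoProcessor); the dtype and device settings are illustrative assumptions, not part of this card.

```python
# Minimal sketch: loading the checkpoint for text generation with transformers.
# Assumes a recent transformers release with Gemma 3 support.
import torch
from transformers import AutoProcessor, Gemma3ForConditionalGeneration

model_id = "thomaskuo/gemma-3-12b-it-heretic"
model = Gemma3ForConditionalGeneration.from_pretrained(
    model_id, torch_dtype=torch.bfloat16, device_map="auto"  # assumed settings
)
processor = AutoProcessor.from_pretrained(model_id)

messages = [
    {"role": "user", "content": [{"type": "text", "text": "Describe a neon-lit street at night."}]},
]
inputs = processor.apply_chat_template(
    messages, add_generation_prompt=True, tokenize=True,
    return_dict=True, return_tensors="pt",
).to(model.device)

with torch.no_grad():
    output = model.generate(**inputs, max_new_tokens=128)

# Decode only the newly generated tokens, skipping the prompt.
print(processor.decode(output[0][inputs["input_ids"].shape[-1]:], skip_special_tokens=True))
```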

Key Capabilities & Features

  • Reduced Refusals: Achieves a significant reduction in refusals (7/100 vs. 100/100 for the original model), meaning 93% of previously refused prompts now work.
  • Minimal Model Damage: The abliteration process resulted in a low KL divergence of 0.0826, indicating that the core model quality is largely preserved.
  • Enhanced Prompt Adherence: By removing soft censorship, the model ensures more faithful encoding of creative prompts, leading to stronger adherence and less altered visual outputs in downstream applications like LTX-2.
  • Versatile Formats: Available in HuggingFace safetensors, ComfyUI safetensors (bf16, fp8), and various GGUF quantizations (F16, Q8_0, Q6_K, Q5_K_M, Q4_K_M, Q3_K_M) for compatibility with transformers, ComfyUI, and llama.cpp; a GGUF usage sketch follows this list.
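
For the GGUF builds, a minimal sketch with llama-cpp-python follows; the .gguf filename here is an assumption, so check the repo's file listing for the exact name.

```python
# Minimal sketch: running a GGUF quantization via llama-cpp-python.
from llama_cpp import Llama

llm = Llama(
    model_path="gemma-3-12b-it-heretic-Q4_K_M.gguf",  # assumed filename
    n_ctx=32768,      # matches the model's 32k context length
    n_gpu_layers=-1,  # offload all layers to GPU if available
)

out = llm.create_chat_completion(
    messages=[{"role": "user", "content": "Describe a storm rolling over a desert at dusk."}],
    max_tokens=128,
)
print(out["choices"][0]["message"]["content"])
```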

Ideal Use Cases

  • Video Generation: Specifically designed for use as a text encoder within video generation workflows, such as with LTX-2, where uncensored and faithful prompt interpretation is crucial.
  • Creative Applications: Suitable for applications requiring an instruction-tuned model that avoids sanitization or weakening of creative concepts in its outputs.
  • Research & Experimentation: Useful for researchers exploring the impact of refusal reduction techniques on large language models and their downstream applications.
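
To make the text-encoder role concrete, the sketch below extracts per-token hidden states for a prompt. How LTX-2 or another video pipeline consumes these embeddings (layer choice, pooling, projection) is pipeline-specific; everything beyond the encoding step is an assumption.

```python
# Minimal sketch: using the model as a text encoder by extracting hidden states.
import torch
from transformers import AutoTokenizer, Gemma3ForConditionalGeneration

model_id = "thomaskuo/gemma-3-12b-it-heretic"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = Gemma3ForConditionalGeneration.from_pretrained(
    model_id, torch_dtype=torch.bfloat16, device_map="auto"  # assumed settings
)

prompt = "A slow dolly shot through a rain-soaked neon alley"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)

with torch.no_grad():
    outputs = model(**inputs, output_hidden_states=True)

# Last-layer hidden states: one embedding per prompt token. A downstream
# video pipeline would consume these (or a pooled/projected variant) as
# its text conditioning.
prompt_embeds = outputs.hidden_states[-1]
print(prompt_embeds.shape)  # (1, sequence_length, hidden_size)
```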