llmfan46/Qwen3.5-27B-uncensored-heretic-v1

VISIONConcurrency Cost:2Model Size:27BQuant:FP8Ctx Length:32kTool Calling:SupportedPublished:Mar 10, 2026License:apache-2.0Architecture:Transformer0.0K Open Weights Cold

llmfan46/Qwen3.5-27B-uncensored-heretic-v1 is a 27 billion parameter causal language model, a decensored version of Qwen/Qwen3.5-27B. Developed by llmfan46 using the Heretic v1.2.0 tool with Arbitrary-Rank Ablation (ARA) method, it significantly reduces content refusals (92% fewer) while maintaining core model quality with a low KL divergence of 0.0331. This model is optimized for applications requiring less restrictive content generation, offering enhanced flexibility for diverse use cases.

Loading preview...

Model Overview

llmfan46/Qwen3.5-27B-uncensored-heretic-v1 is a 27 billion parameter model derived from the Qwen/Qwen3.5-27B base, specifically engineered to reduce content refusals. This decensoring process was achieved using the Heretic v1.2.0 tool, employing the Arbitrary-Rank Ablation (ARA) method. The model demonstrates a significant reduction in refusals, achieving 8/100 compared to the original's 95/100, while preserving model quality with a KL divergence of 0.0331.

Key Capabilities

  • Reduced Content Restrictions: Offers 92% fewer refusals, making it suitable for a broader range of applications where content filtering is undesirable.
  • Preserved Core Performance: Maintains the original Qwen3.5-27B's capabilities in areas like common-sense reasoning (PIQA) and general knowledge (MMLU), with minimal impact on accuracy.
  • Multimodal Foundation: Inherits Qwen3.5's unified vision-language foundation, efficient hybrid architecture, scalable RL generalization, and global linguistic coverage (201 languages).
  • Extended Context Length: Supports a native context length of 262,144 tokens, extensible up to 1,010,000 tokens, crucial for complex tasks and maintaining thinking capabilities.

Good For

  • Creative Content Generation: Ideal for scenarios requiring unrestricted text generation, such as creative writing, role-playing, or brainstorming without typical LLM content filters.
  • Research and Development: Useful for researchers exploring the impact of censorship on LLM behavior and for developing applications that require a more open-ended response style.
  • Applications Requiring Flexibility: Suitable for developers building applications where the model needs to respond to a wide array of prompts without predefined content limitations.