Name: DavidAU/Qwen3.6-9B-Heretic-Uncensored-Thinking-Sweet-Madness API
Brand: Featherless.ai
Price: 25.00 USD
Availability: InStock
Author: DavidAU

Overview

DavidAU/Qwen3.6-9B-Heretic-Uncensored-Thinking-Sweet-Madness is a unique 9 billion parameter model derived from the Qwen 3.6 27B architecture. It underwent a "Heretic" uncensoring process by P-E-W (heretic'ing by "trohrbaugh") and was subsequently "shrunk" to 9B parameters and 16 layers using a modified Mergekit. The model was then fine-tuned on local hardware via Unsloth across six datasets in two stages, with specific tuning to unify its new layer structure.

Key Capabilities

Uncensored Generation: Designed to be fully uncensored, allowing for unrestricted content creation.
Creative & Unconventional Output: Excels at generating highly imaginative, 'mad,' and unpredictable text, often including detailed 'thinking' processes.
Fast Inference: Achieves speeds of 200 tokens/second on a 5090 GPU (Q4KS quantization), partly due to its reduced 16-layer structure.
Multimodal Foundations: Retains intact image/video training and systems lifted from the full 27B model.
Extended Context: Supports a 256k context length.

Important Considerations

Experimental Nature: Described as "Sweet Madness," it's an experimental model that may require additional tuning for specific or general use cases.
Knowledge Gaps: Due to the unique compression method, some knowledge or skills from the original 27B model may be missing.
Performance vs. Qwen 3.5 9B: The model's performance may be surpassed by Qwen 3.5 9B until it is fully tuned with a minimum of 25-50k samples.

Good For

Experimental Creative Writing: Ideal for generating highly imaginative, non-linear, or 'mad' narratives and dialogues.
Exploring Unconventional AI Behavior: Useful for researchers or developers interested in pushing the boundaries of AI output and exploring uncensored responses.
Rapid Prototyping: Its fast inference speed makes it suitable for quick generation of creative content.

Settings Recommendations

Temperature: 1
Top P: 0.95
Min P: 0.05
Repetition Penalty: 1.05 (1.1 strongly suggested, especially for short prompts)
Context: Minimum 8k to 16k
Quantization: Q4KS minimum or IQ3_M Imatrix

Overview

Overview

Key Capabilities

Important Considerations

Good For

Settings Recommendations

Full Model Card (README)