Goekdeniz-Guelmez/Qwen3-4B-Thinking-2507-gabliterated is a 4-billion-parameter language model by Goekdeniz-Guelmez, built with the novel "Gabliteration" neural weight modification technique. Gabliteration aims to selectively alter model behavior, here a significant reduction in refusal rates, without compromising overall quality. The model scores 9.5 on the W/10 benchmark, the first 4B model to reach that mark, and is optimized for improved writing and natural language understanding.
What the fuck is this model about?
This model, developed by Goekdeniz-Guelmez, introduces the Gabliteration technique, a novel neural weight modification method designed to selectively alter specific behavioral patterns in large language models. Unlike traditional abliteration, Gabliteration uses adaptive multi-directional projections to modify refusal behaviors while preserving overall model quality. The model is a 4 billion parameter variant from a series ranging from 0.6B to 32B parameters, showcasing the scalability of this new technique.
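The card does not spell out Gabliteration's exact math, but the abliteration family it extends works by projecting estimated "refusal directions" out of selected weight matrices. Below is a minimal, self-contained sketch of that multi-directional projection in PyTorch; the function name, toy shapes, and random placeholder directions are illustrative assumptions, not the author's implementation.

```python
import torch

def project_out(weight: torch.Tensor, directions: torch.Tensor) -> torch.Tensor:
    """Remove the span of `directions` from the output space of `weight`:
    W <- (I - Q Q^T) W, where Q is an orthonormal basis for the directions."""
    Q, _ = torch.linalg.qr(directions.T)   # Q: [hidden, k], orthonormal columns
    return weight - Q @ (Q.T @ weight)

# Toy example: hidden size 8, two candidate refusal directions.
hidden = 8
W = torch.randn(hidden, hidden)
refusal_dirs = torch.randn(2, hidden)
W_mod = project_out(W, refusal_dirs)

# The modified weight's outputs have no component along either direction.
d = refusal_dirs[0] / refusal_dirs[0].norm()
print((d @ W_mod).abs().max())  # numerically zero
```

In practice the directions are estimated from model activations (for example, by contrasting prompts the model refuses with ones it answers); the "adaptive" part of Gabliteration presumably governs how those directions and projection strengths are chosen, which this sketch does not attempt to reproduce.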
What makes THIS different from all the other models?
The primary differentiator is the Gabliteration technique itself. It addresses a limitation of existing abliteration methods, which often degrade model quality when modifying behaviors such as refusal. On the W/10 benchmark, this model scores 9.5, a substantial jump from the base model's 2.8, making it the first 4B model to reach that score and indicating stronger writing capabilities and natural language understanding (NatInt).
Key Characteristics:
- Gabliteration Technique: Adaptive multi-directional neural weight modification for selective behavioral alteration.
- Reduced Refusal: Achieves a refusal rate of 2/100, significantly lower than many base models.
- Improved Writing: Demonstrates a W/10 benchmark of 9.5, indicating strong writing performance.
- Enhanced Natural Language Understanding: Shows improved NatInt scores compared to its base.
- Fixed Layer Selection: Utilizes a fixed layer selection (layer 18 out of 36) for the Gabliteration process.
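To make the fixed-layer point concrete, here is a hedged sketch of applying such a projection only at block 18 of the 36-layer model. It assumes `Qwen/Qwen3-4B-Thinking-2507` as the base checkpoint and the standard Hugging Face Qwen3 module layout; the random `refusal_dirs` stand in for directions that would really be estimated from activations, and which matrices Gabliteration actually edits is not stated in the card.

```python
import torch
from transformers import AutoModelForCausalLM

def project_out(weight: torch.Tensor, directions: torch.Tensor) -> torch.Tensor:
    # W <- (I - Q Q^T) W, as in the earlier sketch.
    Q, _ = torch.linalg.qr(directions.T)
    return weight - Q @ (Q.T @ weight)

# Assumed base checkpoint; the published model already has the edit applied.
model = AutoModelForCausalLM.from_pretrained("Qwen/Qwen3-4B-Thinking-2507")

TARGET_LAYER = 18                                         # fixed layer, per the card
refusal_dirs = torch.randn(2, model.config.hidden_size)   # placeholder directions

block = model.model.layers[TARGET_LAYER]
with torch.no_grad():
    # Edit the matrices that write into the residual stream at this block
    # (an assumption about which weights the technique touches).
    for lin in (block.self_attn.o_proj, block.mlp.down_proj):
        lin.weight.copy_(project_out(lin.weight.float(), refusal_dirs).to(lin.weight.dtype))
```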
Should I use this for my use case?
This model is particularly suitable for use cases where reduced refusal rates and improved writing quality are critical, especially within a 4 billion parameter constraint. If you require a model that can generate more direct or less filtered responses, or if your application benefits from higher-quality written output and natural language understanding, this model could be a strong candidate. However, users should be aware that it has reduced safety filtering and may generate sensitive or controversial outputs, requiring responsible deployment and usage. Consider its specific strengths in writing and refusal reduction when evaluating against other models of similar size.
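If you want to evaluate it yourself, a minimal transformers quick-start is sketched below; the repo id matches the model name above, while the prompt and sampling settings are illustrative. As a "Thinking" variant, it will typically emit its reasoning before the final answer.

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Goekdeniz-Guelmez/Qwen3-4B-Thinking-2507-gabliterated"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")  # needs `accelerate`

messages = [{"role": "user",
             "content": "Write the opening paragraph of a noir story set in a rain-soaked city."}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

output = model.generate(inputs, max_new_tokens=512, do_sample=True, temperature=0.6)
print(tokenizer.decode(output[0][inputs.shape[-1]:], skip_special_tokens=True))
```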