huihui-ai/GLM-4-9B-0414-abliterated

TEXT GENERATIONConcurrency Cost:1Model Size:9BQuant:FP8Ctx Length:32kArchitecture:Transformer0.0K Cold

The huihui-ai/GLM-4-9B-0414-abliterated model is an uncensored 9 billion parameter language model derived from THUDM/GLM-4-9B-0414. Developed by huihui-ai, this model utilizes an abliteration technique to remove refusal behaviors, offering a proof-of-concept for modifying LLM responses without TransformerLens. It is designed for applications requiring a less restrictive conversational AI, with a context length of 32768 tokens.

Loading preview...

Model Overview

The huihui-ai/GLM-4-9B-0414-abliterated is a 9 billion parameter language model based on the THUDM/GLM-4-9B-0414 architecture. Its primary distinction is the application of an "abliteration" technique, a proof-of-concept method developed by huihui-ai to remove refusal behaviors from the original model without relying on TransformerLens. This modification aims to provide a less censored conversational experience.

Key Capabilities

  • Uncensored Responses: Engineered to bypass typical refusal mechanisms found in base models, offering direct answers to a broader range of prompts.
  • GLM-4 Architecture: Inherits the foundational capabilities of the GLM-4-9B-0414 model.
  • Extended Context Window: Supports a context length of 32768 tokens, suitable for processing longer inputs and maintaining conversational coherence over extended interactions.
  • Quantization Support: Optimized for deployment with 2-bit and 4-bit quantization using BitsAndBytesConfig, enabling efficient use of computational resources.

Use Cases

This model is particularly suited for developers and researchers exploring the boundaries of LLM behavior and those requiring a model with reduced censorship for specific applications. It can be used in scenarios where the default refusal mechanisms of other models are undesirable, such as creative writing, open-ended research, or experimental conversational agents. Users should be aware of the implications of using an uncensored model.