failspy/Llama-3-8B-Instruct-abliterated

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:8kPublished:May 7, 2024License:llama3Architecture:Transformer0.0K Warm

failspy/Llama-3-8B-Instruct-abliterated is an 8 billion parameter instruction-tuned Llama 3 model, derived from meta-llama/Llama-3-8B-Instruct. This variant has undergone specific weight manipulation to inhibit refusal behaviors, based on research into refusal mediation. It maintains the original Llama 3 instruction tuning while exploring the effects of orthogonalizing refusal directions, making it suitable for applications where reduced refusal tendencies are desired.

Loading preview...

Model Overview

failspy/Llama-3-8B-Instruct-abliterated is a modified version of the 8 billion parameter Llama 3 Instruct model. Its primary distinction lies in the orthogonalization of bfloat16 safetensor weights, a process designed to inhibit the model's tendency to refuse certain prompts. This methodology is based on the research presented in the paper/blog post "Refusal in LLMs is mediated by a single direction", aiming to reduce ethical/safety lecturing and outright refusals.

Key Characteristics

  • Refusal Inhibition: Weights have been manipulated to reduce the model's propensity for refusal, while otherwise retaining the original Llama 3 instruction tuning.
  • Experimental Nature: This model is an early exploration of ablation techniques to modify specific behaviors, and may exhibit unique quirks due to its novel methodology.
  • Context Length: Supports an 8192-token context window, consistent with the base Llama 3 Instruct model.
  • Quantization: GGUF quants are available for efficient deployment.

Potential Use Cases

This model is particularly suited for:

  • Research into LLM behavior: Experimenting with and understanding the effects of targeted weight manipulation on model responses.
  • Applications requiring reduced refusal: Scenarios where a model that is less prone to refusing requests or lecturing on ethics is preferred, while acknowledging it may still occur.
  • Exploring novel ablation techniques: Developers interested in contributing to the understanding of side effects and improvements in this new methodology.

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p