grimjim/Llama-3.1-8B-Instruct-abliterated_via_adapter

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:32kPublished:Jul 25, 2024License:llama3.1Architecture:Transformer0.0K Warm

grimjim/Llama-3.1-8B-Instruct-abliterated_via_adapter is an 8 billion parameter instruction-tuned language model based on Meta's Llama-3.1-8B-Instruct. This model was created using a task arithmetic merge method to "abliterate" refusal behaviors, leveraging a LoRA derived from Llama 3. It is specifically designed to reduce content refusal rates while maintaining the base model's capabilities, making it suitable for applications requiring less restrictive content generation.

Loading preview...

Model Overview

grimjim/Llama-3.1-8B-Instruct-abliterated_via_adapter is an 8 billion parameter instruction-tuned language model built upon Meta's Llama-3.1-8B-Instruct. This model was developed using a task arithmetic merge method with mergekit to specifically address and reduce refusal behaviors often present in base models. The process involved applying a LoRA (Low-Rank Adaptation) originally derived from Llama 3 to the Llama 3.1 base, indicating significant architectural commonality between the Llama 3 and 3.1 series.

Key Capabilities

  • Reduced Refusal Rates: The primary modification targets the "abliteration" of content refusal mechanisms, allowing for broader content generation.
  • Llama 3.1 Foundation: Benefits from the robust capabilities and performance of the Meta-Llama-3.1-8B-Instruct base model.
  • Adapter-Based Modification: Utilizes a LoRA for targeted behavior modification, demonstrating the effectiveness of adapter-based fine-tuning for specific use cases.

Use Cases

This model is particularly suited for applications where the default refusal behaviors of instruction-tuned models are undesirable. It can be used in scenarios requiring more permissive content generation, such as creative writing, role-playing, or research where certain topics might otherwise trigger content restrictions. Developers seeking a Llama 3.1-based model with a modified refusal policy will find this model beneficial.

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p