Name: wvnvwn/gemma-2-9b-it-lr3e-5-safeinstr-0.05 API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: wvnvwn

Model Overview

This model, wvnvwn/gemma-2-9b-it-lr3e-5-safeinstr-0.05, is a 9 billion parameter instruction-tuned variant of the Llama-3.2-3B-Instruct base model. Its primary differentiator is the application of Safety Neuron Tuning (SN-Tune), a specialized fine-tuning method designed to enhance safety alignment.

Key Capabilities & Features

Enhanced Safety Alignment: Fine-tuned specifically to improve safety, aiming to reduce the generation of harmful or undesirable content.
Parameter-Efficient Fine-tuning: Utilizes the SN-Tune method, which involves:
- Detecting and isolating a small set of 'safety neurons' critical for safety.
- Freezing all non-safety parameters.
- Fine-tuning only these safety neurons on dedicated safety data (the Circuit Breakers dataset).
Minimal Impact on General Capabilities: The selective fine-tuning approach is intended to enhance safety without significantly degrading the model's broader language understanding and generation abilities.

When to Use This Model

This model is particularly suitable for use cases where:

Safety is a paramount concern: Applications requiring a higher degree of safety and reduced risk of generating problematic content.
Efficiency in safety alignment is desired: Leveraging a method that focuses fine-tuning efforts on specific safety-critical components.

It offers an improved safety profile compared to its base model, making it a strong candidate for sensitive applications.

Overview

Model Overview

Key Capabilities & Features

When to Use This Model

Full Model Card (README)