Name: wvnvwn/gemma-2-9b-it-lr5e-5-safeinstr-0.05 API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: wvnvwn

Model Overview

The wvnvwn/gemma-2-9b-it-lr5e-5-safeinstr-0.05 is a 9 billion parameter instruction-tuned model derived from the meta-llama/Llama-3.2-3B-Instruct base. Its primary distinguishing feature is the application of Safety Neuron Tuning (SN-Tune), a specialized fine-tuning methodology aimed at enhancing model safety.

Key Capabilities & Features

Safety Neuron Tuning (SN-Tune): This innovative method identifies and selectively fine-tunes only a small subset of "safety neurons" within the model. All other parameters remain frozen during this process.
Enhanced Safety Alignment: By focusing fine-tuning on safety-critical neurons using the Circuit Breakers dataset, the model aims to provide improved safety alignment compared to its base model.
Parameter-Efficient Fine-tuning: The SN-Tune approach is highly parameter-efficient, as it only modifies a limited number of neurons, thereby minimizing computational overhead.
Preservation of General Capabilities: The selective tuning is designed to enhance safety without significantly degrading the model's broader performance or general capabilities.

Ideal Use Cases

This model is particularly well-suited for applications where:

Safety is a paramount concern: Developers need a model with an explicit focus on reducing harmful outputs.
Maintaining base model performance is crucial: Users want safety improvements without sacrificing the general instruction-following abilities of the Llama-3.2-3B-Instruct architecture.
Efficient safety alignment is desired: The SN-Tune method offers a targeted and efficient way to integrate safety into an existing model.

Overview

Model Overview

Key Capabilities & Features

Ideal Use Cases

Full Model Card (README)