Name: kmseong/llama2_7b_base_resta_lr3e-5 API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: kmseong

Overview

This model, kmseong/llama2_7b_base_resta_lr3e-5, is a 7 billion parameter language model derived from the meta-llama/Llama-3.2-3B-Instruct base model. Its key differentiator is the application of Safety Neuron Tuning (SN-Tune), a specialized fine-tuning method designed to enhance safety alignment.

Key Capabilities

Enhanced Safety Alignment: Fine-tuned using the SN-Tune method on the Circuit Breakers dataset, it aims to provide improved safety compared to its base model.
Parameter-Efficient Fine-tuning: SN-Tune selectively fine-tunes only a small set of 'safety neurons' while freezing other parameters, minimizing computational overhead and preserving general model capabilities.
Llama 2 Architecture: Built upon the Llama 2 family, it inherits the foundational strengths of this architecture.

What is SN-Tune?

SN-Tune is a novel approach that involves:

Identifying specific 'safety neurons' crucial for the model's safety responses.
Freezing all non-safety related parameters.
Fine-tuning only these identified safety neurons on dedicated safety datasets.

This method ensures that safety improvements are targeted and efficient, preventing degradation of the model's broader functionalities. The model operates with a context length of 4096 tokens.

Good For

Applications where safety and responsible AI behavior are paramount.
Use cases requiring a Llama 2-based model with improved resistance to generating harmful content.
Developers looking for a model that balances general language understanding with specific safety enhancements without extensive retraining.

Overview

Overview

Key Capabilities

What is SN-Tune?

Good For

Full Model Card (README)