Name: kmseong/llama2_7b_chat_resta_lr5e-5_y0.5 API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: kmseong

Model Overview

This model, kmseong/llama2_7b_chat_resta_lr5e-5_y0.5, is a 7 billion parameter chat model derived from meta-llama/Llama-3.2-3B-Instruct. Its primary distinction lies in its application of SN-Tune (Safety Neuron Tuning), a specialized fine-tuning method developed by kmseong.

Key Capabilities & Features

Enhanced Safety Alignment: The model has undergone targeted fine-tuning using the Circuit Breakers dataset to improve its safety responses.
SN-Tune Methodology: This innovative approach involves:
- Detecting specific 'safety neurons' within the model architecture.
- Freezing all other parameters to preserve general capabilities.
- Fine-tuning only these identified safety neurons on safety-specific data.
Parameter-Efficient Fine-tuning: By focusing only on a small subset of neurons, SN-Tune achieves safety improvements with minimal computational overhead and reduced risk of 'catastrophic forgetting' of general knowledge.
Base Model Preservation: The method aims to maintain the general capabilities of the original Llama-3.2-3B-Instruct model while significantly boosting its safety profile.

Ideal Use Cases

This model is particularly well-suited for applications where:

Safety is paramount: Developers need a chat model with an improved defense against generating harmful or undesirable content.
General utility is still required: The model's core conversational abilities are largely preserved, making it versatile for various chat-based tasks.
Efficient safety integration is desired: The SN-Tune method offers a practical way to enhance safety without extensive retraining or large-scale parameter adjustments.

Overview

Model Overview

Key Capabilities & Features

Ideal Use Cases

Full Model Card (README)