Overview
This model, kmseong/Llama-3.2-3B-gsm8k_ft_after-rsn-tuned-freeze_rsn_10, is a 3-billion-parameter variant of the meta-llama/Llama-3.2-3B-Instruct base model (the "3.2" refers to the Llama release version, not the parameter count). It was fine-tuned by kmseong using a technique called Safety Neuron Tuning (SN-Tune), which enhances the model's safety alignment while preserving its general capabilities.
Key Capabilities and Features
- Safety Neuron Tuning (SN-Tune): A fine-tuning approach that first identifies a small set of "safety neurons" and then trains only those neurons on dedicated safety alignment data (the Circuit Breakers dataset).
- Parameter-Efficient Fine-tuning: By freezing all non-safety parameters, SN-Tune reduces the compute and memory required for fine-tuning compared with full-parameter training.
- Enhanced Safety Alignment: The model aims to improve safety alignment over its base model, making it less likely to generate harmful or undesirable content.
- Minimal Impact on General Capabilities: The selective fine-tuning process aims to maintain the base model's original performance on general tasks while boosting safety.
- Llama-3.2-3B Architecture: Built on the Llama-3.2-3B-Instruct foundation, it inherits that model's instruction-following and general language capabilities.
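The freeze-then-tune scheme described above can be sketched with a toy PyTorch layer. This is a minimal illustration, not the actual SN-Tune implementation: the layer sizes, the `safety_neurons` indices, and the training data are all placeholders (in practice the neuron indices would come from a separate identification pass, and the layers would belong to the Llama model itself).

```python
import torch
import torch.nn as nn

torch.manual_seed(0)

# Toy stand-ins for two layers of a transformer MLP block.
layer = nn.Linear(16, 32)   # the layer containing the "safety neurons"
other = nn.Linear(32, 16)   # every other parameter gets frozen outright

# Hypothetical output of a safety-neuron identification pass:
# rows of `layer.weight` (output neurons) deemed safety-relevant.
safety_neurons = (3, 7, 19)

# Freeze all parameters outside the targeted layer.
for p in other.parameters():
    p.requires_grad = False

# Within the targeted layer, zero the gradient for every non-safety row,
# so optimizer steps move only the selected neurons.
mask = torch.zeros(layer.out_features, 1)
mask[list(safety_neurons)] = 1.0
layer.weight.register_hook(lambda g: g * mask)
layer.bias.register_hook(lambda g: g * mask.squeeze(1))

opt = torch.optim.SGD(layer.parameters(), lr=0.1)
before = layer.weight.detach().clone()

x = torch.randn(4, 16)                       # placeholder training batch
loss = other(torch.relu(layer(x))).pow(2).mean()
loss.backward()
opt.step()

# Only safety-neuron rows can differ from their pre-step values.
changed = (layer.weight.detach() != before).any(dim=1)
print("updated rows:", changed.nonzero().flatten().tolist())
```

Because the masked gradients are exactly zero, plain SGD leaves the frozen rows bit-identical, which is what makes the method parameter-efficient: only a handful of rows ever receive updates.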
When to Use This Model
This model is particularly well-suited for use cases where:
- Safety is a critical concern: Applications requiring a higher degree of safety and reduced risk of harmful outputs.
- Resource efficiency is important: The parameter-efficient SN-Tune method makes it a good choice for environments with limited computational resources.
- Maintaining general performance is desired: Users want improved safety without significantly degrading the model's broader language understanding and generation abilities.
For more details on the base model, refer to the meta-llama/Llama-3.2-3B-Instruct model card.