kmseong/llama3.2_3b_only_rsn_tuned_lr1e-5
TEXT GENERATIONConcurrency Cost:1Model Size:3.2BQuant:BF16Ctx Length:32kPublished:Apr 6, 2026License:apache-2.0Architecture:Transformer Open Weights Cold

The kmseong/llama3.2_3b_only_rsn_tuned_lr1e-5 model is a 3.2 billion parameter Llama-3.2-3B-Instruct variant developed by kmseong, specifically fine-tuned using the Safety Neuron Tuning (SN-Tune) method. This model focuses on enhancing safety alignment by selectively fine-tuning only critical 'safety neurons' on the Circuit Breakers dataset. It is designed to provide improved safety characteristics while preserving general capabilities, making it suitable for applications requiring robust safety alignment.

Loading preview...

Overview

This model, kmseong/llama3.2_3b_only_rsn_tuned_lr1e-5, is a 3.2 billion parameter variant of the Llama-3.2-3B-Instruct base model. It has been fine-tuned by kmseong using a novel method called Safety Neuron Tuning (SN-Tune). The primary goal of this tuning is to significantly enhance the model's safety alignment without compromising its general performance.

Key Capabilities & Features

  • Safety Neuron Tuning (SN-Tune): A selective fine-tuning approach that identifies and trains only a small subset of neurons critical for safety.
  • Parameter-Efficient Fine-tuning: By freezing most parameters and only training 'safety neurons', the method is highly efficient.
  • Enhanced Safety Alignment: Specifically trained on the Circuit Breakers dataset to improve safety responses and reduce harmful outputs.
  • Preserves General Capabilities: Designed to maintain the broad abilities of the base Llama-3.2-3B-Instruct model.

When to Use This Model

This model is particularly well-suited for use cases where:

  • Safety is paramount: Applications requiring strong safeguards against generating unsafe or undesirable content.
  • Efficiency is key: Leveraging the parameter-efficient SN-Tune method for targeted safety improvements.
  • Base Llama-3.2-3B-Instruct capabilities are desired: When you need the performance of the Llama-3.2-3B-Instruct but with an added layer of safety alignment.