wvnvwn/llama-2-13b-chat-hf-only-sn-tuned-lr5e-5

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:13BQuant:FP8Ctx Length:4kPublished:May 2, 2026License:apache-2.0Architecture:Transformer Open Weights Warm

wvnvwn/llama-2-13b-chat-hf-only-sn-tuned-lr5e-5 is a 13 billion parameter language model, fine-tuned from meta-llama/Llama-3.2-3B-Instruct using the Safety Neuron Tuning (SN-Tune) method. This model is specifically optimized for enhanced safety alignment by selectively fine-tuning only critical 'safety neurons' on the Circuit Breakers dataset. It aims to improve safety without significantly impacting general capabilities, making it suitable for applications requiring robust content moderation and ethical AI responses.

Loading preview...

Overview

This model, wvnvwn/llama-2-13b-chat-hf-only-sn-tuned-lr5e-5, is a 13 billion parameter language model derived from the meta-llama/Llama-3.2-3B-Instruct base model. Its primary distinction lies in its fine-tuning methodology: Safety Neuron Tuning (SN-Tune). This approach focuses on enhancing safety alignment in a parameter-efficient manner.

Key Capabilities & Features

  • Enhanced Safety Alignment: Specifically fine-tuned to improve safety responses and reduce harmful outputs.
  • SN-Tune Methodology: Utilizes a novel fine-tuning technique that identifies and selectively trains 'safety neurons' while freezing other parameters.
  • Parameter-Efficient Fine-tuning: By only adjusting a small subset of neurons, the model achieves safety improvements with minimal computational overhead and reduced risk of degrading general performance.
  • Base Model Preservation: Aims to maintain the general capabilities of its Llama-3.2-3B-Instruct base model.
  • Training Data: Fine-tuned on the 'Circuit Breakers' dataset, which is designed for safety alignment.

Ideal Use Cases

This model is particularly well-suited for applications where:

  • Content Moderation is a critical requirement.
  • Ethical AI Responses are paramount.
  • There is a need for a language model with improved safety characteristics compared to its base model.
  • Developers seek a model that balances general utility with strong safety guardrails.