Name: wvnvwn/llama-2-13b-chat-hf-gsm8k-rsn-tuned-lr5e-5 API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: wvnvwn

Model Overview

This model, wvnvwn/llama-2-13b-chat-hf-gsm8k-rsn-tuned-lr5e-5, is a 13 billion parameter variant of the Llama-2 architecture, specifically fine-tuned for enhanced safety alignment. It is based on meta-llama/Llama-3.2-3B-Instruct and utilizes a novel approach called Safety Neuron Tuning (SN-Tune).

Key Capabilities & Features

Safety Neuron Tuning (SN-Tune): A selective fine-tuning method that identifies and tunes only a small set of 'safety neurons' critical for alignment.
Enhanced Safety Alignment: By focusing on safety neurons and training on the Circuit Breakers dataset, the model aims to significantly improve safety performance.
Parameter-Efficient Fine-tuning: SN-Tune freezes most non-safety parameters, making the fine-tuning process highly efficient and minimizing impact on the model's general capabilities.
Llama-2 Base: Benefits from the robust architecture and pre-training of the Llama-2 family.

When to Use This Model

This model is particularly suitable for applications where:

Improved safety alignment is a primary concern.
You need a model that maintains strong general capabilities while being less prone to generating unsafe content.
You are looking for a parameter-efficiently fine-tuned model for safety purposes.

Limitations

While designed for improved safety, users should always perform their own safety evaluations for specific use cases. The base model's characteristics and potential limitations still apply.

Overview

Model Overview

Key Capabilities & Features

When to Use This Model

Limitations

Full Model Card (README)