Configurable-Llama-3.1-8B-Instruct is a Llama-3.1-8B-Instruct model fine-tuned by Victor Gallego using configurable safety tuning (CST). This approach allows users to dynamically adjust the model's safety and helpfulness behavior through specific system prompts. It is designed for research in safety and alignment, enabling exploration of both harmless and uncensored content generation.
No reviews yet. Be the first to review!