open-unlearning/unlearn_tofu_Llama-3.2-1B-Instruct_forget10_IdkDPO_lr5e-05_beta0.05_alpha1_epoch10

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:1BQuant:BF16Ctx Length:32kTool Calling:SupportedPublished:May 15, 2025Architecture:Transformer Warm

The open-unlearning/unlearn_tofu_Llama-3.2-1B-Instruct_forget10_IdkDPO_lr5e-05_beta0.05_alpha1_epoch10 model is a Llama-3.2-1B-Instruct variant that has undergone an unlearning process. This model is specifically designed to forget 10 pre-defined concepts using the IdkDPO method, with a learning rate of 5e-05, beta of 0.05, alpha of 1, and trained for 10 epochs. Its primary purpose is to demonstrate and evaluate the effectiveness of unlearning techniques in large language models, making it suitable for research in model safety and controlled information removal.

Loading preview...

Model Overview

This model, open-unlearning/unlearn_tofu_Llama-3.2-1B-Instruct_forget10_IdkDPO_lr5e-05_beta0.05_alpha1_epoch10, is a specialized variant of the Llama-3.2-1B-Instruct architecture. Its core distinction lies in its application of an unlearning process to remove specific information or behaviors from the base model.

Key Capabilities

  • Targeted Unlearning: The model has been explicitly trained to "forget" 10 pre-defined concepts or pieces of information.
  • IdkDPO Method: It utilizes the IdkDPO (Inverse KL-divergence DPO) unlearning method, indicating a focus on controlled and effective knowledge removal.
  • Research & Evaluation: This model is primarily intended for research purposes, allowing developers and researchers to study the efficacy and impact of unlearning techniques on LLMs.

Good For

  • Studying Model Unlearning: Ideal for experiments and analysis of how unlearning algorithms modify model behavior and knowledge.
  • Safety Research: Useful for exploring methods to mitigate biases, remove sensitive data, or enforce ethical guidelines in AI models.
  • Controlled Information Removal: Demonstrates a practical application of removing specific data points or concepts from a pre-trained model without extensive retraining.

Limitations

As indicated by the model card, many details regarding its development, training data, specific use cases, and potential biases are currently marked as "More Information Needed." Users should exercise caution and conduct thorough evaluations before deploying this model in production environments, especially given its experimental nature in unlearning.