OPTML-Group/SimNPO-MUSE-News-Llama-2-7b

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:7BQuant:FP8Ctx Length:4kPublished:Oct 24, 2024License:mitArchitecture:Transformer Open Weights Warm

The OPTML-Group/SimNPO-MUSE-News-Llama-2-7b is a 7 billion parameter Llama-2 based language model specifically unlearned from the MUSE-News dataset using the SimNPO algorithm. Developed by OPTML-Group, this model demonstrates improved unlearning capabilities, particularly in reducing "VerbMem Df" and "KnowMem Df" metrics, while maintaining reasonable performance on retained knowledge. It is designed for research into model unlearning and privacy-preserving AI, offering a practical example of applying the SimNPO method to remove specific information from a pre-trained LLM.

Loading preview...

OPTML-Group/SimNPO-MUSE-News-Llama-2-7b: A Model for LLM Unlearning

This model, developed by the OPTML-Group, is a 7 billion parameter Llama-2 variant that has undergone a specific unlearning process. Its primary purpose is to showcase the effectiveness of the SimNPO (Simplicity Prevails: Rethinking Negative Preference Optimization) algorithm in removing undesirable information from large language models.

Key Characteristics & Unlearning Process

  • Base Model: Built upon the Llama-2-7b architecture.
  • Unlearning Target: Specifically unlearned from the MUSE-News dataset.
  • Algorithm: Utilizes the novel SimNPO algorithm, detailed in the research paper "Simplicity Prevails: Rethinking Negative Preference Optimization for LLM Unlearning" (arXiv:2410.07163).
  • Optimization Objective: The unlearning process is guided by a specific SimNPO loss function, with hyperparameters like learning rate (1e-5), beta (0.7), lambda (1.0), and gamma (3.0) tuned for effective unlearning.

Evaluation and Performance

The model's unlearning efficacy is evaluated across several metrics, including "VerbMem Df" (Verbatim Memorization Decrease), "KnowMem Df" (Knowledge Memorization Decrease), "PrivLeak" (Privacy Leakage), and "KnowMem Dr" (Knowledge Memorization Retained). Compared to the original and other unlearning methods like NPO, SimNPO demonstrates a significant reduction in memorized information from the unlearned dataset while aiming to preserve general knowledge.

Ideal Use Cases

This model is particularly suited for:

  • Research in LLM Unlearning: Studying the practical application and effectiveness of the SimNPO algorithm.
  • Privacy-Preserving AI: Exploring methods to remove sensitive or unwanted information from trained models.
  • Comparative Analysis: Benchmarking against other unlearning techniques to understand their trade-offs in performance and knowledge retention.