open-unlearning/unlearn_tofu_Llama-3.2-1B-Instruct_forget10_RMU_lr5e-05_layer5_scoeff10_epoch10

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:1BQuant:BF16Ctx Length:32kTool Calling:SupportedPublished:May 15, 2025Architecture:Transformer Warm

The open-unlearning/unlearn_tofu_Llama-3.2-1B-Instruct_forget10_RMU_lr5e-05_layer5_scoeff10_epoch10 model is a 1 billion parameter instruction-tuned language model with a 32768 token context length. This model is part of the Llama-3.2 family and is specifically designed for unlearning specific information. Its primary differentiator lies in its ability to selectively forget data, making it suitable for applications requiring data privacy or content moderation.

Loading preview...

Overview

This model, unlearn_tofu_Llama-3.2-1B-Instruct_forget10_RMU_lr5e-05_layer5_scoeff10_epoch10, is a 1 billion parameter instruction-tuned language model based on the Llama-3.2 architecture. It features a substantial context length of 32768 tokens, allowing it to process and generate longer sequences of text. The model's name indicates its specialized training for "unlearning" specific information, likely using techniques like "forget10" and "RMU" (likely referring to a specific unlearning method).

Key Capabilities

  • Instruction Following: Designed to respond to user instructions effectively.
  • Long Context Processing: Handles inputs up to 32768 tokens, beneficial for complex tasks requiring extensive context.
  • Selective Unlearning: Optimized to remove or "forget" specific data points or patterns from its knowledge base, a critical feature for privacy-preserving AI or content moderation.

Good For

  • Applications requiring models that can be updated to remove sensitive or outdated information.
  • Research into machine unlearning techniques and their practical applications.
  • Use cases where a model needs to demonstrate a controlled forgetting capability while retaining general instruction-following abilities.