open-unlearning/unlearn_tofu_Llama-3.2-1B-Instruct_forget10_AltPO_lr5e-05_beta0.1_alpha2_epoch10

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:1BQuant:BF16Ctx Length:32kTool Calling:SupportedPublished:May 15, 2025Architecture:Transformer Warm

The open-unlearning/unlearn_tofu_Llama-3.2-1B-Instruct_forget10_AltPO_lr5e-05_beta0.1_alpha2_epoch10 model is a 1 billion parameter instruction-tuned language model based on the Llama-3.2 architecture, featuring a 32768 token context length. This model is specifically designed for unlearning, indicated by its 'forget10' and 'AltPO' training parameters, suggesting it has undergone a process to remove specific information or biases. Its primary use case is likely research into machine unlearning and evaluating methods for controlled information removal in large language models.

Loading preview...

Model Overview

This model, open-unlearning/unlearn_tofu_Llama-3.2-1B-Instruct_forget10_AltPO_lr5e-05_beta0.1_alpha2_epoch10, is a 1 billion parameter instruction-tuned language model built upon the Llama-3.2 architecture. It supports a substantial context length of 32768 tokens, allowing for processing and generating longer sequences of text.

Key Characteristics

  • Architecture: Based on the Llama-3.2 family, known for its strong performance across various language tasks.
  • Parameter Count: A compact 1 billion parameters, making it suitable for applications where computational resources are a consideration.
  • Context Length: Features a 32768 token context window, enabling it to handle extensive inputs and generate coherent, long-form responses.
  • Unlearning Focus: The model's name explicitly indicates its development with 'unlearning' objectives, specifically 'forget10' and 'AltPO' (Alternating Policy Optimization) with particular learning rates and beta/alpha values. This suggests it has been trained to selectively remove or reduce knowledge of certain data points or patterns.

Potential Use Cases

  • Research in Machine Unlearning: Ideal for researchers exploring techniques to remove specific information from trained models, evaluate unlearning effectiveness, or study the impact of unlearning on model performance.
  • Controlled Information Removal: Can be used to experiment with methods for mitigating bias or removing sensitive data from LLMs post-training.
  • Comparative Analysis: Useful for comparing the behavior and capabilities of unlearned models against their original counterparts.