Overview

This model, unlearn_tofu_Llama-3.2-1B-Instruct_forget10_RMU_lr5e-05_layer5_scoeff10_epoch10, is a 1-billion-parameter instruction-tuned language model based on the Llama-3.2 architecture. It supports a context length of 32768 tokens, so it can process and generate long sequences of text. The name indicates that the model was trained to "unlearn" specific information: "tofu" and "forget10" most likely refer to the TOFU unlearning benchmark and its 10% forget split, and "RMU" to the unlearning method, most likely Representation Misdirection for Unlearning. The remaining fields (lr5e-05, layer5, scoeff10, epoch10) appear to record training hyperparameters.
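Because the model name packs several fields into one string, a small parser makes them easier to inspect. The following is a minimal sketch, not an official tool: the function name parse_model_name, the output keys, and the reading of each field (for example, "scoeff" as a coefficient used by the unlearning method) are assumptions inferred from the name alone.

```python
import re

MODEL_NAME = (
    "unlearn_tofu_Llama-3.2-1B-Instruct_forget10_RMU_"
    "lr5e-05_layer5_scoeff10_epoch10"
)

# Assumed naming scheme, inferred only from this one model name:
# unlearn_<benchmark>_<base model>_<forget split>_<method>_lr<..>_layer<..>_scoeff<..>_epoch<..>
_NAME_PATTERN = re.compile(
    r"^unlearn_(?P<benchmark>[^_]+)"
    r"_(?P<base_model>.+?)"
    r"_(?P<forget_split>forget\d+)"
    r"_(?P<method>[^_]+)"
    r"_lr(?P<lr>[^_]+)"
    r"_layer(?P<layer>\d+)"
    r"_scoeff(?P<scoeff>[^_]+)"
    r"_epoch(?P<epochs>\d+)$"
)


def parse_model_name(name: str) -> dict:
    """Split a model name of the assumed form into typed fields."""
    match = _NAME_PATTERN.match(name)
    if match is None:
        raise ValueError(f"name does not follow the assumed scheme: {name!r}")
    fields = match.groupdict()
    return {
        "benchmark": fields["benchmark"],
        "base_model": fields["base_model"],
        "forget_split": fields["forget_split"],
        "method": fields["method"],
        "learning_rate": float(fields["lr"]),
        "layer": int(fields["layer"]),
        "scoeff": float(fields["scoeff"]),  # meaning assumed, see lead-in
        "epochs": int(fields["epochs"]),
    }


if __name__ == "__main__":
    for key, value in parse_model_name(MODEL_NAME).items():
        print(f"{key}: {value}")
```

The parser only splits the string; it does not confirm how the model was actually trained, so treat the extracted values as hints to check against the full model card.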

Key Capabilities

Instruction Following: Designed to respond to user instructions effectively.
Long Context Processing: Handles inputs up to 32768 tokens, beneficial for complex tasks requiring extensive context.
Selective Unlearning: Trained to remove or "forget" specific data points or patterns from what the model has learned, a capability relevant to privacy-preserving AI and content moderation.

Good For

Applications requiring models that can be updated to remove sensitive or outdated information.
Research into machine unlearning techniques and their practical applications.
Use cases where a model needs to demonstrate a controlled forgetting capability while retaining general instruction-following abilities.
