ClaudioSavelli/FAME_PO_llama32-1b-10-instruct-qa

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:1BQuant:BF16Ctx Length:32kPublished:May 4, 2026License:otherArchitecture:Transformer Warm

ClaudioSavelli/FAME_PO_llama32-1b-10-instruct-qa is a 1 billion parameter instruction-tuned language model, based on the Llama-3.2 architecture, specifically unlearned using the Preference Optimization method for the FAME setting. It features a 32768 token context length and is designed for question-answering tasks where unlearning specific information is critical. This model's primary differentiator is its application of Preference Optimization for unlearning within the FAME context, making it suitable for specialized instruction-following and QA scenarios.

Loading preview...

Model Overview

ClaudioSavelli/FAME_PO_llama32-1b-10-instruct-qa is a 1 billion parameter instruction-tuned language model built upon the Llama-3.2-1b-Instruct base. Its key characteristic is the application of a Preference Optimization (PO) method for "unlearning" within the FAME (Forgettable and Memorable Examples) setting. This process aims to modify the model's behavior regarding specific information, making it particularly relevant for use cases requiring controlled information retention or removal.

Key Capabilities

  • Instruction Following: Designed to respond to instructions, leveraging its instruction-tuned base.
  • Question Answering (QA): Optimized for question-answering tasks, likely benefiting from the unlearning process to provide specific, controlled responses.
  • Preference Optimization for Unlearning: Utilizes a specialized training method to modify or remove certain learned information, as detailed in the associated research paper.

Good For

  • Research into Model Unlearning: Ideal for researchers exploring methods of removing or modifying specific knowledge within large language models.
  • Controlled Information Generation: Suitable for applications where models need to avoid generating certain types of information or adhere to specific content guidelines through unlearning.
  • Specialized QA Systems: Can be applied in QA systems where the ability to 'forget' or 'unlearn' particular data points is a requirement.