MInAlA/Llama-3.2-3B-Instruct-GRPO-merged

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:3.2BQuant:BF16Ctx Length:32kPublished:Apr 16, 2026Architecture:Transformer Warm

MInAlA/Llama-3.2-3B-Instruct-GRPO-merged is a 3.2 billion parameter instruction-tuned language model based on the Llama-3.2 architecture, featuring a 32768 token context length. This model is a merged version, indicating potential enhancements or specialized training, though specific differentiators are not detailed in the provided information. It is designed for general instruction-following tasks, leveraging its large context window for processing extensive inputs.

Loading preview...

Model Overview

MInAlA/Llama-3.2-3B-Instruct-GRPO-merged is an instruction-tuned language model with 3.2 billion parameters, built upon the Llama-3.2 architecture. It supports a substantial context length of 32768 tokens, allowing it to process and generate longer sequences of text. The "GRPO-merged" designation suggests that this model is a result of merging different models or applying specific optimization techniques, though the exact details of its development, training data, and unique capabilities are not provided in the current model card.

Key Characteristics

  • Architecture: Llama-3.2 base model.
  • Parameter Count: 3.2 billion parameters.
  • Context Length: Features a large context window of 32768 tokens, beneficial for tasks requiring extensive input understanding or long-form generation.
  • Instruction-Tuned: Designed to follow instructions effectively for various natural language processing tasks.

Intended Use Cases

Given the available information, this model is suitable for general instruction-following applications where a moderate-sized model with a large context window is advantageous. Potential uses include:

  • Text summarization of long documents.
  • Question answering over large bodies of text.
  • Content generation requiring extended context.
  • Conversational AI where maintaining long dialogue history is crucial.