PASI1028/Llama-3.2-3B-Instruct
TEXT GENERATIONConcurrency Cost:1Model Size:3.2BQuant:BF16Ctx Length:32kPublished:Mar 1, 2025License:llama3.2Architecture:Transformer Warm

PASI1028/Llama-3.2-3B-Instruct is a 3.21 billion parameter instruction-tuned generative language model developed by Meta, part of the Llama 3.2 family. Optimized for multilingual dialogue use cases, it excels in agentic retrieval and summarization tasks. This model features an optimized transformer architecture with Grouped-Query Attention (GQA) and supports a 32K context length, trained on up to 9 trillion tokens of publicly available online data with a knowledge cutoff of December 2023. It is designed for commercial and research use, particularly in assistant-like chat and agentic applications.

Loading preview...