tbmod/Llama-3.2-1B-Instruct
Text Generation | Concurrency Cost: 1 | Model Size: 1B | Quant: BF16 | Ctx Length: 32k | Published: Feb 19, 2026 | License: llama3.2 | Architecture: Transformer | Warm

tbmod/Llama-3.2-1B-Instruct is a 1-billion-parameter instruction-tuned model from the Meta Llama 3.2 family, optimized for multilingual dialogue. It is an auto-regressive language model built on an optimized transformer architecture, and it uses Grouped-Query Attention (GQA) to improve inference scalability. It is well suited to agentic retrieval and summarization tasks and supports languages including English, German, French, Italian, Portuguese, Hindi, Spanish, and Thai. The model is also set up for efficient fine-tuning, particularly with Unsloth, which is reported to reduce training time and memory use.
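To make the GQA point concrete, the rough sketch below estimates the key/value cache size at full context with and without grouped KV heads. The layer count, head counts, and head dimension are assumptions taken from the commonly published Llama 3.2 1B configuration, not from this listing, and "32k" is assumed to mean 32 * 1024 tokens; kv_cache_bytes is a hypothetical helper written for this illustration.

```python
def kv_cache_bytes(num_layers, num_kv_heads, head_dim, seq_len, bytes_per_value):
    """Approximate KV cache size for one sequence.

    The leading factor of 2 accounts for storing both keys and values.
    """
    return 2 * num_layers * num_kv_heads * head_dim * seq_len * bytes_per_value


# Assumed configuration values (not stated in this listing).
NUM_LAYERS = 16        # assumed transformer layer count
NUM_QUERY_HEADS = 32   # assumed number of query heads
NUM_KV_HEADS = 8       # assumed number of shared key/value heads under GQA
HEAD_DIM = 64          # assumed per-head dimension
SEQ_LEN = 32 * 1024    # "32k" context, assumed to be 32768 tokens
BF16_BYTES = 2         # BF16 stores each value in 2 bytes

# With GQA, only NUM_KV_HEADS key/value heads are cached per layer.
gqa_bytes = kv_cache_bytes(NUM_LAYERS, NUM_KV_HEADS, HEAD_DIM, SEQ_LEN, BF16_BYTES)

# Without GQA (standard multi-head attention), every query head has its own K/V.
mha_bytes = kv_cache_bytes(NUM_LAYERS, NUM_QUERY_HEADS, HEAD_DIM, SEQ_LEN, BF16_BYTES)

print(f"GQA KV cache: {gqa_bytes / 2**30:.2f} GiB")  # GQA KV cache: 1.00 GiB
print(f"MHA KV cache: {mha_bytes / 2**30:.2f} GiB")  # MHA KV cache: 4.00 GiB
print(f"Reduction factor: {mha_bytes // gqa_bytes}x")  # Reduction factor: 4x
```

Under these assumed values, sharing each key/value head across four query heads cuts the per-sequence cache by a factor of four, which is the main way GQA helps serve long contexts or more concurrent requests in the same memory budget.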
