zededa/Llama-3.2-1B-Instruct
TEXT GENERATIONConcurrency Cost:1Model Size:1BQuant:BF16Ctx Length:32kPublished:Apr 7, 2026License:llama3.2Architecture:Transformer Loading

The zededa/Llama-3.2-1B-Instruct is a 1 billion parameter instruction-tuned causal language model developed by Meta, based on the Llama 3.2 architecture. Optimized for multilingual dialogue use cases, including agentic retrieval and summarization, it supports English, German, French, Italian, Portuguese, Hindi, Spanish, and Thai. This model features an optimized transformer architecture with Grouped-Query Attention and a 32768 token context length, outperforming many open-source and closed chat models on common benchmarks.

Loading preview...