TusharGoel/llama-3p2-1B-embed
TEXT GENERATIONConcurrency Cost:1Model Size:1BQuant:BF16Ctx Length:32kPublished:Mar 23, 2026License:llama3.2Architecture:Transformer Warm

The TusharGoel/llama-3p2-1B-embed model is a 1.23 billion parameter Llama 3.2 family multilingual large language model developed by Meta, optimized for dialogue use cases including agentic retrieval and summarization. This instruction-tuned model features an optimized transformer architecture with Grouped-Query Attention (GQA) and a 32768 token context length. It excels in multilingual chat applications, supporting languages like English, German, French, Italian, Portuguese, Hindi, Spanish, and Thai. The model is designed for commercial and research use, particularly in constrained environments like mobile devices, offering strong performance in its size class.

Loading preview...