taylorj94/Llama-3.2-1B
TEXT GENERATIONConcurrency Cost:1Model Size:1BQuant:BF16Ctx Length:32kLicense:llama3.2Architecture:Transformer Warm

The Llama 3.2-1B model by Meta is a 1 billion parameter, multilingual, instruction-tuned generative language model with a 32768 token context length. Optimized for multilingual dialogue, it excels in agentic retrieval and summarization tasks, outperforming many open-source and closed chat models on industry benchmarks. This model is designed for commercial and research use, particularly in constrained environments like mobile devices.

Loading preview...