meta-llama/Llama-3.2-3B
TEXT GENERATIONConcurrency Cost:1Model Size:3.2BQuant:BF16Ctx Length:32kPublished:Sep 18, 2024License:llama3.2Architecture:Transformer0.7K Gated Warm

The Llama 3.2-3B is a 3.21 billion parameter multilingual large language model developed by Meta, utilizing an optimized transformer architecture. It is instruction-tuned for multilingual dialogue, excelling in agentic retrieval and summarization tasks. This model supports a 32,768 token context length and is optimized for deployment in constrained environments, including mobile devices.

Loading preview...

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p