meta-llama/Llama-3.2-3B-Instruct
TEXT GENERATIONConcurrency Cost:1Model Size:3.2BQuant:BF16Ctx Length:32kPublished:Sep 18, 2024License:llama3.2Architecture:Transformer2.1K Gated Warm

meta-llama/Llama-3.2-3B-Instruct is a 3.21 billion parameter instruction-tuned generative language model developed by Meta, part of the Llama 3.2 multilingual collection. Optimized with an auto-regressive transformer architecture, SFT, and RLHF, it excels in multilingual dialogue, agentic retrieval, and summarization tasks. This model offers a 32768 token context length and is designed for commercial and research use, outperforming many open-source and closed chat models on common benchmarks.

Loading preview...

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p