meta-llama/Llama-3.2-3B-Instruct
TEXT GENERATIONConcurrency Cost:1Model Size:3.2BQuant:BF16Ctx Length:32kPublished:Sep 18, 2024License:llama3.2Architecture:Transformer2.1K Gated Warm
meta-llama/Llama-3.2-3B-Instruct is a 3.21 billion parameter instruction-tuned generative language model developed by Meta, part of the Llama 3.2 multilingual collection. Optimized with an auto-regressive transformer architecture, SFT, and RLHF, it excels in multilingual dialogue, agentic retrieval, and summarization tasks. This model offers a 32768 token context length and is designed for commercial and research use, outperforming many open-source and closed chat models on common benchmarks.
Loading preview...
Popular Sampler Settings
Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.
temperature
–
top_p
–
top_k
–
frequency_penalty
–
presence_penalty
–
repetition_penalty
–
min_p
–