The Meta-Llama-3.1-8B-Instruct-Q4_K_M is an 8 billion parameter instruction-tuned generative language model developed by Meta, part of the Llama 3.1 collection. It features an optimized transformer architecture with Grouped-Query Attention and a 128k context length, trained on over 15 trillion tokens with a December 2023 knowledge cutoff. This model is specifically optimized for multilingual dialogue use cases, outperforming many open-source and closed chat models on common industry benchmarks, and supports advanced tool use capabilities.
No reviews yet. Be the first to review!