The Meta Llama 3.1 8B Instruct model is an 8 billion parameter instruction-tuned generative language model developed by Meta, optimized for multilingual dialogue use cases. It utilizes an optimized transformer architecture with Grouped-Query Attention and a 128k token context length, trained on over 15 trillion tokens of publicly available online data. This model excels in general reasoning, code generation, and mathematical tasks, supporting languages including English, German, French, Italian, Portuguese, Hindi, Spanish, and Thai.
No reviews yet. Be the first to review!