aifeifei798/Meta-Llama-3.1-8B-Instruct

TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:32kPublished:Jul 23, 2024License:llama3.1Architecture:Transformer0.0K Cold

Meta-Llama-3.1-8B-Instruct is an 8 billion parameter instruction-tuned generative language model developed by Meta, part of the Llama 3.1 family. Optimized for multilingual dialogue use cases, it features a 128k context length and is trained on over 15 trillion tokens of publicly available online data with a December 2023 cutoff. This model excels in assistant-like chat across supported languages including English, German, French, Italian, Portuguese, Hindi, Spanish, and Thai, and demonstrates strong performance in general reasoning, code generation, and mathematical tasks.

Loading preview...

Meta-Llama-3.1-8B-Instruct: Multilingual Dialogue and Enhanced Capabilities

Meta-Llama-3.1-8B-Instruct is an 8 billion parameter instruction-tuned model from Meta's Llama 3.1 family, released on July 23, 2024. This model is specifically optimized for multilingual dialogue and assistant-like chat applications, supporting English, German, French, Italian, Portuguese, Hindi, Spanish, and Thai with a 128k token context window. It leverages an optimized transformer architecture, fine-tuned using supervised fine-tuning (SFT) and reinforcement learning with human feedback (RLHF) to align with human preferences for helpfulness and safety.

Key Capabilities

  • Multilingual Dialogue: Optimized for assistant-like chat in 8 supported languages, with potential for fine-tuning in others.
  • Extended Context: Features a substantial 128k token context length, enabling processing of longer inputs and generating more comprehensive responses.
  • Enhanced Performance: Demonstrates improved scores over Llama 3 8B Instruct on various benchmarks, including MMLU (69.4% vs 68.5%), HumanEval (72.6% vs 60.4%), GSM-8K (84.5% vs 80.6%), and MATH (51.9% vs 29.1%).
  • Tool Use: Shows significant improvement in tool-use benchmarks like API-Bank (82.6% vs 48.3%) and BFCL (76.1% vs 60.3%).
  • Robust Safety: Developed with a three-pronged safety strategy, including developer enablement, protection against adversarial users, and community safeguards like Llama Guard 3 and Prompt Guard.

Good for

  • Building multilingual chatbots and virtual assistants.
  • Applications requiring long-context understanding and generation.
  • Tasks involving code generation and mathematical problem-solving.
  • Developing systems that integrate with external tools and APIs.
  • Research into safety fine-tuning and responsible AI deployment.