pvbhanuteja/Meta-Llama-3-70B-Instruct

Hugging Face
TEXT GENERATIONConcurrency Cost:4Model Size:70BQuant:FP8Ctx Length:8kLicense:llama3Architecture:Transformer Warm

Meta's Llama 3 70B Instruct is a 70 billion parameter instruction-tuned generative text model, part of the Llama 3 family, optimized for dialogue use cases. It utilizes an optimized transformer architecture with Grouped-Query Attention and has a context length of 8192 tokens. This model is fine-tuned using SFT and RLHF to align with human preferences for helpfulness and safety, outperforming many open-source chat models on common industry benchmarks.

Loading preview...

Overview

Meta's Llama 3 70B Instruct is a powerful 70 billion parameter instruction-tuned language model designed for dialogue and general text generation tasks. Developed by Meta, it leverages an optimized transformer architecture and Grouped-Query Attention (GQA) for efficient inference. The model was trained on over 15 trillion tokens of publicly available data, with a knowledge cutoff of December 2023, and fine-tuned with over 10 million human-annotated examples using Supervised Fine-Tuning (SFT) and Reinforcement Learning with Human Feedback (RLHF).

Key Capabilities

  • Enhanced Performance: Significantly outperforms its predecessor, Llama 2 70B, and many other open-source chat models across various benchmarks, including MMLU (82.0), HumanEval (81.7), and GSM-8K (93.0).
  • Dialogue Optimization: Specifically instruction-tuned for assistant-like chat applications, demonstrating improved helpfulness and safety alignment.
  • Reduced Refusals: Engineered to be less prone to false refusals on benign prompts compared to Llama 2, enhancing user experience.
  • Code Generation: Shows strong performance in coding tasks, achieving 81.7 on HumanEval.

Good for

  • Commercial and Research Use: Intended for a wide range of applications in English.
  • Assistant-like Chatbots: Its instruction-tuned nature makes it ideal for conversational AI.
  • Natural Language Generation: Adaptable for various text generation tasks where high-quality, coherent output is required.
  • Developers requiring robust safety features: Meta provides resources like Meta Llama Guard 2 and Code Shield to help developers implement additional safety layers.

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p