Userb1az/llama3-8b

Warm
Public
8B
FP8
8192
May 3, 2024
License: llama3
Hugging Face
Overview

Meta Llama 3 8B Instruction-Tuned Model

This model is the 8 billion parameter instruction-tuned variant from Meta's Llama 3 family, designed for generative text and code. It features an optimized transformer architecture incorporating Grouped-Query Attention (GQA) for enhanced inference scalability and was trained on over 15 trillion tokens of publicly available data with a knowledge cutoff of March 2023. The instruction-tuned version is specifically optimized for dialogue use cases through supervised fine-tuning (SFT) and reinforcement learning with human feedback (RLHF), aiming for high helpfulness and safety.

Key Capabilities

  • Dialogue Optimization: Specifically tuned for assistant-like chat applications.
  • Performance: Outperforms many other open-source chat models on standard industry benchmarks, demonstrating strong capabilities across general reasoning, knowledge, and reading comprehension tasks.
  • Safety & Refusal Handling: Incorporates extensive red teaming and adversarial evaluations, with improvements to reduce false refusals compared to previous Llama versions.

Good For

  • Commercial and Research Use: Intended for a wide range of applications in English.
  • Assistant-like Chatbots: Its instruction-tuned nature makes it highly suitable for conversational AI.
  • Natural Language Generation: Adaptable for various text generation tasks where a robust, general-purpose model is needed.