vasimakram01/dawah_llama2_working_repo
vasimakram01/dawah_llama2_working_repo hosts a 7-billion-parameter Llama 2 generative text model, developed by Meta and fine-tuned for dialogue use cases. The model uses an optimized transformer architecture and has been converted to the Hugging Face Transformers format. It is aimed at assistant-like chat applications, and Llama-2-Chat models outperform many open-source chat models on benchmarks for helpfulness and safety. The model takes text as input and generates text as output, making it suitable for a range of natural language generation tasks.
Overview
This repository hosts a 7-billion-parameter variant of the Llama 2 model, developed by Meta. Llama 2 is a family of large language models (LLMs) ranging from 7B to 70B parameters; this specific version is fine-tuned for dialogue. The models are built on an optimized transformer architecture and have undergone supervised fine-tuning (SFT) and reinforcement learning from human feedback (RLHF) to better align with human preferences for helpfulness and safety.
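Since the checkpoint is in the Hugging Face Transformers format, it can be loaded with the standard `AutoModelForCausalLM`/`AutoTokenizer` APIs. A minimal sketch is below; it assumes the repo id matches this repository on the Hub and that `transformers`, `torch`, and enough memory for a 7B model (roughly 14 GB in fp16) are available.

```python
# Sketch of loading this checkpoint with Hugging Face Transformers.
# Assumption: the Hub repo id below is correct for this repository.

MODEL_ID = "vasimakram01/dawah_llama2_working_repo"

def load_chat_pipeline(model_id=MODEL_ID):
    """Build a text-generation pipeline for the 7B chat model."""
    # Heavy imports are kept inside the function so the module can be
    # inspected without transformers installed; actually calling this
    # downloads the weights and needs substantial memory.
    from transformers import AutoModelForCausalLM, AutoTokenizer, pipeline

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")
    return pipeline("text-generation", model=model, tokenizer=tokenizer)
```

Typical usage would then be `pipe = load_chat_pipeline()` followed by `pipe("Hello!", max_new_tokens=128)`.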
Key Capabilities
- Dialogue Optimization: Specifically fine-tuned for assistant-like chat applications.
- Performance: Llama-2-Chat models demonstrate strong performance against open-source chat models and are competitive with some closed-source alternatives like ChatGPT and PaLM in human evaluations for helpfulness and safety.
- Text Generation: Capable of generating text from text inputs, suitable for various natural language generation tasks.
- Training Data: Pretrained on 2 trillion tokens from publicly available sources, with fine-tuning data including over one million human-annotated examples.
Intended Use Cases
- Commercial and Research: Intended for commercial and research use in English-speaking contexts.
- Assistant-like Chat: The fine-tuned versions are optimized for conversational AI.
- Natural Language Generation: Pretrained models can be adapted for a wide array of text generation tasks.
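Because the fine-tuned chat versions were trained on Meta's Llama 2 conversation template, prompts generally work best when wrapped in the `[INST]`/`<<SYS>>` format. The helper below is an illustrative single-turn builder for that template (the function name is our own, not part of any library):

```python
# Minimal single-turn prompt builder for the Llama-2-Chat template.
# The system prompt is optional; when present it is wrapped in <<SYS>> tags.

def build_llama2_prompt(user_message, system_prompt=None):
    """Wrap a user message in the Llama-2-Chat [INST] template."""
    if system_prompt:
        inner = f"<<SYS>>\n{system_prompt}\n<</SYS>>\n\n{user_message}"
    else:
        inner = user_message
    return f"<s>[INST] {inner} [/INST]"
```

For example, `build_llama2_prompt("What is Llama 2?", "Answer concisely.")` produces a string beginning with `<s>[INST] <<SYS>>` and ending with `[/INST]`, ready to pass to the tokenizer. For multi-turn chat, recent versions of `transformers` can apply this template automatically via the tokenizer's `apply_chat_template` method.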