Undi95/Meta-Llama-3-70B-Instruct-hf

Text generation · Concurrency cost: 4 · Model size: 70B · Quant: FP8 · Context length: 8k · Published: Apr 18, 2024 · License: llama3 · Architecture: Transformer

Undi95/Meta-Llama-3-70B-Instruct-hf is a 70 billion parameter instruction-tuned generative text model developed by Meta, part of the Llama 3 family. Optimized for dialogue use cases, it utilizes an auto-regressive transformer architecture with Grouped-Query Attention and a context length of 8192 tokens. This model is fine-tuned using SFT and RLHF to align with human preferences for helpfulness and safety, outperforming many open-source chat models on industry benchmarks.


Model Overview

Undi95/Meta-Llama-3-70B-Instruct-hf is a 70 billion parameter instruction-tuned model from Meta's Llama 3 family, designed for dialogue and assistant-like chat applications. It leverages an optimized transformer architecture with Grouped-Query Attention (GQA) for efficient inference and supports an 8k token context length. The model was trained on over 15 trillion tokens of publicly available data, with a knowledge cutoff of December 2023 for the 70B version.
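Because this is an instruction-tuned Llama 3 model, prompts must follow Meta's Llama 3 chat template. The sketch below hand-builds a single-turn prompt so the special tokens are visible; in practice you would let `tokenizer.apply_chat_template` from `transformers` do this for you. The token names follow Meta's published template; the system and user strings are illustrative.

```python
# Minimal sketch of the Llama 3 instruct prompt format (single turn).
# Assumption: hand-assembled for illustration; normally produced by
# tokenizer.apply_chat_template on a list of {"role", "content"} messages.

def build_llama3_prompt(system: str, user: str) -> str:
    """Assemble a single-turn Llama 3 instruct prompt by hand."""
    return (
        "<|begin_of_text|>"
        "<|start_header_id|>system<|end_header_id|>\n\n"
        f"{system}<|eot_id|>"
        "<|start_header_id|>user<|end_header_id|>\n\n"
        f"{user}<|eot_id|>"
        # The prompt ends with an open assistant header so the model
        # generates the assistant's reply next.
        "<|start_header_id|>assistant<|end_header_id|>\n\n"
    )

prompt = build_llama3_prompt("You are a helpful assistant.", "Hello!")
```

Generation should stop on the `<|eot_id|>` token, which the instruct model emits at the end of each turn.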

Key Capabilities

  • Enhanced Dialogue Performance: Optimized for chat and assistant-like interactions through supervised fine-tuning (SFT) and reinforcement learning with human feedback (RLHF).
  • Strong Benchmark Results: Significantly outperforms Llama 2 70B across various benchmarks, including MMLU (82.0 vs 52.9), HumanEval (81.7 vs 25.6), and GSM-8K (93.0 vs 57.5).
  • Safety and Refusal Improvements: Features extensive red teaming, adversarial evaluations, and mitigations to reduce residual risks and significantly decrease false refusals compared to Llama 2.
  • Code Generation: Demonstrates strong performance in code generation tasks, achieving 81.7 on HumanEval.

Good for

  • Commercial and Research Use: Intended for a wide range of applications in English-speaking contexts.
  • Assistant-like Chatbots: Its instruction-tuned nature makes it highly suitable for conversational AI and virtual assistants.
  • Code Generation Tasks: Excels in generating code, making it valuable for developer tools and programming assistance.
  • Applications Requiring High Helpfulness: Designed to be highly helpful while incorporating robust safety measures.

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Each config is a combination of the following sampler parameters:

  • temperature
  • top_p
  • top_k
  • frequency_penalty
  • presence_penalty
  • repetition_penalty
  • min_p
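These sampler parameters are typically sent alongside the prompt in an OpenAI-compatible completion request. A minimal sketch, assuming such an endpoint: the helper below builds the request body, and the specific values shown are illustrative defaults, not the community presets from the tabs above.

```python
# Sketch: assembling a completion request body with sampler settings.
# Assumption: an OpenAI-compatible API that accepts these parameter names;
# the values here are examples, not recommended settings.

ALLOWED_SAMPLERS = {
    "temperature", "top_p", "top_k", "frequency_penalty",
    "presence_penalty", "repetition_penalty", "min_p",
}

def make_request_body(prompt: str, **samplers) -> dict:
    """Build a completion request, keeping only known sampler keys."""
    body = {
        "model": "Undi95/Meta-Llama-3-70B-Instruct-hf",
        "prompt": prompt,
        "max_tokens": 256,
    }
    body.update({k: v for k, v in samplers.items() if k in ALLOWED_SAMPLERS})
    return body

body = make_request_body("Hello", temperature=0.7, top_p=0.9, min_p=0.05)
```

Unknown keyword arguments are silently dropped, so a config copied from another backend will not send parameters this endpoint does not recognize.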