PawanKrd/Meta-Llama-3-8B-Instruct
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:8kPublished:Apr 18, 2024License:llama3Architecture:Transformer Cold

PawanKrd/Meta-Llama-3-8B-Instruct is an 8 billion parameter instruction-tuned causal language model developed by Meta, part of the Llama 3 family. Optimized for dialogue use cases, it leverages an optimized transformer architecture with Grouped-Query Attention (GQA) and is fine-tuned using SFT and RLHF. This model excels in assistant-like chat applications and demonstrates strong performance across various benchmarks, including MMLU and HumanEval.

Loading preview...