SubSir/Meta-Llama-3-8B
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:8kPublished:Jan 12, 2026License:llama3Architecture:Transformer Cold

Meta's Llama 3 8B is an 8 billion parameter instruction-tuned generative text model, part of the Llama 3 family, utilizing an optimized transformer architecture with Grouped-Query Attention (GQA) and a context length of 8192 tokens. Optimized for dialogue use cases, it is designed for commercial and research applications in English, outperforming many open-source chat models on common benchmarks. The model was trained on over 15 trillion tokens of publicly available online data, with its pretraining data cutoff in March 2023.

Loading preview...