allura-forge/Llama-3.3-8B-Instruct
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:8kPublished:Dec 30, 2025License:llama3.3Architecture:Transformer0.2K Warm

allura-forge/Llama-3.3-8B-Instruct is an 8 billion parameter instruction-tuned causal language model, identified as a version of Meta's Llama 3.3. This model was extracted from Meta's Llama API and shows improved performance over Llama 3.1 8B Instruct on benchmarks like IFEval and GPQA Diamond. It is suitable for general instruction-following tasks, with a variant available that extends its context length to 128k tokens.

Loading preview...