AggaMin/llama-3-8b-Instruct-bnb-4bit-aiaustin-demo
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:8kPublished:Jul 6, 2024License:llama3Architecture:Transformer Cold

AggaMin/llama-3-8b-Instruct-bnb-4bit-aiaustin-demo is an 8 billion parameter instruction-tuned language model, based on the Llama 3 architecture. This model is quantized using bnb-4bit for efficient deployment and reduced memory footprint, making it suitable for applications requiring a balance of performance and resource optimization. It is designed for general instruction-following tasks, leveraging its 8192-token context window for comprehensive understanding and generation.

Loading preview...