promptora11/llama
TEXT GENERATIONConcurrency Cost:1Model Size:7BQuant:FP8Ctx Length:4kArchitecture:Transformer Cold

Llama 2 is a collection of pretrained and fine-tuned generative text models developed by Meta, ranging from 7 billion to 70 billion parameters. This specific model is the 7B fine-tuned variant, optimized for dialogue use cases and converted for the Hugging Face Transformers format. It utilizes an optimized transformer architecture and is trained on 2 trillion tokens of publicly available data with a 4k context length. Llama 2 Chat models are designed for assistant-like chat and demonstrate strong performance in helpfulness and safety.

Loading preview...