fengtc/Llama-2-7b-chat-hf
TEXT GENERATIONConcurrency Cost:1Model Size:7BQuant:FP8Ctx Length:4kArchitecture:Transformer0.0K Cold

Llama 2 7B Chat is a 7 billion parameter generative text model developed by Meta, fine-tuned for dialogue use cases and optimized for chat applications. This model utilizes an optimized transformer architecture and was trained on 2 trillion tokens of publicly available data with a 4k context length. It is designed for commercial and research use in English, outperforming many open-source chat models in benchmarks for helpfulness and safety.

Loading preview...