Trelis/Llama-2-7b-chat-hf-sharded-bf16
TEXT GENERATIONConcurrency Cost:1Model Size:7BQuant:FP8Ctx Length:4kPublished:Jul 21, 2023Architecture:Transformer0.0K Cold
Trelis/Llama-2-7b-chat-hf-sharded-bf16 is a sharded, 7 billion parameter version of Meta's Llama 2 Chat model, optimized for dialogue use cases. This auto-regressive language model uses an optimized transformer architecture and was fine-tuned with supervised fine-tuning (SFT) and reinforcement learning with human feedback (RLHF) to align with human preferences for helpfulness and safety. It is intended for commercial and research use in English, excelling in assistant-like chat applications.
Loading preview...