MRNH/llama-2-13b-chat-hf
Text Generation | Concurrency Cost: 1 | Model Size: 13B | Quant: FP8 | Ctx Length: 4k | Architecture: Transformer | Cold

MRNH/llama-2-13b-chat-hf is a 13-billion-parameter conversational language model based on Llama 2 and developed by MRNH. It is fine-tuned for chat applications and supports a context length of 4096 tokens. It was trained with 4-bit quantization using bitsandbytes, which lowers memory requirements while aiming to preserve quality for interactive dialogue.
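
To give a rough sense of why quantization matters at this model size, the sketch below estimates the memory needed for the weights alone at different precisions. It assumes a nominal 13e9 parameters (the exact count is not stated here) and ignores activations, the KV cache, and quantization overhead such as scaling factors, so real usage will be higher. The function name weight_memory_gib is hypothetical and chosen only for illustration.

```python
def weight_memory_gib(num_params: float, bits_per_param: int) -> float:
    """Approximate memory for model weights alone, in GiB.

    Ignores activations, KV cache, and per-block quantization metadata.
    """
    bytes_total = num_params * bits_per_param / 8
    return bytes_total / 2**30


# Assumption: nominal "13B" parameter count; the exact figure may differ.
NUM_PARAMS = 13e9

for label, bits in [("FP16", 16), ("FP8", 8), ("4-bit", 4)]:
    print(f"{label}: ~{weight_memory_gib(NUM_PARAMS, bits):.1f} GiB")
```

Under these assumptions, halving the bits per parameter halves the weight footprint, which is the main reason a 4-bit or FP8 variant can fit on hardware where a 16-bit copy would not.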
