vishnu-vs/llama-7bhf
TEXT GENERATION | Concurrency Cost: 1 | Model Size: 7B | Quant: FP8 | Ctx Length: 4k | Architecture: Transformer | Cold

vishnu-vs/llama-7bhf is a 7-billion-parameter fine-tuned generative text model from Meta's Llama 2 family. It is optimized for dialogue use cases, has been converted to the Hugging Face Transformers format, and has a 4096-token context length. It is intended for commercial and research applications in English and is described as outperforming many open-source chat models on benchmarks.
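Because the prompt and the generated text share the 4096-token context window, it can help to clamp the generation length before calling the model. The following is a minimal sketch, not an official usage recipe: it assumes the checkpoint can be loaded from the Hugging Face Hub under the id `vishnu-vs/llama-7bhf` with the standard `transformers` auto classes and that PyTorch is installed. The helper names `max_new_tokens_for` and `generate` are hypothetical and exist only for illustration.

```python
CONTEXT_LENGTH = 4096  # context length stated in the description above


def max_new_tokens_for(prompt_tokens: int, requested: int,
                       context_length: int = CONTEXT_LENGTH) -> int:
    """Clamp the requested output length so prompt + output fit in the window."""
    if prompt_tokens >= context_length:
        raise ValueError("prompt already fills the context window")
    return min(requested, context_length - prompt_tokens)


def generate(prompt: str, requested_new_tokens: int = 256,
             model_id: str = "vishnu-vs/llama-7bhf") -> str:
    """Hypothetical helper: load the model and generate within the token budget.

    Assumption: model_id resolves to a Transformers-format causal LM checkpoint.
    """
    # Imported lazily so the budget helper works without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id)

    inputs = tokenizer(prompt, return_tensors="pt")
    prompt_tokens = inputs["input_ids"].shape[1]
    budget = max_new_tokens_for(prompt_tokens, requested_new_tokens)

    output_ids = model.generate(**inputs, max_new_tokens=budget)
    # Decode only the newly generated tokens, not the echoed prompt.
    return tokenizer.decode(output_ids[0][prompt_tokens:], skip_special_tokens=True)
```

For example, a 4000-token prompt with 256 requested tokens would be clamped to 96 new tokens, while a 100-token prompt would keep the full 256.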
