NousResearch/Llama-2-7b-chat-hf
TEXT GENERATIONConcurrency Cost:1Model Size:7BQuant:FP8Ctx Length:4kPublished:Jul 18, 2023Architecture:Transformer0.2K Warm

NousResearch/Llama-2-7b-chat-hf is a 7 billion parameter, fine-tuned generative text model developed by Meta, based on the Llama 2 architecture. Optimized for dialogue use cases, this model is converted for the Hugging Face Transformers format and features a 4096-token context length. It is specifically designed for assistant-like chat applications, outperforming many open-source chat models in benchmarks and human evaluations for helpfulness and safety.

Loading preview...

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p