yahma/llama-7b-hf
TEXT GENERATIONConcurrency Cost:1Model Size:7BQuant:FP8Ctx Length:4kPublished:Apr 8, 2023License:otherArchitecture:Transformer0.1K Loading

The yahma/llama-7b-hf model is a 7 billion parameter auto-regressive language model, based on the Transformer architecture, developed by the FAIR team of Meta AI. This version is a conversion of the original LLaMA-7B model, updated for compatibility with HuggingFace Transformers and resolving EOS token issues. Primarily intended for research, it serves as a foundational model for exploring applications like question answering and natural language understanding, with a context length of 4096 tokens.

Loading preview...

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p