gradientai/Llama-3-8B-Instruct-Gradient-4194k
Text Generation | Concurrency Cost: 1 | Model Size: 8B | Quant: FP8 | Ctx Length: 8k | Published: May 4, 2024 | License: llama3 | Architecture: Transformer | Warm: 0.1K
gradientai/Llama-3-8B-Instruct-Gradient-4194k is an 8-billion-parameter instruction-tuned Llama 3 model developed by Gradient. It extends the base Llama 3 8B context length from 8k to 4194k (roughly 4.2 million) tokens through progressive training on longer sequences and adjustment of the RoPE theta parameter. The model targets long-context applications and demonstrates that a state-of-the-art LLM can be adapted to much longer contexts with comparatively little additional training.
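As a minimal sketch of how the checkpoint might be loaded and prompted with Hugging Face transformers: the model ID comes from this page, but the dtype, device placement, and prompt below are illustrative assumptions rather than settings recommended by Gradient, and the full 4194k context would require far more memory than a single GPU provides.

```python
# Sketch: load the checkpoint and run a short chat-formatted prompt.
# dtype/device choices are assumptions; long-context use needs much more memory.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "gradientai/Llama-3-8B-Instruct-Gradient-4194k"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # assumption: bf16 weights on a sufficiently large GPU
    device_map="auto",
)

messages = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Summarize RoPE theta scaling in one sentence."},
]
input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

output = model.generate(input_ids, max_new_tokens=128, do_sample=False)
print(tokenizer.decode(output[0][input_ids.shape[-1]:], skip_special_tokens=True))
```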
Popular Sampler Settings
Top 3 parameter combinations used by Featherless users for this model, covering the samplers temperature, top_p, top_k, frequency_penalty, presence_penalty, repetition_penalty, and min_p (the specific values are loaded interactively and are not reproduced here).
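As a hedged illustration of how such sampler settings could be supplied when querying the model, the sketch below uses an OpenAI-compatible client. The endpoint URL, the API key environment variable, and the assumption that non-standard samplers (top_k, repetition_penalty, min_p) are accepted via extra_body are all assumptions; the numeric values are placeholders, not the popular configurations referenced above.

```python
# Sketch: pass sampler settings through an OpenAI-compatible chat completion call.
# Endpoint URL, env var name, and extra_body support are assumptions; values are placeholders.
import os
from openai import OpenAI

client = OpenAI(
    base_url="https://api.featherless.ai/v1",   # assumed OpenAI-compatible endpoint
    api_key=os.environ["FEATHERLESS_API_KEY"],  # hypothetical environment variable
)

response = client.chat.completions.create(
    model="gradientai/Llama-3-8B-Instruct-Gradient-4194k",
    messages=[{"role": "user", "content": "Write a haiku about long context windows."}],
    temperature=0.7,            # standard OpenAI sampling parameters
    top_p=0.9,
    frequency_penalty=0.0,
    presence_penalty=0.0,
    extra_body={                # non-standard samplers, if the backend accepts them
        "top_k": 40,
        "repetition_penalty": 1.1,
        "min_p": 0.05,
    },
)
print(response.choices[0].message.content)
```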