gradientai/Llama-3-70B-Instruct-Gradient-524k
TEXT GENERATION
Concurrency Cost: 4
Model Size: 70B
Quant: FP8
Ctx Length: 8k
Published: May 3, 2024
License: llama3
Architecture: Transformer
0.0K Warm
gradientai/Llama-3-70B-Instruct-Gradient-524k is a 70-billion-parameter instruction-tuned language model developed by Gradient that extends Meta's Llama-3 70B. It increases the context length from 8K to over 524K tokens, so it can process very long documents and conversations in a single pass. The long-context capability comes from progressive training combined with adjusted RoPE theta values, which makes the model suited to applications that need contextual understanding across large amounts of text.
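To illustrate why a RoPE theta adjustment matters for long context, here is a minimal sketch of the standard rotary position embedding frequency formula. It is not Gradient's training code. The head dimension (128) and base theta (500,000) are assumed values for Llama-3 70B that do not appear in the description above, and the 100x multiplier is purely hypothetical, chosen only to show the direction of the effect.

```python
import math


def rope_wavelengths(head_dim: int, theta: float) -> list[float]:
    """Wavelength, in token positions, of each rotary frequency pair.

    Standard RoPE uses inv_freq_i = theta ** (-2i / head_dim)
    for i = 0 .. head_dim/2 - 1; the wavelength is 2*pi / inv_freq_i.
    """
    inv_freq = [theta ** (-2 * i / head_dim) for i in range(head_dim // 2)]
    return [2 * math.pi / f for f in inv_freq]


HEAD_DIM = 128                   # assumed Llama-3 70B head dimension
BASE_THETA = 500_000.0           # assumed Llama-3 base rope_theta
SCALED_THETA = 100 * BASE_THETA  # hypothetical multiplier, not Gradient's value

base = rope_wavelengths(HEAD_DIM, BASE_THETA)
scaled = rope_wavelengths(HEAD_DIM, SCALED_THETA)

# Ratio by which the slowest-rotating pair's wavelength grows.
slowest_ratio = scaled[-1] / base[-1]
print(f"fastest pair: {base[0]:.3f} -> {scaled[0]:.3f} positions")
print(f"slowest pair grows by a factor of {slowest_ratio:.2f}")
```

Raising theta leaves the fastest-rotating pair unchanged and stretches the slower ones, so distant positions are encoded with rotation angles closer to those the model saw at shorter lengths. The model still needs further training at the longer lengths to make use of this.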
Popular Sampler Settings
Top 3 parameter combinations used by Featherless users for this model.
temperature: –
top_p: –
top_k: –
frequency_penalty: –
presence_penalty: –
repetition_penalty: –
min_p: –