gradientai/Llama-3-70B-Instruct-Gradient-1048k
TEXT GENERATIONConcurrency Cost:4Model Size:70BQuant:FP8Ctx Length:8kPublished:May 3, 2024License:llama3Architecture:Transformer0.1K Warm
The gradientai/Llama-3-70B-Instruct-Gradient-1048k is a 70 billion parameter instruction-tuned language model developed by Gradient. It extends the context length of the base Meta Llama 3 70B Instruct model from 8k to over 1 million tokens, utilizing techniques like NTK-aware interpolation and progressive training. This model is specifically optimized for handling extremely long contexts, making it suitable for applications requiring extensive document analysis or prolonged conversational memory.
Loading preview...
Popular Sampler Settings
Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.
temperature
–
top_p
–
top_k
–
frequency_penalty
–
presence_penalty
–
repetition_penalty
–
min_p
–