gradientai/Llama-3-70B-Instruct-Gradient-1048k
TEXT GENERATIONConcurrency Cost:4Model Size:70BQuant:FP8Ctx Length:8kPublished:May 3, 2024License:llama3Architecture:Transformer0.1K Warm

The gradientai/Llama-3-70B-Instruct-Gradient-1048k is a 70 billion parameter instruction-tuned language model developed by Gradient. It extends the context length of the base Meta Llama 3 70B Instruct model from 8k to over 1 million tokens, utilizing techniques like NTK-aware interpolation and progressive training. This model is specifically optimized for handling extremely long contexts, making it suitable for applications requiring extensive document analysis or prolonged conversational memory.

Loading preview...

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p