princeton-nlp/Llama-3-8B-ProLong-512k-Instruct
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:8kPublished:Aug 22, 2024License:llama3Architecture:Transformer0.0K Warm

Llama-3-8B-ProLong-512k-Instruct is an 8 billion parameter instruction-tuned language model developed by Princeton NLP, based on Llama-3-8B. It is specifically optimized for long-context understanding, featuring an extended context window of 512,000 tokens. This model excels at processing and generating content over extremely long inputs, making it suitable for tasks requiring extensive document analysis or conversation history.

Loading preview...

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p