princeton-nlp/Llama-3-8B-ProLong-64k-Base
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:8kPublished:Jul 22, 2024License:llama3Architecture:Transformer0.0K Warm

The princeton-nlp/Llama-3-8B-ProLong-64k-Base is an 8 billion parameter base model from the ProLong family, developed by Princeton NLP. It is continued trained from Llama-3-8B with a focus on long-context understanding, supporting a context window of up to 64K tokens. This model is designed for applications requiring processing and generating text over extended document lengths.

Loading preview...