princeton-nlp/Llama-3-8B-ProLong-64k-Instruct

Text generation · Model size: 8B · Quantization: FP8 · Context length: 64K · Published: Jul 21, 2024 · License: llama3 · Architecture: Transformer

princeton-nlp/Llama-3-8B-ProLong-64k-Instruct is an 8 billion parameter instruction-tuned language model developed by Princeton NLP. It belongs to the ProLong family of long-context models and is continually trained from Llama-3-8B. This variant supports a 64K-token context window, making it suitable for tasks that require processing moderately long inputs.


Model Overview

princeton-nlp/Llama-3-8B-ProLong-64k-Instruct is an 8 billion parameter instruction-tuned model from the ProLong (Princeton long-context language models) family, developed by Princeton NLP. It is continually trained from Llama-3-8B and specifically optimized for handling long contexts. This model is one variant within the ProLong collection, which includes models with context windows up to 512K tokens.
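Since the checkpoint is distributed in the standard Hugging Face format, it is typically queried through `transformers` with the model's own chat template. As a lightweight illustration of what that template produces, the sketch below builds a Llama-3-style chat prompt by hand. It assumes ProLong inherits the stock Llama-3 chat format (header tokens and `<|eot_id|>`), which you should verify against the tokenizer's `apply_chat_template` output:

```python
# Sketch: hand-rolled Llama-3 chat prompt. Assumes ProLong keeps the
# stock Llama-3 template; verify against tokenizer.apply_chat_template.
def build_llama3_prompt(messages):
    """messages: list of {"role": ..., "content": ...} dicts."""
    parts = ["<|begin_of_text|>"]
    for m in messages:
        parts.append(
            f"<|start_header_id|>{m['role']}<|end_header_id|>\n\n"
            f"{m['content']}<|eot_id|>"
        )
    # Open an assistant turn so the model generates the reply.
    parts.append("<|start_header_id|>assistant<|end_header_id|>\n\n")
    return "".join(parts)

prompt = build_llama3_prompt(
    [{"role": "user", "content": "Summarize the attached report."}]
)
```

In practice, prefer the model's own tokenizer (e.g. `AutoTokenizer.from_pretrained("princeton-nlp/Llama-3-8B-ProLong-64k-Instruct")` followed by `tokenizer.apply_chat_template(messages, add_generation_prompt=True)`) rather than hand-building the string.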

Key Capabilities & Training

  • Long-Context Processing: This model is designed to process inputs up to 64K tokens, making it suitable for tasks requiring understanding and generating content over moderately long documents or conversations.
  • Instruction Following: As an instruction-tuned model, it is optimized to follow user instructions effectively, making it versatile for various NLP applications.
  • Training Methodology: The ProLong models undergo continued training on specialized long-context datasets (e.g., princeton-nlp/prolong-data-64K) and are then supervised fine-tuned (SFT) on datasets such as UltraChat. Development involved extensive ablations over both the continued pre-training data and the SFT data to optimize long-context performance, as detailed in the paper "How to Train Long-Context Language Models (Effectively)".

Use Cases

This model is particularly well-suited for applications that benefit from processing and generating content within a 64K token context window, such as:

  • Document Analysis: Summarizing, querying, or extracting information from lengthy texts.
  • Extended Conversations: Maintaining coherence and context over long dialogue turns.
  • Code Understanding: Analyzing and generating code snippets within a larger codebase context.
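For document-analysis workloads like these, it is worth checking that an input actually fits the 64K window before sending it. The sketch below uses a rough characters-per-token heuristic (an assumed ~4 characters per token, a common rule of thumb, not this tokenizer's measured rate) to budget a document alongside a reserved generation budget; for exact counts, tokenize with the model's own tokenizer:

```python
# Rough context-budget check for the 64K window. The 4-chars-per-token
# ratio is a heuristic assumption, not this tokenizer's actual rate;
# use the model's tokenizer for exact counts.
CTX_TOKENS = 64 * 1024
CHARS_PER_TOKEN = 4  # heuristic estimate

def fits_in_context(document: str, reserve_for_output: int = 1024) -> bool:
    """Estimate whether `document` plus a generation budget fits in 64K tokens."""
    est_tokens = len(document) / CHARS_PER_TOKEN
    return est_tokens + reserve_for_output <= CTX_TOKENS

def max_document_chars(reserve_for_output: int = 1024) -> int:
    """Approximate character budget left for the document itself."""
    return (CTX_TOKENS - reserve_for_output) * CHARS_PER_TOKEN
```

Documents that exceed the budget can then be chunked or summarized hierarchically before being passed to the model.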