zai-org/LongAlign-7B-64k
LongAlign-7B-64k is a 7 billion parameter chat model developed by THUDM, based on Llama-2-7B, with an extended context window of 64k tokens. It is fine-tuned specifically for long-context instruction following, using the LongAlign-10k dataset and specialized training strategies such as packing with loss weighting and sorted batching. It can process and respond to queries of up to 64k tokens, making it suitable for applications that require extensive contextual understanding.
LongAlign-7B-64k Overview
LongAlign-7B-64k is a 7 billion parameter chat model, developed by THUDM, that extends the Llama-2-7B architecture to support a 64k token context window. It is part of the LongAlign project, a recipe for long-context alignment of Large Language Models (LLMs) covering data construction, training, and evaluation.
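The sketch below shows one way to load and query the model with Hugging Face Transformers. The repository id, dtype, and Llama-2-style `[INST] ... [/INST]` prompt wrapper are assumptions and should be checked against the official model card.

```python
# Minimal usage sketch (assumptions: repo id, prompt format, bf16 weights).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "zai-org/LongAlign-7B-64k"  # assumed Hugging Face repo id
tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # half precision to fit long contexts in memory
    device_map="auto",
    trust_remote_code=True,
)

# Llama-2-chat-style instruction wrapper (assumption); the document can be
# tens of thousands of tokens long.
long_document = "..."  # paste the long input here
prompt = f"[INST]{long_document}\n\nSummarize the document above.[/INST]"

inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=512, do_sample=False)
print(tokenizer.decode(output[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True))
```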
Key Capabilities
- Extended Context Window: Processes inputs up to 64,000 tokens, significantly enhancing its ability to handle lengthy documents, conversations, or code.
- Long-Context Instruction Following: Specifically trained on the LongAlign-10k dataset, which comprises 10,000 long instruction data samples ranging from 8k to 64k tokens.
- Optimized Training Strategies: Uses packing with loss weighting and sorted batching to train efficiently on long sequences of widely varying lengths (a conceptual sketch follows this list).
- Instruction-Following Evaluation: Evaluated using LongBench-Chat, a benchmark designed to assess instruction-following capabilities on queries between 10k and 100k tokens.
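To make the sorted-batching idea concrete, the sketch below groups samples of similar length into the same batch so that padding and wasted compute are minimized. This is a conceptual illustration only; the function name, sample format, and token budget are hypothetical and not taken from the LongAlign training code.

```python
# Conceptual sketch of sorted batching: sort samples by token length, then
# pack consecutive samples into batches under a fixed token budget so each
# batch contains sequences of similar length.
from typing import Dict, List


def sorted_batches(samples: List[Dict], tokens_per_batch: int = 65536) -> List[List[Dict]]:
    """Return batches of samples with similar lengths, each under a token budget."""
    ordered = sorted(samples, key=lambda s: len(s["input_ids"]))
    batches, current, current_tokens = [], [], 0
    for sample in ordered:
        n = len(sample["input_ids"])
        if current and current_tokens + n > tokens_per_batch:
            batches.append(current)
            current, current_tokens = [], 0
        current.append(sample)
        current_tokens += n
    if current:
        batches.append(current)
    return batches
```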
Good For
- Applications that require understanding and generating text grounded in very long inputs.
- Summarization of long documents, detailed question answering over large texts, and complex multi-turn conversations.
- Use cases where maintaining coherence and context over thousands of tokens is critical.