Name: 01-ai/Yi-1.5-34B-32K API
Brand: Featherless.ai
Price: 25.00 USD
Availability: InStock
Author: 01-ai

Yi-1.5-34B-32K: An Upgraded Large Language Model

The Yi-1.5-34B-32K is a 34 billion parameter model developed by 01-ai, representing an upgraded iteration of the original Yi series. This version has undergone continuous pre-training with an additional 500 billion high-quality tokens and fine-tuning on 3 million diverse samples, building upon the foundational Yi model.

Key Capabilities and Enhancements

Extended Context Length: This specific model variant supports a substantial context window of 32,768 tokens, enabling processing of longer inputs and generating more coherent, extended outputs.
Improved Performance: Compared to its predecessor, Yi-1.5 demonstrates stronger capabilities across several critical domains:
- Coding: Enhanced proficiency in code generation and understanding.
- Mathematics: Better performance in mathematical problem-solving.
- Reasoning: Improved logical deduction and complex problem-solving skills.
- Instruction-Following: More accurate and reliable adherence to given instructions.
Retained Strengths: The model maintains its excellent abilities in general language understanding, commonsense reasoning, and reading comprehension.

Benchmarking Highlights

The Yi-1.5-34B base model, which this 32K context variant is built upon, is noted to perform comparably to or even surpass larger models in various benchmarks.

Ideal Use Cases

Complex Problem Solving: Suitable for tasks requiring deep reasoning and mathematical understanding.
Code Generation and Analysis: Beneficial for developers needing assistance with programming tasks.
Long-form Content Processing: Its 32K context window makes it ideal for summarizing lengthy documents, analyzing extensive codebases, or engaging in prolonged conversational interactions.
Instruction-Driven Applications: Effective in scenarios where precise instruction following is crucial.

Overview

Yi-1.5-34B-32K: An Upgraded Large Language Model

Key Capabilities and Enhancements

Benchmarking Highlights

Ideal Use Cases

Full Model Card (README)