Name: 01-ai/Yi-34B API
Brand: Featherless.ai
Price: 25.00 USD
Availability: InStock
Author: 01-ai

Overview

The Yi-34B is a 34 billion parameter large language model from the Yi series, developed by 01.AI. It is built upon the Transformer architecture, similar to Llama models, but is not a derivative, having been trained from scratch on proprietary high-quality datasets and infrastructure. The model is designed to be bilingual, trained on a 3T multilingual corpus, and offers a context length of 32768 tokens.

Key Capabilities

Bilingual Proficiency: Excels in both English and Chinese language understanding and generation.
Strong Benchmarks: The Yi-34B-Chat model achieved second place on the AlpacaEval Leaderboard (Jan 2024) and the Yi-34B base model ranked first among open-source models on the Hugging Face Open LLM Leaderboard and C-Eval (Nov 2023).
Extended Context Window: Features a 32K context length, with a 200K version available for handling very long texts, demonstrating enhanced performance in "Needle-in-a-Haystack" tests.
Quantization Support: Available in 4-bit (AWQ) and 8-bit (GPTQ) quantized versions, enabling deployment on consumer-grade GPUs.

Use Cases

General-purpose applications: Suitable for a broad range of tasks requiring strong language understanding and generation.
Bilingual applications: Ideal for scenarios demanding high performance in both English and Chinese.
Long-context tasks: The 200K context version is particularly effective for processing and reasoning over extensive documents.
Resource-constrained environments: Quantized models allow for deployment on hardware with limited VRAM, such as RTX 3090 or 4090 GPUs.

Overview

Overview

Key Capabilities

Use Cases

Full Model Card (README)