Name: maars-command/Yi-34B API
Brand: Featherless.ai
Price: 25.00 USD
Availability: InStock
Author: maars-command

Overview

The Yi-34B model, developed by 01.AI, is part of the Yi series of large language models built from scratch on the Transformer architecture, similar to Llama. It is trained on a substantial 3T multilingual corpus, making it highly proficient in both English and Chinese. The model supports a 32K context length, with a 200K context length variant also available, and was trained on data up to June 2023.

Key Capabilities

Bilingual Proficiency: Excels in language understanding, commonsense reasoning, and reading comprehension across English and Chinese.
High Performance: The Yi-34B-Chat model achieved second place on the AlpacaEval Leaderboard (as of January 2024), outperforming models like GPT-4 and Mixtral. The base Yi-34B model ranked first among open-source models on the Hugging Face Open LLM Leaderboard and C-Eval (as of November 2023).
Long Context Window: The Yi-34B-200K variant offers an extended context window of 200,000 tokens, significantly enhancing its ability to process and understand lengthy texts. Recent enhancements improved its "Needle-in-a-Haystack" test performance to 99.8%.
Quantization Support: Available in 4-bit (AWQ) and 8-bit (GPTQ) quantized versions, allowing deployment on consumer-grade GPUs like RTX 3090 or 4090 with reduced VRAM requirements.

Good For

General-purpose LLM applications: Its strong performance in reasoning and comprehension makes it suitable for a wide array of tasks.
Bilingual (English/Chinese) applications: Optimized for use cases requiring proficiency in both languages.
Cost-effective commercial deployment: The 34B series is highlighted as a cost-effective solution for small and medium-sized enterprises.
Applications requiring long context: The 200K context length variant is ideal for tasks involving extensive documents or conversations.
Resource-constrained environments: Quantized versions enable deployment on hardware with limited GPU memory.

Overview

Overview

Key Capabilities

Good For

Full Model Card (README)