Name: kyujinpy/PlatYi-34B-Llama-Q-v2 API
Brand: Featherless.ai
Price: 25.00 USD
Availability: InStock
Author: kyujinpy

Model Overview

PlatYi-34B-Llama-Q-v2 is a 34 billion parameter auto-regressive language model developed by Kyujin Han (kyujinpy). It is based on the Yi-34B transformer architecture and was fine-tuned using Q-LoRA with a lora_r value of 64. The training utilized the garage-bAInd/Open-Platypus dataset, and the developer notes modifications to templates and warmup steps to address prior model issues.

Key Capabilities & Performance

This model is designed for general text generation. Its performance is evaluated on the Open LLM Leaderboard, where it achieved an average score of 67.88. Notable benchmark results include:

MMLU (5-Shot): 76.59
HellaSwag (10-Shot): 85.09
ARC (25-Shot): 61.09
TruthfulQA (0-shot): 52.65
Winogrande (5-shot): 82.79
GSM8k (5-shot): 49.05

Use Cases

Given its benchmark performance, PlatYi-34B-Llama-Q-v2 is suitable for a variety of natural language processing tasks requiring robust text generation and understanding. Its fine-tuning approach suggests potential for efficient deployment, making it a candidate for applications where a 34B parameter model can be effectively utilized.

Overview

Model Overview

Key Capabilities & Performance

Use Cases

Full Model Card (README)