kyujinpy/PlatYi-34B-Llama-Q-v2
PlatYi-34B-Llama-Q-v2 by Kyujin Han (kyujinpy) is a 34 billion parameter auto-regressive language model built on the Yi-34B transformer architecture, fine-tuned using Q-LoRA on the Open-Platypus dataset. This model is designed for general text generation tasks, offering a balance of performance and efficiency. It achieves an average score of 67.88 on the Open LLM Leaderboard benchmarks, including 76.59 on MMLU and 85.09 on HellaSwag, making it suitable for a range of natural language processing applications.
Loading preview...
Model Overview
PlatYi-34B-Llama-Q-v2 is a 34 billion parameter auto-regressive language model developed by Kyujin Han (kyujinpy). It is based on the Yi-34B transformer architecture and was fine-tuned using Q-LoRA with a lora_r value of 64. The training utilized the garage-bAInd/Open-Platypus dataset, and the developer notes modifications to templates and warmup steps to address prior model issues.
Key Capabilities & Performance
This model is designed for general text generation. Its performance is evaluated on the Open LLM Leaderboard, where it achieved an average score of 67.88. Notable benchmark results include:
- MMLU (5-Shot): 76.59
- HellaSwag (10-Shot): 85.09
- ARC (25-Shot): 61.09
- TruthfulQA (0-shot): 52.65
- Winogrande (5-shot): 82.79
- GSM8k (5-shot): 49.05
Use Cases
Given its benchmark performance, PlatYi-34B-Llama-Q-v2 is suitable for a variety of natural language processing tasks requiring robust text generation and understanding. Its fine-tuning approach suggests potential for efficient deployment, making it a candidate for applications where a 34B parameter model can be effectively utilized.