uukuguy/speechless-llama2-hermes-orca-platypus-wizardlm-13b

Warm
Public
13B
FP8
4096
Sep 1, 2023
Hugging Face

Model Overview

speechless-llama2-hermes-orca-platypus-wizardlm-13b is a 13-billion-parameter language model created by merging three Llama 2-based models: NousResearch/Nous-Hermes-Llama2-13b, Open-Orca/OpenOrca-Platypus2-13B, and WizardLM/WizardLM-13B-V1.2. The merge aims to combine the different fine-tuning of its constituent models into a single, more versatile LLM.

Key Capabilities & Features

  • Merged Architecture: Combines the strengths of multiple instruction-tuned Llama 2 models.
  • Instruction Following: Expects instructions in the Alpaca prompt format.
  • Quantized Versions Available: Offered in AWQ, GPTQ, and GGUF quantizations for efficient GPU and CPU inference.
  • Performance Benchmarks: Achieves an average score of 64.13 on the lm-evaluation-harness across ARC, HellaSwag, MMLU, and TruthfulQA, and an average of 51.85 on the Open LLM Leaderboard.
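
The Alpaca format mentioned above can be illustrated with a small prompt builder. This is a minimal sketch that assumes the widely used Stanford Alpaca template; the exact wording this model was tuned on is not given here, so check the upstream model card before relying on it. The function name `build_alpaca_prompt` and its parameters are hypothetical.

```python
# Assumption: the standard Stanford Alpaca template. The precise template
# used by this model is not stated in this document.
ALPACA_NO_INPUT = (
    "Below is an instruction that describes a task. "
    "Write a response that appropriately completes the request.\n\n"
    "### Instruction:\n{instruction}\n\n"
    "### Response:\n"
)

ALPACA_WITH_INPUT = (
    "Below is an instruction that describes a task, paired with an input "
    "that provides further context. "
    "Write a response that appropriately completes the request.\n\n"
    "### Instruction:\n{instruction}\n\n"
    "### Input:\n{context}\n\n"
    "### Response:\n"
)


def build_alpaca_prompt(instruction: str, context: str = "") -> str:
    """Return an Alpaca-style prompt (hypothetical helper, not a library API)."""
    instruction = instruction.strip()
    context = context.strip()
    if context:
        # Use the variant with an "### Input:" section when context is given.
        return ALPACA_WITH_INPUT.format(instruction=instruction, context=context)
    return ALPACA_NO_INPUT.format(instruction=instruction)


if __name__ == "__main__":
    print(build_alpaca_prompt("Summarize the text.", "Llama 2 is a family of LLMs."))
```

The returned string ends with the `### Response:` marker, so the model's completion is the text it generates after that point.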

When to Use This Model

  • General Instruction Following: Suited to applications that must respond to varied instruction-style prompts.
  • Chat and Dialogue Systems: Benefits from the chat-oriented fine-tuning of its constituent models.
  • Resource-Constrained Environments: The quantized versions allow efficient inference on hardware with limited memory.
  • Research and Development: A base for further experimentation and fine-tuning, since it merges several established models.
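
To make the resource-constrained case concrete, the rough weight footprint at different precisions can be estimated with simple arithmetic. The sketch below assumes a nominal 13e9 parameters and counts only the weights; real AWQ, GPTQ, and GGUF files carry extra metadata such as scales, and inference also needs memory for the KV cache and activations. The function name and the bit widths shown are illustrative assumptions, not measured values.

```python
def estimate_weight_memory_gib(n_params: float, bits_per_weight: float) -> float:
    """Approximate memory for model weights alone, in GiB.

    Hypothetical helper: ignores quantization metadata, KV cache,
    and activation memory.
    """
    total_bytes = n_params * bits_per_weight / 8
    return total_bytes / 2**30


if __name__ == "__main__":
    N_PARAMS = 13e9  # assumption: nominal parameter count for a "13B" model
    for label, bits in [("16-bit", 16), ("8-bit", 8), ("4-bit", 4)]:
        gib = estimate_weight_memory_gib(N_PARAMS, bits)
        print(f"{label}: ~{gib:.1f} GiB for weights")
```

Under these assumptions, halving the bits per weight halves the weight footprint, which is why 4-bit builds are the usual choice for consumer GPUs or CPU-only machines.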