Dampfinchen/Llama-3.1-8B-Ultra-Instruct

Warm
Public
8B
FP8
32768
License: llama3
Hugging Face
Overview

Model Overview

Dampfinchen/Llama-3.1-8B-Ultra-Instruct is an 8 billion parameter language model built upon the NousResearch/Meta-Llama-3.1-8B base. It was created using the DARE TIES merge method, combining the strengths of four distinct fine-tuned Llama 3.1 models:

  • nbeerbower/llama3.1-gutenberg-8B
  • akjindal53244/Llama-3.1-Storm-8B
  • nbeerbower/llama3.1-airoboros3.2-QDT-8B
  • Sao10K/Llama-3.1-8B-Stheno-v3.4

This merging approach aims to consolidate diverse capabilities into a single, efficient model. The model is designed to be used with the Llama 3 Instruct prompt template.

Performance Highlights

Evaluations on the Open LLM Leaderboard indicate competitive performance for an 8B model. Key metrics include:

  • Avg. Score: 28.98
  • IFEval (0-Shot): 80.81
  • BBH (3-Shot): 32.49
  • MMLU-PRO (5-shot): 31.40

Recommended Use Cases

  • General Instruction Following: Excels in responding to diverse prompts and instructions.
  • Conversational AI: Suitable for chatbots and interactive applications due to its instruction-tuned nature.
  • Text Generation: Capable of generating coherent and contextually relevant text across various topics.