Dampfinchen/Llama-3.1-8B-Ultra-Instruct
Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:32kPublished:Aug 26, 2024License:llama3Architecture:Transformer0.0K Warm

Dampfinchen/Llama-3.1-8B-Ultra-Instruct is an 8 billion parameter instruction-tuned language model based on the Llama 3.1 architecture, created by merging multiple fine-tuned models using the DARE TIES method. This model is optimized for general instruction following and demonstrates strong performance for its size, making it suitable for a wide range of conversational and text generation tasks. It leverages the combined strengths of its merged components to provide a robust 32K context window model.

Loading preview...

Model Overview

Dampfinchen/Llama-3.1-8B-Ultra-Instruct is an 8 billion parameter language model built upon the NousResearch/Meta-Llama-3.1-8B base. It was created using the DARE TIES merge method, combining the strengths of four distinct fine-tuned Llama 3.1 models:

  • nbeerbower/llama3.1-gutenberg-8B
  • akjindal53244/Llama-3.1-Storm-8B
  • nbeerbower/llama3.1-airoboros3.2-QDT-8B
  • Sao10K/Llama-3.1-8B-Stheno-v3.4

This merging approach aims to consolidate diverse capabilities into a single, efficient model. The model is designed to be used with the Llama 3 Instruct prompt template.

Performance Highlights

Evaluations on the Open LLM Leaderboard indicate competitive performance for an 8B model. Key metrics include:

  • Avg. Score: 28.98
  • IFEval (0-Shot): 80.81
  • BBH (3-Shot): 32.49
  • MMLU-PRO (5-shot): 31.40

Recommended Use Cases

  • General Instruction Following: Excels in responding to diverse prompts and instructions.
  • Conversational AI: Suitable for chatbots and interactive applications due to its instruction-tuned nature.
  • Text Generation: Capable of generating coherent and contextually relevant text across various topics.
Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p