eren23/NeuralDareBeagle-7B-slerp

TEXT GENERATIONConcurrency Cost:1Model Size:7BQuant:FP8Ctx Length:4kPublished:Jan 28, 2024License:apache-2.0Architecture:Transformer0.0K Open Weights Cold

NeuralDareBeagle-7B-slerp is a 7 billion parameter language model created by eren23, formed by merging mlabonne/NeuralBeagle14-7B and mlabonne/DareBeagle-7B-v2 using a slerp method. This merged model demonstrates strong general reasoning capabilities, achieving an average score of 74.60 on the Open LLM Leaderboard across various benchmarks. It is suitable for tasks requiring robust language understanding and generation, particularly in areas like common sense reasoning and multiple-choice question answering.

Loading preview...

Model Overview

NeuralDareBeagle-7B-slerp is a 7 billion parameter language model developed by eren23. It is a product of merging two base models, mlabonne/NeuralBeagle14-7B and mlabonne/DareBeagle-7B-v2, utilizing a spherical linear interpolation (slerp) merge method. This approach combines the strengths of its constituent models to enhance overall performance.

Key Capabilities & Performance

The model's performance has been evaluated on the Open LLM Leaderboard, where it achieved an average score of 74.60. Specific benchmark results include:

  • AI2 Reasoning Challenge (25-Shot): 72.10
  • HellaSwag (10-Shot): 88.20
  • MMLU (5-Shot): 64.99
  • TruthfulQA (0-shot): 69.18
  • Winogrande (5-shot): 82.56
  • GSM8k (5-shot): 70.58

These scores indicate strong performance across various reasoning, common sense, and language understanding tasks.

When to Use This Model

NeuralDareBeagle-7B-slerp is a suitable choice for developers looking for a 7B parameter model with solid general-purpose capabilities. Its balanced performance across multiple benchmarks suggests it can be effectively applied in scenarios requiring:

  • General text generation and comprehension.
  • Reasoning tasks and problem-solving.
  • Applications benefiting from robust common sense understanding.
  • Use cases where a merged model's combined strengths are advantageous.