teknium/OpenHermes-2.5-Mistral-7B

Warm
Public
7B
FP8
4096
License: apache-2.0
Hugging Face
Overview

Overview

OpenHermes 2.5 Mistral 7B, developed by Teknium, is an advanced fine-tune of the Mistral 7B architecture. It builds upon the OpenHermes 2 model by incorporating additional code datasets, leading to notable improvements across various benchmarks.

Key Capabilities and Performance

  • Enhanced Generalist Performance: Training on a specific ratio of code instruction data unexpectedly boosted several non-code benchmarks, including TruthfulQA, AGIEval, and the GPT4All suite.
  • Improved Code Generation: The model's HumanEval score for code tasks increased from 43% to 50.7% @ Pass 1, demonstrating significant progress in coding capabilities.
  • High-Quality Training Data: Fine-tuned on 1,000,000 entries, primarily GPT-4 generated data, alongside other high-quality open datasets, with extensive filtering and conversion to ShareGPT format.
  • ChatML Support: Utilizes the ChatML prompt format, enabling structured multi-turn dialogue and compatibility with OpenAI's API, making system prompts effective for guiding model behavior.

Benchmark Highlights

  • GPT4All: Achieved an average score of 73.12.
  • AGIEval: Scored an average of 43.07%.
  • TruthfulQA: Reached 53.04% on the mc2 metric.
  • Overall Improvement: Outperforms previous OpenHermes models (except Hermes 70B) and many current Mistral fine-tunes, with a total average score of 52.38 across key benchmarks.

Use Cases

This model is well-suited for a wide range of applications requiring strong conversational abilities, reasoning, and code generation. Its generalist improvements make it versatile for tasks from programming assistance to creative writing and complex dialogue.