teknium/OpenHermes-2.5-Mistral-7B

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:7BQuant:FP8Ctx Length:4kPublished:Oct 29, 2023License:apache-2.0Architecture:Transformer0.9K Open Weights Warm

OpenHermes 2.5 Mistral 7B by Teknium is a 7 billion parameter Mistral-based language model, fine-tuned on 1,000,000 entries of primarily GPT-4 generated data. This iteration significantly boosts non-code benchmarks like TruthfulQA and AGIEval, while also improving HumanEval code performance to 50.7%@Pass1. It is designed for generalist applications, excelling in both conversational tasks and code generation.

Loading preview...

Overview

OpenHermes 2.5 Mistral 7B, developed by Teknium, is an advanced fine-tune of the Mistral 7B architecture. It builds upon the OpenHermes 2 model by incorporating additional code datasets, leading to notable improvements across various benchmarks.

Key Capabilities and Performance

  • Enhanced Generalist Performance: Training on a specific ratio of code instruction data unexpectedly boosted several non-code benchmarks, including TruthfulQA, AGIEval, and the GPT4All suite.
  • Improved Code Generation: The model's HumanEval score for code tasks increased from 43% to 50.7% @ Pass 1, demonstrating significant progress in coding capabilities.
  • High-Quality Training Data: Fine-tuned on 1,000,000 entries, primarily GPT-4 generated data, alongside other high-quality open datasets, with extensive filtering and conversion to ShareGPT format.
  • ChatML Support: Utilizes the ChatML prompt format, enabling structured multi-turn dialogue and compatibility with OpenAI's API, making system prompts effective for guiding model behavior.

Benchmark Highlights

  • GPT4All: Achieved an average score of 73.12.
  • AGIEval: Scored an average of 43.07%.
  • TruthfulQA: Reached 53.04% on the mc2 metric.
  • Overall Improvement: Outperforms previous OpenHermes models (except Hermes 70B) and many current Mistral fine-tunes, with a total average score of 52.38 across key benchmarks.

Use Cases

This model is well-suited for a wide range of applications requiring strong conversational abilities, reasoning, and code generation. Its generalist improvements make it versatile for tasks from programming assistance to creative writing and complex dialogue.

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p