Name: teknium/OpenHermes-2.5-Mistral-7B API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: teknium

Overview

OpenHermes 2.5 Mistral 7B, developed by Teknium, is an advanced fine-tune of the Mistral 7B architecture. It builds upon the OpenHermes 2 model by incorporating additional code datasets, leading to notable improvements across various benchmarks.

Key Capabilities and Performance

Enhanced Generalist Performance: Training on a specific ratio of code instruction data unexpectedly boosted several non-code benchmarks, including TruthfulQA, AGIEval, and the GPT4All suite.
Improved Code Generation: The model's HumanEval score for code tasks increased from 43% to 50.7% @ Pass 1, demonstrating significant progress in coding capabilities.
High-Quality Training Data: Fine-tuned on 1,000,000 entries, primarily GPT-4 generated data, alongside other high-quality open datasets, with extensive filtering and conversion to ShareGPT format.
ChatML Support: Utilizes the ChatML prompt format, enabling structured multi-turn dialogue and compatibility with OpenAI's API, making system prompts effective for guiding model behavior.

Benchmark Highlights

GPT4All: Achieved an average score of 73.12.
AGIEval: Scored an average of 43.07%.
TruthfulQA: Reached 53.04% on the mc2 metric.
Overall Improvement: Outperforms previous OpenHermes models (except Hermes 70B) and many current Mistral fine-tunes, with a total average score of 52.38 across key benchmarks.

Use Cases

This model is well-suited for a wide range of applications requiring strong conversational abilities, reasoning, and code generation. Its generalist improvements make it versatile for tasks from programming assistance to creative writing and complex dialogue.