nlpguy/Hermes-low-tune-2

TEXT GENERATIONConcurrency Cost:1Model Size:7BQuant:FP8Ctx Length:4kPublished:Jan 5, 2024License:apache-2.0Architecture:Transformer0.0K Open Weights Cold

nlpguy/Hermes-low-tune-2 is a 7 billion parameter language model created by nlpguy using the task arithmetic merge method. It is based on teknium/OpenHermes-2.5-Mistral-7B and incorporates several other OpenHermes variants, including one focused on mathematical reasoning. This model is designed for general-purpose conversational AI, demonstrating balanced performance across various benchmarks, including reasoning and common sense tasks.

Loading preview...

Model Overview

nlpguy/Hermes-low-tune-2 is a 7 billion parameter language model developed by nlpguy. It was created using the task arithmetic merge method, combining multiple specialized models into a single, more versatile model. The base model for this merge is teknium/OpenHermes-2.5-Mistral-7B.

Merge Details

This model integrates capabilities from several OpenHermes-based models, including:

Performance Highlights

Evaluated on the Open LLM Leaderboard, Hermes-low-tune-2 achieves an average score of 68.04. Notable scores include:

  • AI2 Reasoning Challenge (25-Shot): 65.61
  • HellaSwag (10-Shot): 84.47
  • MMLU (5-Shot): 63.69
  • GSM8k (5-Shot): 63.53

Use Cases

This model is suitable for general conversational AI applications, question answering, and tasks requiring a balance of reasoning and common sense, benefiting from the diverse strengths of its merged components.