dozzke/hermorca

TEXT GENERATIONConcurrency Cost:1Model Size:7BQuant:FP8Ctx Length:4kPublished:Mar 21, 2024License:apache-2.0Architecture:Transformer0.0K Open Weights Cold

Dozzke/hermorca is a 7 billion parameter language model created by dozzke, merged from NousResearch/Hermes-2-Pro-Mistral-7B and Open-Orca/Mistral-7B-OpenOrca using the SLERP method. This model combines the strengths of its base components, offering a versatile solution for general language understanding and generation tasks. It leverages a 4096-token context length, making it suitable for processing moderately sized inputs.

Loading preview...

Hermorca: A Merged 7B Language Model

Hermorca is a 7 billion parameter language model developed by dozzke, created by merging two prominent Mistral-7B based models: NousResearch/Hermes-2-Pro-Mistral-7B and Open-Orca/Mistral-7B-OpenOrca. This merge was performed using the SLERP (Spherical Linear Interpolation) method, a technique designed to combine the capabilities of different models while preserving their individual strengths.

Key Capabilities

  • Combines Strengths: By merging Hermes-2-Pro (known for its instruction-following and reasoning) and OpenOrca (recognized for its strong general performance and instruction tuning), Hermorca aims to offer a balanced and robust language model.
  • Mistral-7B Architecture: Inherits the efficient and capable architecture of the Mistral-7B base models, providing a strong foundation for various NLP tasks.
  • General Purpose: Suitable for a wide range of applications including text generation, summarization, question answering, and conversational AI, benefiting from the diverse training data of its constituent models.

Good For

  • Experimentation with Merged Models: Ideal for developers interested in exploring the performance characteristics of SLERP-merged models.
  • Versatile NLP Tasks: Can be applied to general language understanding and generation where a 7B parameter model with a 4096-token context window is appropriate.
  • Building upon Established Models: Provides a solid base for further fine-tuning or integration into larger systems, leveraging the proven capabilities of Hermes-2-Pro and OpenOrca.