NousResearch/Hermes-3-Llama-3.1-70B

Loading
Public
70B
FP8
32768
Jul 29, 2024
License: llama3
Hugging Face
Overview

Hermes 3 - Llama-3.1 70B Overview

Hermes 3 is Nous Research's flagship 70 billion parameter large language model, building on the Llama-3.1 architecture with a 32,768 token context window. This iteration significantly advances the Hermes series, focusing on user alignment and powerful steering capabilities.

Key Capabilities

  • Advanced Agentic Capabilities: Enhanced for complex task execution and autonomous behavior.
  • Improved Roleplaying & Reasoning: Offers much better performance in role-play scenarios and logical deduction.
  • Multi-turn Conversation & Long Context Coherence: Maintains consistency and understanding over extended dialogues.
  • Reliable Function Calling & Structured Output: Features more robust and dependable function calling, including JSON mode for structured responses.
  • Enhanced Code Generation: Demonstrates improved skills in generating code.
  • ChatML Prompt Format: Utilizes ChatML for structured multi-turn conversations, compatible with OpenAI API formats, allowing for flexible system prompts.

Benchmarks

Hermes 3 is competitive with, and in some areas superior to, Llama-3.1 Instruct models across general capabilities, showcasing varying strengths.

Good for

  • Applications requiring advanced agentic behavior and complex task automation.
  • Developing chatbots and virtual assistants that need strong roleplaying and multi-turn conversational abilities.
  • Scenarios demanding reliable function calling and structured JSON outputs.
  • Code generation tasks and generalist assistant functionalities.