NousResearch/Hermes-3-Llama-3.1-70B

Hugging Face
TEXT GENERATIONConcurrency Cost:4Model Size:70BQuant:FP8Ctx Length:32kPublished:Jul 29, 2024License:llama3Architecture:Transformer0.1K Warm

Hermes 3 - Llama-3.1 70B is the latest 70 billion parameter large language model from Nous Research, built upon the Llama-3.1 architecture with a 32K context length. This generalist model features advanced agentic capabilities, improved roleplaying, reasoning, and multi-turn conversation coherence. It excels in aligning LLMs to user intent, offering powerful steering, reliable function calling, structured output, and enhanced code generation skills.

Loading preview...

Hermes 3 - Llama-3.1 70B Overview

Hermes 3 is Nous Research's flagship 70 billion parameter large language model, building on the Llama-3.1 architecture with a 32,768 token context window. This iteration significantly advances the Hermes series, focusing on user alignment and powerful steering capabilities.

Key Capabilities

  • Advanced Agentic Capabilities: Enhanced for complex task execution and autonomous behavior.
  • Improved Roleplaying & Reasoning: Offers much better performance in role-play scenarios and logical deduction.
  • Multi-turn Conversation & Long Context Coherence: Maintains consistency and understanding over extended dialogues.
  • Reliable Function Calling & Structured Output: Features more robust and dependable function calling, including JSON mode for structured responses.
  • Enhanced Code Generation: Demonstrates improved skills in generating code.
  • ChatML Prompt Format: Utilizes ChatML for structured multi-turn conversations, compatible with OpenAI API formats, allowing for flexible system prompts.

Benchmarks

Hermes 3 is competitive with, and in some areas superior to, Llama-3.1 Instruct models across general capabilities, showcasing varying strengths.

Good for

  • Applications requiring advanced agentic behavior and complex task automation.
  • Developing chatbots and virtual assistants that need strong roleplaying and multi-turn conversational abilities.
  • Scenarios demanding reliable function calling and structured JSON outputs.
  • Code generation tasks and generalist assistant functionalities.

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p