unsloth/Hermes-4-70B

Hugging Face
TEXT GENERATIONConcurrency Cost:4Model Size:70BQuant:FP8Ctx Length:32kPublished:Aug 27, 2025License:llama3Architecture:Transformer0.0K Warm

Hermes 4 70B is a 70 billion parameter hybrid-mode reasoning model developed by Nous Research, based on the Llama-3.1 architecture. It is specifically designed for advanced reasoning tasks, excelling in math, code, STEM, logic, and creative writing, while maintaining general assistant quality. The model features a unique hybrid reasoning mode with explicit deliberation segments and improved steerability for custom alignment.

Loading preview...

Hermes 4 70B: Advanced Reasoning and Steerability

Hermes 4 70B, developed by Nous Research, is a frontier 70 billion parameter model built on the Llama-3.1 architecture. It introduces a novel hybrid reasoning mode that allows the model to deliberate internally using <think>…</think> segments before generating a response, significantly enhancing its problem-solving capabilities.

Key Capabilities

  • Enhanced Reasoning: Demonstrates massive improvements in math, code, STEM, logic, and creative writing, achieved through a newly synthesized post-training corpus of ~5M samples and ~60B tokens.
  • Schema Adherence & Structured Outputs: Trained to produce valid JSON for given schemas and to repair malformed objects, making it suitable for structured data generation.
  • Superior Steerability: Achieves state-of-the-art performance on RefusalBench, indicating high helpfulness and adaptability to user values without censorship, offering extreme improvements in reduced refusal rates.
  • Function Calling & Tool Use: Supports function/tool calls within a single assistant turn, integrating them after its internal reasoning process.

Good For

  • Applications requiring complex reasoning in technical domains like math, coding, and scientific problem-solving.
  • Scenarios demanding reliable structured outputs such as JSON generation and data repair.
  • Use cases where custom alignment and steerability are crucial, allowing developers to fine-tune its behavior and reduce unwanted refusals.
  • Developers looking for a model that can integrate tool use effectively after internal deliberation.