NousResearch/Hermes-4-14B

Parameters: 14B
Quantization: FP8
Context length: 32768 tokens
License: apache-2.0
Overview

Hermes 4-14B: A Frontier Reasoning Model

Hermes 4-14B, developed by Nous Research, is a 14-billion-parameter model built on the Qwen 3 architecture and designed for advanced reasoning and user alignment. It introduces a hybrid reasoning mode: the model can deliberate inside explicit <think>…</think> segments before generating a response, and this mode can be switched off when faster outputs matter more than deliberation.
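Because reasoning arrives inline in <think>…</think> segments, client code usually needs to separate the deliberation from the final answer. The following is a minimal sketch of that split, assuming the raw completion is available as a plain string; the helper name split_reasoning and the sample text are hypothetical and not part of any Nous Research tooling.

```python
import re

# Matches every <think>...</think> segment, including ones spanning several lines.
THINK_RE = re.compile(r"<think>(.*?)</think>", re.DOTALL)


def split_reasoning(text: str) -> tuple[str, str]:
    """Return (reasoning, answer) extracted from a raw completion string."""
    # Collect the contents of all think segments, one per line.
    reasoning = "\n".join(segment.strip() for segment in THINK_RE.findall(text))
    # Whatever remains after removing the segments is the user-facing answer.
    answer = THINK_RE.sub("", text).strip()
    return reasoning, answer


# Hypothetical completion, written by hand for illustration.
raw = "<think>2 + 2 is 4; doubling gives 8.</think>The result is 8."
reasoning, answer = split_reasoning(raw)
print(answer)
```

When reasoning mode is off and no tags are present, the same function returns an empty reasoning string and the unchanged answer, so callers do not need a separate code path.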

Key Capabilities & Improvements

  • Enhanced Reasoning: Improved performance in math, code, STEM, logic, and creative writing, attributed to a much larger post-training corpus of ~5M samples / ~60B tokens.
  • Schema Adherence & Structured Outputs: Trained to produce valid JSON for a given schema and to repair malformed objects, which makes it well suited to structured data generation.
  • Steerability & Alignment: Markedly improved steerability and lower refusal rates, with reported state-of-the-art results on RefusalBench; the model aims to be helpful and to conform to user values without censorship.
  • Function Calling & Tool Use: Supports function/tool calls within a single assistant turn; calls can be combined with reasoning mode for improved accuracy.
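
To illustrate a tool call that follows reasoning in the same assistant turn, here is a minimal sketch of client-side dispatch. It assumes the model wraps each call in <tool_call>…</tool_call> tags containing a JSON object with "name" and "arguments" keys; that wire format is an assumption here and is not stated on this page. The TOOLS registry and the add function are hypothetical.

```python
import json
import re

# Assumed wire format: <tool_call>{"name": ..., "arguments": {...}}</tool_call>
TOOL_CALL_RE = re.compile(r"<tool_call>(.*?)</tool_call>", re.DOTALL)


def add(a: float, b: float) -> float:
    """Hypothetical tool exposed to the model."""
    return a + b


# Hypothetical registry mapping tool names to local callables.
TOOLS = {"add": add}


def run_tool_calls(assistant_turn: str) -> list:
    """Execute every tool call found in one assistant turn, in order."""
    results = []
    for payload in TOOL_CALL_RE.findall(assistant_turn):
        call = json.loads(payload.strip())
        func = TOOLS[call["name"]]  # raises KeyError for unknown tools
        results.append(func(**call["arguments"]))
    return results


# Hand-written turn: reasoning first, then a tool call, in a single turn.
turn = (
    "<think>The user wants a sum, so I should call the add tool.</think>\n"
    '<tool_call>{"name": "add", "arguments": {"a": 2, "b": 3}}</tool_call>'
)
results = run_tool_calls(turn)
print(results)
```

In a real deployment the tool results would be sent back to the model in a follow-up message; that round trip depends on the serving API and is left out of this sketch.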

When to Use This Model

  • Complex Reasoning Tasks: Ideal for applications requiring deep thought processes in areas like mathematics, scientific problem-solving, and logical deduction.
  • Code Generation & STEM: Well suited to generating code and answering STEM-related queries.
  • Structured Data Generation: Highly effective for tasks requiring valid JSON outputs or adherence to specific schemas.
  • Creative Writing & Subjective Responses: Capable of producing expressive, creative content without a drop in output quality.
  • Applications Requiring High Steerability: Suitable for use cases where fine-grained control over model behavior and reduced refusal rates are critical.
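
For the structured data use case listed above, it is still prudent to verify each output before passing it downstream. The sketch below checks a completion against a deliberately simplified schema (a mapping of key to Python type) using only the standard library; the SCHEMA contents and the check_output helper are hypothetical, and a full JSON Schema validator would be the natural replacement in production.

```python
import json

# Hypothetical simplified schema: required key -> expected Python type.
SCHEMA = {"title": str, "pages": int}


def check_output(raw: str, schema: dict) -> list[str]:
    """Return a list of problems; an empty list means the output conforms."""
    try:
        data = json.loads(raw)
    except json.JSONDecodeError as exc:
        return [f"invalid JSON: {exc.msg}"]
    if not isinstance(data, dict):
        return ["top-level value is not an object"]
    problems = []
    for key, expected in schema.items():
        if key not in data:
            problems.append(f"missing key: {key}")
        elif not isinstance(data[key], expected):
            problems.append(f"wrong type for {key}")
    return problems


# Hand-written sample outputs for illustration.
good = '{"title": "Field Notes", "pages": 120}'
bad = '{"title": "Field Notes"}'
print(check_output(good, SCHEMA))
print(check_output(bad, SCHEMA))
```

The returned problem list can be fed back to the model as a repair prompt, which fits the model's stated ability to fix malformed objects.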