nvidia/OpenReasoning-Nemotron-32B

Warm
Public
32.8B
FP8
131072
License: cc-by-4.0
Hugging Face
Overview

OpenReasoning-Nemotron-32B: Advanced Reasoning Model

OpenReasoning-Nemotron-32B is a 32.8 billion parameter language model developed by NVIDIA, based on the Qwen2.5-32B architecture. It is specifically post-trained to excel in complex reasoning tasks across math, code, and science, supporting an extensive context length for up to 64,000 output tokens.

Key Capabilities

  • Specialized Reasoning: Optimized for generating solutions in competitive math, coding, and scientific problems.
  • High Performance: Demonstrates strong results on challenging reasoning benchmarks such as AIME, LiveCodeBench, GPQA, and MMLU-PRO, often setting new records for its size class.
  • Generative Solution Selection (GenSelect): Incorporates a unique inference mode that combines multiple parallel generations to select the best solution, significantly boosting performance on math and coding benchmarks. This capability generalizes across problem types.
  • Commercial Use: Available for both commercial and non-commercial research under the Creative Commons Attribution 4.0 International License (CC-BY-4.0).

Good For

  • Developers and researchers focused on competitive programming, mathematical problem-solving, and scientific inquiry.
  • Applications requiring robust, step-by-step reasoning and solution generation.
  • Leveraging advanced inference techniques like GenSelect for enhanced accuracy in complex tasks.