nvidia/AceMath-7B-Instruct

Warm
Public
7.6B
FP8
131072
License: cc-by-nc-4.0
Hugging Face
Overview

AceMath-7B-Instruct: Advanced Mathematical Reasoning

AceMath-7B-Instruct is a 7.6 billion parameter model from NVIDIA, specifically engineered for high-performance mathematical reasoning. It is developed from the Qwen2.5-Math-7B-Base model through a multi-stage supervised fine-tuning (SFT) process, first with general-purpose SFT data, followed by math-specific SFT data.

Key Capabilities

  • Mathematical Problem Solving: Excels at solving complex English mathematical problems using Chain-of-Thought (CoT) reasoning.
  • Benchmark Performance: Outperforms previous best-in-class models like Qwen2.5-Math-7B-Instruct on various math reasoning benchmarks, achieving an average pass@1 of 67.2.
  • Specialized Training: Benefits from a dedicated math-specific SFT phase, leveraging comprehensive training data released by NVIDIA.

Good For

  • Dedicated Math Applications: Primarily recommended for tasks requiring robust mathematical problem-solving capabilities.
  • Research in Mathematical AI: The accompanying training data and reward models (AceMath-7B-RM) support further research in this domain.

It's important to note that while highly specialized for math, for general-purpose tasks encompassing code, math, and general knowledge, NVIDIA also offers the AceInstruct series of models.