Name: nvidia/AceMath-72B-Instruct API
Brand: Featherless.ai
Price: 25.00 USD
Availability: InStock
Author: nvidia

AceMath-72B-Instruct: Advanced Mathematical Reasoning

AceMath-72B-Instruct is a 72.7 billion parameter model from NVIDIA's AceMath family, specifically engineered for mathematical reasoning. It is built upon the Qwen2.5-Math-72B-Base model and undergoes a multi-stage supervised fine-tuning (SFT) process, first with general-purpose SFT data, then with math-specific SFT data.

Key Capabilities

Mathematical Problem Solving: Excels at solving English mathematical problems using Chain-of-Thought (CoT) reasoning.
High Performance: Outperforms Qwen2.5-Math-72B-Instruct (71.8 vs. 68.2), GPT-4o (67.4), and Claude 3.5 Sonnet (65.6) on various math reasoning benchmarks.
Reward Model Integration: Designed to work in conjunction with AceMath-72B-RM, an outcome reward model that evaluates and scores mathematical solutions.
Extensive Context: Features a context length of 131072 tokens.

When to Use This Model

Dedicated Math Tasks: This model is specifically recommended for applications requiring robust mathematical problem-solving capabilities.
Research in Mathematical AI: The release of all training data supports further research in this field.

For general-purpose tasks including code and general knowledge, NVIDIA also offers the AceInstruct series of models. AceMath models are intended for non-commercial use.

Overview

AceMath-72B-Instruct: Advanced Mathematical Reasoning

Key Capabilities

When to Use This Model

Full Model Card (README)