nvidia/AceMath-72B-Instruct
TEXT GENERATIONConcurrency Cost:4Model Size:72.7BQuant:FP8Ctx Length:32kPublished:Jan 14, 2025License:cc-by-nc-4.0Architecture:Transformer0.0K Open Weights Warm

nvidia/AceMath-72B-Instruct is a 72.7 billion parameter instruction-tuned causal language model developed by NVIDIA, based on the Qwen2.5-Math-72B-Base architecture. It is specifically optimized for advanced mathematical reasoning tasks, excelling at solving English mathematical problems using Chain-of-Thought (CoT) reasoning. This model is designed for non-commercial use and has a context length of 131072 tokens.

Loading preview...

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p