nvidia/OpenMath2-Llama3.1-8B

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:32kPublished:Sep 30, 2024License:llama3.1Architecture:Transformer0.0K Warm

OpenMath2-Llama3.1-8B is an 8 billion parameter language model developed by NVIDIA, fine-tuned from Llama3.1-8B-Base with the OpenMathInstruct-2 dataset. This model is specifically optimized for mathematical reasoning and problem-solving, demonstrating significant performance improvements over Llama3.1-8B-Instruct on various math benchmarks, including a 15.9% increase on the MATH dataset. It is designed for advanced mathematical tasks, leveraging a 32768 token context length.

Loading preview...

Overview

OpenMath2-Llama3.1-8B is an 8 billion parameter model from NVIDIA, built upon the Llama3.1-8B-Base architecture. It has been fine-tuned using the specialized OpenMathInstruct-2 dataset to enhance its mathematical reasoning capabilities. The model utilizes the same chat format as Llama3.1-instruct models.

Key Capabilities & Performance

This model significantly outperforms Llama3.1-8B-Instruct across multiple popular math benchmarks. Notably, it achieves a 15.9% higher score on the MATH dataset compared to its base instruction-tuned counterpart. Specific benchmark improvements include:

  • GSM8K: 91.7% (vs. 84.5% for Llama3.1-8B-Instruct)
  • MATH: 67.8% (vs. 51.9%)
  • AMC 2023: 16/40 (vs. 9/40)
  • Omni-MATH: 22.0 (vs. 12.7)

Use Cases & Limitations

OpenMath2-Llama3.1-8B is primarily designed for advanced mathematical problem-solving. Its training pipeline and dataset are fully open-sourced, including the code and models. Users should note that while highly proficient in math, this model has not been instruction-tuned on general data and may not perform optimally for non-mathematical tasks. For more details, refer to the accompanying paper.

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p