nvidia/OpenMath-CodeLlama-7b-Python-hf

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:7BQuant:FP8Ctx Length:4kPublished:Feb 9, 2024License:llama2Architecture:Transformer0.0K Open Weights Warm

The nvidia/OpenMath-CodeLlama-7b-Python-hf is a 7 billion parameter Code Llama-based model developed by NVIDIA, specifically designed for solving mathematical problems. It integrates text-based reasoning with Python code execution, trained on the 1.8M problem-solution pairs of the OpenMathInstruct-1 dataset. This model excels at mathematical reasoning tasks, achieving 75.9 on GSM8K and 43.6 on MATH benchmarks (greedy evaluation). Its primary use case is mathematical problem-solving through code-augmented reasoning.

Loading preview...

OpenMath-CodeLlama-7b-Python-hf: Math Problem Solving with Code

The nvidia/OpenMath-CodeLlama-7b-Python-hf is a 7 billion parameter model from NVIDIA's OpenMath series, built upon the Code Llama architecture. It is specifically engineered to tackle mathematical problems by combining natural language reasoning with the execution of Python code blocks. This capability is derived from its training on the extensive OpenMathInstruct-1 dataset, which comprises 1.8 million problem-solution pairs generated by the Mixtral-8x7B model.

Key Capabilities

  • Mathematical Reasoning: Designed to interpret and solve complex mathematical problems.
  • Code Integration: Seamlessly integrates text-based reasoning with Python code execution for accurate solutions.
  • Benchmark Performance: Achieves a greedy score of 75.9 on GSM8K and 43.6 on MATH benchmarks, demonstrating strong performance in mathematical problem-solving.
  • Open-Sourced Pipeline: The entire pipeline, including code, models, and dataset, is open-sourced, allowing for reproducibility and further development.

Good For

  • Automated Math Solvers: Ideal for applications requiring automated solutions to mathematical queries.
  • Educational Tools: Can be integrated into platforms for teaching and practicing math, providing step-by-step reasoning.
  • Research in AI for Math: A valuable base model for researchers exploring advanced mathematical reasoning in LLMs.
  • Code-Augmented Reasoning Tasks: Suitable for scenarios where combining linguistic understanding with computational execution is crucial.