nvidia/DLER-R1-1.5B-Research

Cold
Public
1.5B
BF16
131072
Hugging Face
Overview

Model Overview

The nvidia/DLER-R1-1.5B-Research model, developed by NVIDIA, is a 1.5 billion parameter, open-weight reasoning model. It is engineered for high efficiency in complex tasks like mathematics, programming, and scientific problem-solving.

Key Capabilities & Differentiators

  • Ultra-Efficient Reasoning: The model is trained using the DLER algorithm, which optimizes for concise and accurate outputs.
  • Significant Efficiency Gains: Compared to DeepSeek's 1.5B model, DLER-R1-1.5B reduces the average response length by approximately 77% across various mathematical benchmarks.
  • Improved Accuracy: Despite its brevity, the model demonstrates better accuracy on benchmarks such as MATH (+2.64%), AIME (+4.59%), AMC (+8.51%), Minerva (+5.18%), and Olympiad (+4.24%).
  • Targeted for Research: This model is specifically intended for research and development purposes in advanced reasoning applications.

Performance Highlights

DLER-R1-1.5B consistently outperforms DeepSeek-R1-1.5B in both accuracy and response length efficiency across multiple mathematical reasoning benchmarks:

  • MATH: 86.95% accuracy with 70% shorter responses.
  • AIME: 34.375% accuracy with 80% shorter responses.
  • AMC: 70.48% accuracy with 77% shorter responses.
  • Minerva: 43.58% accuracy with 73% shorter responses.
  • Olympiad: 48.314% accuracy with 78% shorter responses.

Intended Use

This model is primarily for academic and industrial research, focusing on advancing efficient and accurate reasoning capabilities in AI.