GAIR/LIMR
GAIR/LIMR is a 7.6-billion-parameter model developed by GAIR that challenges traditional data-scaling assumptions in reinforcement learning for LLMs. It uses a novel Learning Impact Measurement (LIM) methodology to achieve comparable or superior performance on mathematical reasoning tasks with roughly one-sixth the training data. The model excels at complex mathematical problem-solving, demonstrating that data quality and relevance matter more than quantity for effective RL training.
What is GAIR/LIMR?
GAIR/LIMR (Less is More for RL Scaling) is a 7.6-billion-parameter model developed by GAIR that redefines the approach to data scaling in reinforcement learning (RL) for large language models. It demonstrates that a strategically selected, smaller dataset can match or exceed the performance of much larger datasets, particularly on mathematical reasoning tasks. The core innovation is the Learning Impact Measurement (LIM) methodology, an automated system for evaluating the effectiveness of individual training samples that eliminates the need for extensive manual curation.
Key Capabilities & Innovations
- Data Efficiency: Achieves strong performance with only 1,389 mathematical questions, significantly outperforming a model trained on roughly 6x more data (8,523 questions) on some benchmarks.
- Automated Sample Evaluation: Introduces the LIM methodology for automated, quantitative assessment of training sample value, ensuring high-quality data selection.
- Direct RL from Base Models: All investigations and training are conducted directly from base models, providing clear insights into RL dynamics without relying on distillation from larger models.
- Superior Mathematical Reasoning: Outperforms other RL recipes and Qwen-Math-7B variants on challenging mathematical benchmarks like AIME2024, MATH500, and AMC2023, achieving an average score of 58.1%.
When to Use GAIR/LIMR
- Resource-Constrained Environments: Ideal for scenarios where computational resources or access to vast datasets are limited, but high performance is still required.
- Mathematical & Reasoning Tasks: Particularly well-suited for applications demanding precise and accurate mathematical problem-solving.
- Efficient RL Training: Developers looking to optimize RL training processes by focusing on data quality over quantity will find LIMR's methodology highly valuable.
This model challenges the conventional wisdom that more data is always better, showing that intelligent data selection can yield more efficient and effective model training.