microsoft/rho-math-1b-v0.1
Text Generation · Concurrency Cost: 1 · Model Size: 1.1B · Quant: BF16 · Context Length: 2k · Published: Apr 11, 2024 · License: MIT · Architecture: Transformer · Open Weights

microsoft/rho-math-1b-v0.1 is a 1.1 billion parameter causal language model from Microsoft, pre-trained with Selective Language Modeling (SLM). Rather than computing the loss on every token, SLM trains selectively on clean, useful tokens, which allows the model to reach strong mathematical reasoning performance with significantly fewer pretraining tokens. It excels at mathematical tasks, achieving accuracy on benchmarks such as MATH and GSM8K that is competitive with much larger models.
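As a quick orientation, below is a minimal sketch of running the model for inference with the Hugging Face transformers library. The generation settings and the GSM8K-style prompt are illustrative assumptions, not recommended settings from this card.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Load the published checkpoint; bfloat16 matches the BF16 precision listed above.
model_id = "microsoft/rho-math-1b-v0.1"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.bfloat16)
model.eval()

# A GSM8K-style word problem as an illustrative prompt (not an official example).
prompt = (
    "Question: Natalia sold clips to 48 of her friends in April, and then she "
    "sold half as many clips in May. How many clips did Natalia sell altogether "
    "in April and May?\nAnswer:"
)

inputs = tokenizer(prompt, return_tensors="pt")
with torch.no_grad():
    outputs = model.generate(**inputs, max_new_tokens=256, do_sample=False)

# Decode only the newly generated tokens after the prompt.
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True))
```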
