Name: InfiniAILab/OpenR1-Qwen-3B-SFT-Instruct API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: InfiniAILab

Model Overview

InfiniAILab/OpenR1-Qwen-3B-SFT-Instruct is a 3.1 billion parameter language model built upon the Qwen2.5-3B-Instruct architecture. This model has undergone supervised fine-tuning (SFT) using the open-r1/OpenR1-Math-220k dataset, specifically targeting enhanced mathematical reasoning and problem-solving capabilities. The fine-tuning process was conducted using the TRL framework.

Key Capabilities

Mathematical Reasoning: Specialized training on a dedicated math dataset improves its ability to understand and solve mathematical problems.
Instruction Following: Inherits strong instruction-following capabilities from its base Qwen2.5-3B-Instruct model.
Efficient Performance: As a 3.1 billion parameter model, it offers a balance between performance and computational efficiency.

Good For

Applications requiring robust mathematical problem-solving.
Tasks involving logical deduction and quantitative analysis.
Scenarios where a smaller, specialized model is preferred for efficiency without sacrificing mathematical accuracy.

Overview

Model Overview

Key Capabilities

Good For

Full Model Card (README)