neulab/SP3F-7B
SP3F-7B is a 7.6-billion-parameter multilingual model developed by neulab, built on the Qwen2.5-7B base and trained with Self-Play with Privileged Pairwise Feedback (SP3F). This training methodology substantially improves performance on multilingual reasoning and mathematical benchmarks, with notable gains on MGSM and MT Math100. With a 32,768-token context length, SP3F-7B is suited to complex multilingual tasks that demand robust reasoning.
SP3F-7B: Multilingual Reasoning with Self-Play Feedback
SP3F-7B is a 7.6-billion-parameter multilingual model from neulab, built on the Qwen2.5-7B architecture. Its key differentiator is its training methodology, Self-Play with Privileged Pairwise Feedback (SP3F), which markedly improves the model's handling of complex multilingual reasoning and mathematical problems.
Key Capabilities
- Enhanced Multilingual Reasoning: Demonstrates substantial improvements over its base model and other instruction-tuned variants in tasks requiring cross-lingual understanding and problem-solving.
- Superior Mathematical Performance: Achieves high accuracy on benchmarks like MGSM and MT Math100, indicating strong quantitative reasoning skills.
- Robust Training: The SP3F method, detailed in the associated research paper, enables the model to learn from privileged feedback, leading to more accurate and reliable outputs.
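The preference-collection step behind SP3F can be sketched conceptually. In the sketch below, all helper functions (`sample_answers`, `translate_to_english`, `judge_prefers`) are hypothetical stand-ins, not neulab's actual training code; consult the linked paper for the real procedure. The idea shown: the policy samples competing answers to a non-English question via self-play, a pairwise judge compares them with access to privileged information (here illustrated as English translations), and the resulting chosen/rejected pair is stored for preference optimization.

```python
# Conceptual sketch of one SP3F preference-collection step.
# All helpers are hypothetical stand-ins for illustration only.

def sample_answers(question, n=2):
    # Stand-in for sampling n candidate answers from the policy model.
    return [f"candidate-{i} for {question!r}" for i in range(n)]

def translate_to_english(text):
    # Stand-in for the privileged English view the judge receives.
    return f"EN({text})"

def judge_prefers(question_en, answer_a_en, answer_b_en):
    # Stand-in pairwise judge: returns True if answer A is preferred.
    # A real judge would be a strong LLM assessing correctness in English.
    return len(answer_a_en) <= len(answer_b_en)

def collect_preference_pair(question):
    a, b = sample_answers(question)
    q_en = translate_to_english(question)
    if judge_prefers(q_en, translate_to_english(a), translate_to_english(b)):
        chosen, rejected = a, b
    else:
        chosen, rejected = b, a
    # Pairs in this (prompt, chosen, rejected) form feed standard
    # preference-optimization trainers.
    return {"prompt": question, "chosen": chosen, "rejected": rejected}

pair = collect_preference_pair("¿Cuánto es 12 × 7?")
```

The point of the sketch is the data flow: the judge sees English translations the policy never sees, which is what makes the feedback "privileged."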
Good For
- Applications requiring high-accuracy multilingual mathematical problem-solving.
- Tasks involving complex reasoning across multiple languages.
- Developers seeking a 7B-class model with advanced training for improved performance in specific, challenging domains.
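For the use cases above, a minimal inference sketch with Hugging Face transformers follows. It assumes the checkpoint is hosted as neulab/SP3F-7B and inherits the standard Qwen2.5 chat template from its base model; the `chat` helper is illustrative, not an official example.

```python
def chat(prompt, model_id="neulab/SP3F-7B", max_new_tokens=512):
    # Illustrative helper (assumed model ID and chat template): loads the
    # checkpoint with Hugging Face transformers and returns one completion.
    # Imports are deferred so defining the helper stays lightweight.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(
        model_id, torch_dtype=torch.bfloat16, device_map="auto"
    )
    messages = [{"role": "user", "content": prompt}]
    inputs = tokenizer.apply_chat_template(
        messages, add_generation_prompt=True, return_tensors="pt"
    ).to(model.device)
    output = model.generate(inputs, max_new_tokens=max_new_tokens)
    # Decode only the newly generated tokens, not the prompt.
    return tokenizer.decode(output[0][inputs.shape[-1]:], skip_special_tokens=True)
```

Calling `chat("¿Cuánto es 12 × 7?")` would download the weights and run generation on the available device; adjust `device_map` and dtype to your hardware.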
For more technical details on the training methodology, refer to the research paper: Gained in Translation: Privileged Pairwise Judges Enhance Multilingual Reasoning.