Model Overview
YiXin-Distill-Qwen-72B is a 72.7-billion-parameter model developed by YiXin-AILab, derived from Qwen2.5-72B through reinforcement learning and distillation. It is optimized for mathematical reasoning and general knowledge tasks, aiming to strengthen these capabilities while remaining computationally efficient.
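A minimal inference sketch, assuming the weights are published on Hugging Face under the repo id `YiXin-AILab/YiXin-Distill-Qwen-72B` and that the standard Qwen2.5 chat template applies; adjust the id and generation settings for your deployment:

```python
# Minimal inference sketch (the repo id below is an assumption; adjust as needed).
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "YiXin-AILab/YiXin-Distill-Qwen-72B"  # assumed Hugging Face repo id

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype="auto",   # load in the checkpoint's native precision
    device_map="auto",    # shard the 72B weights across available GPUs
)

messages = [
    {"role": "user", "content": "Prove that the sum of two even integers is even."}
]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

outputs = model.generate(inputs, max_new_tokens=2048)
# Decode only the newly generated tokens, skipping the prompt.
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```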
Key Capabilities & Features
- Enhanced Reasoning: Uses a progressive two-stage distillation process to iteratively refine performance, strengthening weak reasoning patterns while mitigating overfitting (an illustrative sketch of a distillation objective follows this list).
- High-Quality Training Data: Trained on a carefully curated dataset aggregated from high-quality open-source sources, covering mathematics and general knowledge. Data undergoes rigorous filtering, quality assessment (using DeepSeek-R1 as an LLM judge), and validation, including systematic verification of mathematical solutions.
- Multilingual Support: Supports a wide range of languages including Chinese, English, French, Spanish, Portuguese, German, Italian, Russian, Japanese, Korean, Vietnamese, Thai, and Arabic.
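The exact distillation objective is not spelled out above, so the sketch below shows the classic soft-target formulation as one plausible reading: the student is trained on ground-truth tokens while its token distribution is pulled toward the teacher's. The temperature `T` and mixing weight `alpha` are illustrative hyperparameters, not values from the source.

```python
# Illustrative soft-target distillation loss (not the authors' published objective).
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels, T=2.0, alpha=0.5):
    """Blend cross-entropy on ground-truth tokens with a KL term pulling the
    student's token distribution toward the teacher's. T and alpha are
    illustrative values, not hyperparameters from the model card."""
    # Hard-label term: standard next-token cross-entropy.
    ce = F.cross_entropy(
        student_logits.view(-1, student_logits.size(-1)),
        labels.view(-1),
        ignore_index=-100,
    )
    # Soft-label term: KL(teacher || student) at temperature T.
    kd = F.kl_div(
        F.log_softmax(student_logits / T, dim=-1),
        F.softmax(teacher_logits / T, dim=-1),
        reduction="batchmean",
    ) * (T * T)  # rescale so gradients stay comparable across temperatures
    return alpha * ce + (1.0 - alpha) * kd
```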
Performance Highlights
In benchmark evaluations, YiXin-Distill-Qwen-72B outperforms comparable distilled models by an average of 5 to 11 percentage points. Notable scores include:
- MATH-500: 97.0
- AIME-25: 73.3
- MMLU-Pro: 92.6
- Overall Average: 81.8, outperforming QwQ-32B, DeepSeek-R1-Distill-Qwen-32B, and DeepSeek-R1-Distill-Llama-70B.
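Math benchmarks such as MATH-500 and AIME are conventionally scored by exact match on the final answer (often emitted inside `\boxed{...}`), and the systematic verification of mathematical solutions mentioned above plausibly relies on a similar check. A minimal sketch of such a checker, with illustrative regex and normalization rather than the authors' actual harness:

```python
# Illustrative final-answer checker for MATH-style problems
# (not the authors' actual evaluation harness).
import re

def extract_boxed(text: str) -> str | None:
    """Return the contents of the last \\boxed{...} in a model response."""
    matches = re.findall(r"\\boxed\{([^{}]*)\}", text)
    return matches[-1].strip() if matches else None

def is_correct(response: str, reference: str) -> bool:
    """Exact match after light normalization; real harnesses typically also
    compare answers symbolically (e.g. with sympy) to catch equivalent forms."""
    answer = extract_boxed(response)
    if answer is None:
        return False

    def normalize(s: str) -> str:
        return s.replace(" ", "").rstrip(".")

    return normalize(answer) == normalize(reference)

# Example: a response ending in \boxed{42} matches the reference "42".
assert is_correct(r"... therefore the answer is \boxed{42}.", "42")
```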
Limitations
Users should be aware of potential security concerns (adversarial attacks, prompt injection, data leakage), domain-specific biases, and the loss of nuanced reasoning that distillation can inherently entail.