Name: IIGroup/X-Coder-RL-Qwen2.5-7B API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: IIGroup

X-Coder-RL-Qwen2.5-7B Overview

X-Coder-RL-Qwen2.5-7B is a 7.6 billion parameter language model from IIGroup, specifically engineered for advanced code reasoning. It is built upon the IIGroup/X-Coder-SFT-Qwen2.5-7B base model and distinguishes itself through its training methodology: Reinforcement Learning with Value Regularization (RLVR) using the GRPO algorithm. This training leverages a fully synthetic dataset, IIGroup/X-Coder-RL-40k, to enhance its ability to solve competitive programming problems.

Key Capabilities

Strong Code Reasoning: Achieves robust performance on complex coding challenges, as demonstrated by its results on LiveCodeBench v5.
RL-Trained: Utilizes advanced reinforcement learning techniques (GRPO) on synthetic data for specialized optimization.
Competitive Programming Focus: Designed to excel in scenarios requiring logical deduction and problem-solving within a coding context.

Recommended Use Cases

Code Generation: Generating solutions for programming problems.
Algorithmic Problem Solving: Assisting with or solving tasks found in competitive programming environments.
Code Reasoning Tasks: Applications requiring deep understanding and logical manipulation of code.

Overview

X-Coder-RL-Qwen2.5-7B Overview

Key Capabilities

Recommended Use Cases

Full Model Card (README)