Overview
Overview
Salesforce AI Research's xRouter is an intelligent LLM routing system designed to optimize the balance between performance and cost in modern LLM deployments. It leverages reinforcement learning (DAPO) with explicit cost-aware reward shaping to learn end-to-end routing policies, eliminating the need for manual configuration. The system dynamically selects the most appropriate LLM from a pool of over 20 models, including premium and budget options, based on query complexity.
Key Capabilities
- Cost-Aware Optimization: Achieves up to 60% cost reduction while maintaining task completion rates through RL-trained policies.
- Adaptive Routing: Dynamically routes simple queries to economical models and complex queries to premium models.
- Multi-Model Orchestration: Coordinates responses from multiple LLMs for intricate reasoning tasks.
- Tool-Calling Architecture: Learns to effectively invoke and select best responses from 20+ diverse models (e.g., GPT-5, DeepSeek R1, Qwen3).
- Learned Prompt Engineering: Automatically generates optimized system prompts for target models.
- Extensive Evaluation: Evaluated on 17 diverse benchmarks covering math, coding, reasoning, and out-of-distribution tasks.
Good For
- Developers seeking to minimize LLM inference costs without sacrificing performance.
- Applications requiring dynamic model selection based on query complexity.
- Complex reasoning tasks that benefit from orchestrating multiple specialized LLMs.
- Scenarios where automated, cost-aware routing and prompt optimization are beneficial.