tencent/DRIVE-RL
TEXT GENERATIONConcurrency Cost:2Model Size:32.8BQuant:FP8Ctx Length:32kPublished:Nov 12, 2025Architecture:Transformer0.0K Cold

DRIVE-RL is a 32.8 billion parameter model developed by Tencent's Hunyuan Team, specifically designed for competitive code generation. It utilizes a Qwen2.5-32B base model and is enhanced through a novel two-stage Reinforcement Learning with Verifiable Reward (RLVR) process, focusing on data curation best practices. This model excels at solving challenging competitive programming problems, achieving state-of-the-art performance among similarly scaled models.

Loading preview...