tencent/DRIVE-SFT
TEXT GENERATIONConcurrency Cost:2Model Size:32.8BQuant:FP8Ctx Length:32kPublished:Nov 12, 2025Architecture:Transformer0.0K Cold

DRIVE-SFT is a 32 billion parameter supervised fine-tuned model developed by the Hunyuan Team at Tencent, based on Qwen2.5. It is specifically optimized for competitive code generation, utilizing a difficulty-aware sampling strategy during SFT to focus on challenging problems. This model serves as the initial stage for the DRIVE-RL pipeline, which further enhances performance on complex coding tasks through a two-stage reinforcement learning process.

Loading preview...