lixiaoxi45/DeepAgent-QwQ-32B
TEXT GENERATIONConcurrency Cost:2Model Size:32.8BQuant:FP8Ctx Length:32kPublished:Dec 30, 2025License:mitArchitecture:Transformer Open Weights Warm

DeepAgent-QwQ-32B is a 32.8 billion parameter deep reasoning agent developed by Xiaoxi Li et al. that performs autonomous thinking, tool discovery, and action execution. It is designed to overcome limitations of traditional workflows by maintaining a global perspective and dynamically discovering tools. The model features autonomous memory folding to manage long-horizon interactions and a ToolPO reinforcement learning strategy for general tool use. It excels in general tool-use tasks and downstream applications, outperforming baselines across various benchmarks.

Loading preview...