melon1891/agentbench-qwen3-4b-2stage-reasoning-20260228
Text Generation | Concurrency Cost: 1 | Model Size: 4B | Quant: BF16 | Ctx Length: 32k | Published: Feb 28, 2026 | License: apache-2.0 | Architecture: Transformer | Open Weights | Warm
melon1891/agentbench-qwen3-4b-2stage-reasoning-20260228 is a 4-billion-parameter language model fine-tuned from melon1891/agentbench-qwen3-4b-lr5e6-20260224v2 and optimized for multi-turn agent tasks. It targets complex environments such as ALFWorld and DBBench, where it learns to observe the environment, select actions, use tools, and recover from errors. The model is intended for agentic workflows that require robust reasoning and sequential decision-making.
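The observe, act, and recover cycle described above can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: `ToyEnv`, `run_episode`, and `scripted_policy` are hypothetical names, the environment is a toy stand-in rather than ALFWorld or DBBench, and the scripted policy only marks the place where a call to the model would go.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple

Turn = Tuple[str, str]  # (role, text), where role is "observation" or "action"


@dataclass
class ToyEnv:
    """Hypothetical stand-in for an ALFWorld/DBBench-style environment."""
    goal: str = "open drawer"
    done: bool = False

    def reset(self) -> str:
        self.done = False
        return "You are in a room with a drawer."

    def step(self, action: str) -> Tuple[str, bool]:
        # Returns (observation, done); any action other than the goal
        # produces an error message the agent has to recover from.
        if action == self.goal:
            self.done = True
            return "The drawer is open.", True
        return f"Error: cannot '{action}'.", False


def run_episode(env: ToyEnv,
                policy: Callable[[List[Turn]], str],
                max_turns: int = 5) -> List[Turn]:
    """Multi-turn loop: observe, choose an action, act, repeat until done."""
    history: List[Turn] = [("observation", env.reset())]
    for _ in range(max_turns):
        action = policy(history)           # action selection
        history.append(("action", action))
        obs, done = env.step(action)       # environment feedback
        history.append(("observation", obs))
        if done:
            break
    return history


def scripted_policy(history: List[Turn]) -> str:
    # Placeholder for a model call (assumption): after an error
    # observation, switch to a different action instead of repeating.
    last_obs = history[-1][1]
    return "open drawer" if last_obs.startswith("Error") else "pull drawer"


if __name__ == "__main__":
    for role, text in run_episode(ToyEnv(), scripted_policy):
        print(f"{role}: {text}")
```

In a real setup, `scripted_policy` would be replaced by a function that formats the history into a prompt and returns the model's generated action; the loop structure itself would stay the same.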