melon1891/agentbench-qwen3-4b-lr5e6-20260224v2
TEXT GENERATIONConcurrency Cost:1Model Size:4BQuant:BF16Ctx Length:32kPublished:Feb 24, 2026License:apache-2.0Architecture:Transformer Open Weights Warm
The melon1891/agentbench-qwen3-4b-lr5e6-20260224v2 is a 4 billion parameter language model fine-tuned from Qwen/Qwen3-4B-Instruct-2507. It is specifically optimized for multi-turn agent task performance, focusing on household tasks (ALFWorld) and database operations (DBBench). This model excels at learning environment observation, action selection, tool use, and error recovery within complex multi-turn trajectories, making it suitable for autonomous agent applications.
Loading preview...