choco800/qwen3-4b-agent-v16
TEXT GENERATIONConcurrency Cost:1Model Size:4BQuant:BF16Ctx Length:32kPublished:Mar 1, 2026License:apache-2.0Architecture:Transformer Open Weights Warm

choco800/qwen3-4b-agent-v16 is a 4 billion parameter language model fine-tuned from Qwen/Qwen3-4B-Instruct-2507. This model is specifically optimized for multi-turn agent task performance, particularly in environments like ALFWorld. It learns environment observation, action selection, tool use, and error recovery within complex trajectories. The model is designed to enhance an agent's ability to navigate and complete household tasks through improved reasoning over multi-turn interactions.

Loading preview...