choco800/qwen3-4b-agent-v17
TEXT GENERATIONConcurrency Cost:1Model Size:4BQuant:BF16Ctx Length:32kPublished:Mar 1, 2026License:apache-2.0Architecture:Transformer Open Weights Warm

The choco800/qwen3-4b-agent-v17 is a 4 billion parameter model, fine-tuned from Qwen/Qwen3-4B-Instruct-2507, designed to enhance multi-turn agent task performance. It specializes in tasks requiring environment observation, action selection, tool use, and error recovery, particularly within environments like ALFWorld. This model is optimized for agentic workflows, learning from assistant turns in multi-turn trajectories. It features a 32768 token context length and is provided as a fully merged model, eliminating the need to load a separate base model.

Loading preview...