choco800/qwen3-4b-agent-v13
TEXT GENERATIONConcurrency Cost:1Model Size:4BQuant:BF16Ctx Length:32kPublished:Mar 1, 2026License:apache-2.0Architecture:Transformer Open Weights Warm
The choco800/qwen3-4b-agent-v13 is a 4 billion parameter Qwen3-based instruction-tuned causal language model, fine-tuned from Qwen/Qwen3-4B-Instruct-2507. This model is specifically optimized for multi-turn agent task performance, excelling in environment observation, action selection, tool use, and error recovery within complex scenarios like ALFWorld household tasks. It features a 32K context length and is provided as a fully merged model, eliminating the need to load a separate base model.
Loading preview...