deepkick/qwen3-4b-advanced-sft-v13-merged
TEXT GENERATIONConcurrency Cost:1Model Size:4BQuant:BF16Ctx Length:32kPublished:Feb 24, 2026License:apache-2.0Architecture:Transformer Open Weights Warm

deepkick/qwen3-4b-advanced-sft-v13-merged is a 4 billion parameter language model, fine-tuned from Qwen/Qwen3-4B-Instruct-2507. This merged model is specifically optimized for advanced agentic tasks, leveraging a LoRA SFT method on the u-10bei/sft_alfworld_trajectory_dataset_v5. It is intended for use in AgentBench Advanced evaluations, offering enhanced performance for complex trajectory-based scenarios.

Loading preview...