IcyFish/Qwen3-4B-EnvTuning-Base
Text Generation | Concurrency Cost: 1 | Model Size: 4B | Quant: BF16 | Ctx Length: 32k | Published: Apr 7, 2026 | License: apache-2.0 | Architecture: Transformer | Open Weights

IcyFish/Qwen3-4B-EnvTuning-Base is a 4.0-billion-parameter causal language model that IcyFish produced by continued training of Qwen/Qwen3-4B-Instruct-2507. It implements the "Environment Tuning" paradigm, in which an agent learns by exploring an environment rather than by imitating static trajectories. The model is designed to improve agent capability in multi-turn tool-use settings under extreme data scarcity, using three components: structured curricula, actionable environment augmentation, and fine-grained progress rewards. It targets robust agent training with limited data and is reported to generalize better out of distribution than pure supervised fine-tuning baselines.
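To make the idea of a fine-grained progress reward concrete, here is a minimal sketch contrasting it with a sparse, end-of-episode reward. The description above does not give the actual reward formula, so everything here is an assumption for illustration: the `TurnResult` structure, the notion of counting completed subgoals, and both reward functions are hypothetical and are not taken from the model's training code.

```python
from dataclasses import dataclass


@dataclass
class TurnResult:
    """Outcome of one agent turn (hypothetical structure, for illustration only)."""
    subgoals_done: int   # cumulative subgoals satisfied after this turn
    subgoals_total: int  # subgoals the task defines; assumed to be > 0


def sparse_reward(turns: list[TurnResult]) -> list[float]:
    """Binary reward: 1.0 on the final turn only if every subgoal is met."""
    rewards = [0.0] * len(turns)
    if turns and turns[-1].subgoals_done == turns[-1].subgoals_total:
        rewards[-1] = 1.0
    return rewards


def progress_reward(turns: list[TurnResult]) -> list[float]:
    """Per-turn reward: the fraction of subgoals newly completed on that turn."""
    rewards = []
    best_so_far = 0
    for turn in turns:
        # Credit only new progress; a turn that regresses earns nothing.
        gained = max(turn.subgoals_done - best_so_far, 0)
        rewards.append(gained / turn.subgoals_total)
        best_so_far = max(best_so_far, turn.subgoals_done)
    return rewards


# A three-turn episode that completes 3 of 4 subgoals (made-up numbers).
episode = [TurnResult(1, 4), TurnResult(1, 4), TurnResult(3, 4)]
print(sparse_reward(episode))    # [0.0, 0.0, 0.0]
print(progress_reward(episode))  # [0.25, 0.0, 0.5]
```

In this toy episode the sparse reward gives no learning signal because the task was not fully solved, while the per-turn version still credits partial progress. That difference is one plausible reason a progress-based signal could help when training data is scarce.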
