XXHStudyHard/EnvScaler-Qwen3-4B
TEXT GENERATIONConcurrency Cost:1Model Size:4BQuant:BF16Ctx Length:32kPublished:Jan 8, 2026License:apache-2.0Architecture:Transformer Open Weights Warm

EnvScaler-Qwen3-4B is a 4 billion parameter tool-enhanced language model developed by XXHStudyHard, based on the Qwen3 architecture. It is specifically trained using the EnvScaler framework for tool-interactive agent tasks, leveraging both Supervised Fine-Tuning (SFT) and Reinforcement Learning (RL) on agent-environment interaction trajectories. This model excels at complex tasks requiring tool use and interaction within synthesized environments, offering a 40960 token context length.

Loading preview...