Name: choco800/qwen3-4b-agent-v1 API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: choco800

choco800/qwen3-4b-agent-v1: Specialized for Multi-Turn Agent Tasks

This model is a fully merged, 4 billion parameter Qwen3-based instruction-tuned model, fine-tuned by choco800 using Unsloth. Unlike standard adapter repositories, it provides merged weights, eliminating the need to load a separate base model. Its core objective is to significantly enhance multi-turn agent task performance.

Key Capabilities

Agentic Trajectory Learning: Trained specifically on agent trajectories from ALFWorld (household tasks) and DBBench (database operations).
Comprehensive Agent Skills: Learns environment observation, action selection, tool use, and error recovery within multi-turn interactions.
Targeted Loss Application: Loss is applied to all assistant turns, ensuring robust learning across the entire agentic process.
Efficient Fine-tuning: Utilizes LoRA with Unsloth for efficient training, based on Qwen/Qwen3-4B-Instruct-2507.

Good For

Developing autonomous agents requiring multi-turn reasoning and interaction.
Applications involving tool use and error recovery in structured environments.
Tasks similar to household automation or database management where agents need to follow complex trajectories.

This model is ideal for developers focused on building intelligent agents that can navigate and complete multi-step tasks effectively.

Overview

choco800/qwen3-4b-agent-v1: Specialized for Multi-Turn Agent Tasks

Key Capabilities

Good For

Full Model Card (README)