Name: yoei/qwen3-4b-agentbench-merged02 API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: yoei

Overview

yoei/qwen3-4b-agentbench-merged02 is a LoRA adapter fine-tuned from the Qwen/Qwen3-4B-Instruct-2507 base model, utilizing LoRA + Unsloth for efficient training. This repository provides only the adapter weights, requiring the base model to be loaded separately.

Key Capabilities

Enhanced Multi-Turn Agent Performance: Specifically trained to improve performance in complex, multi-turn agent tasks.
Task Domains: Optimized for household tasks (ALFWorld) and database operations (DBBench).
Trajectory Learning: Learns from full multi-turn trajectories, including environment observation, action selection, tool use, and error recovery, by applying loss to all assistant turns.

Good For

Developing intelligent agents that require sequential decision-making.
Applications involving tool use and interaction with dynamic environments.
Research and development in agentic AI and reinforcement learning from human feedback (RLHF) for complex tasks.
Users looking to leverage a 4B parameter model for efficient agentic reasoning.

Overview

Overview

Key Capabilities

Good For

Full Model Card (README)