Name: moushi21/agent-bench-alfworld-merged3 API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: moushi21

Overview

The moushi21/agent-bench-alfworld-merged3 is a 4 billion parameter model derived from Qwen/Qwen3-4B-Instruct-2507. Unlike LoRA adapters, this model integrates the fine-tuned weights directly into the base model using Unsloth's merge_and_unload method, resulting in a standalone, full-parameter model (bfloat16) optimized for efficient inference.

Key Capabilities

ALFWorld Specialization: Specifically fine-tuned for ALFWorld trajectory tasks, enabling it to process multi-turn environmental observations and select appropriate actions.
High-Speed Inference: Merged full weights ensure faster inference compared to models requiring separate LoRA loading.
Direct Deployment: Can be loaded and used like any standard Qwen3 model, simplifying integration into existing workflows.
Context Length: Trained with a maximum sequence length of 4096 tokens, suitable for complex multi-turn interactions.

Good For

Agentic AI Development: Ideal for researchers and developers working on AI agents that need to navigate and interact within simulated environments like ALFWorld.
Environmental Interaction Tasks: Excels in scenarios requiring sequential decision-making based on dynamic observations.
Efficient Deployment: Suitable for applications where fast and straightforward model deployment is critical, without the overhead of managing separate adapter weights.

Overview

Overview

Key Capabilities

Good For

Full Model Card (README)