codefuse-ai/OpAgent-32B

VISION | Concurrency Cost: 2 | Model Size: 33.4B | Quant: FP8 | Ctx Length: 32k | Published: Feb 2, 2026 | License: apache-2.0 | Architecture: Transformer

OpAgent-32B by codefuse-ai is a 33.4 billion parameter Vision-Language Model (VLM) fine-tuned for autonomous web navigation. Built on the Qwen3-VL-32B-Thinking base, it takes a natural language task description and webpage screenshots as input and outputs JSON-formatted actions for web task execution. It serves as a single-model engine for web agent applications, enabling automated interaction with web interfaces.


OpAgent-32B: A Vision-Language Model for Autonomous Web Navigation

OpAgent-32B, developed by codefuse-ai, is a 33.4 billion parameter Vision-Language Model (VLM) specifically engineered for autonomous web navigation and task execution. It is the core single-model engine within the broader OpAgent project.

Key Capabilities

  • Autonomous Web Navigation: Designed to interpret and interact with web pages to complete user-defined tasks.
  • Vision-Language Integration: Processes both natural language task descriptions and webpage screenshots as input.
  • Action Generation: Outputs structured JSON-formatted actions (e.g., click, type, scroll) or final answers, enabling direct interaction with web elements.
  • Advanced Fine-tuning: Utilizes a Hierarchical Multi-Task SFT strategy followed by Online Agentic Reinforcement Learning with a Hybrid Reward mechanism, built on the Qwen3-VL-32B-Thinking base model.
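To make the action-generation step above concrete, here is a minimal dispatch sketch for model-emitted JSON actions. The exact output schema is not documented in this card; the field names used here (`action`, `coordinate`, `text`, `direction`) are assumptions for illustration only.

```python
import json

# Hypothetical JSON action schema -- the real OpAgent-32B output format may
# differ; the field names below are assumptions, not the documented schema.
raw_output = '{"action": "type", "coordinate": [412, 237], "text": "weather tomorrow"}'

def dispatch(action_json: str) -> str:
    """Decode a model-emitted action and describe the browser step it maps to."""
    act = json.loads(action_json)
    kind = act["action"]
    if kind == "click":
        return f"click at {tuple(act['coordinate'])}"
    if kind == "type":
        return f"type {act['text']!r} at {tuple(act['coordinate'])}"
    if kind == "scroll":
        return f"scroll {act.get('direction', 'down')}"
    if kind == "answer":
        return f"final answer: {act['text']}"
    raise ValueError(f"unknown action: {kind}")

print(dispatch(raw_output))  # → type 'weather tomorrow' at (412, 237)
```

In a real agent loop, the dispatcher would drive a browser automation layer (e.g. clicking or typing at the given coordinates) and feed the resulting screenshot back to the model for the next step.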

Recommended Use Cases

OpAgent-32B is primarily intended for use as a web agent. It is optimized for deployment with high-performance inference engines like vLLM, as detailed in its single-model usage guide. Developers can integrate this model to automate complex web-based workflows, perform data extraction, or create intelligent web assistants.
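As a sketch of the vLLM deployment path mentioned above, the snippet below builds an OpenAI-compatible chat request pairing a task description with a base64-encoded screenshot, as one might send to a `vllm serve`-style endpoint. The endpoint URL and sampling parameters are placeholders, not values from the OpAgent usage guide.

```python
import base64
import json

# Assumed setup: the model is served with something like
#   vllm serve codefuse-ai/OpAgent-32B
# exposing an OpenAI-compatible /v1/chat/completions endpoint.
# The URL and parameter values below are illustrative placeholders.
ENDPOINT = "http://localhost:8000/v1/chat/completions"

def build_request(task: str, screenshot_png: bytes) -> dict:
    """Pack the task text and a webpage screenshot into one chat request."""
    image_b64 = base64.b64encode(screenshot_png).decode("ascii")
    return {
        "model": "codefuse-ai/OpAgent-32B",
        "messages": [{
            "role": "user",
            "content": [
                {"type": "text", "text": task},
                {"type": "image_url",
                 "image_url": {"url": f"data:image/png;base64,{image_b64}"}},
            ],
        }],
        "max_tokens": 512,
    }

payload = build_request("Find the checkout button", b"\x89PNG...")
# The payload would then be POSTed to ENDPOINT with any HTTP client,
# and the model's reply parsed as a JSON-formatted action.
print(json.dumps(payload)[:80])
```

This follows the multimodal message format of the OpenAI-compatible chat API that vLLM exposes; consult the OpAgent single-model usage guide for the model's actual serving flags and prompt conventions.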