Alibaba-NLP/WebSailor-32B

Hugging Face
TEXT GENERATIONConcurrency Cost:2Model Size:32.8BQuant:FP8Ctx Length:32kPublished:Aug 26, 2025License:apache-2.0Architecture:Transformer Open Weights Warm

Alibaba-NLP/WebSailor-32B is a 32.8 billion parameter language model developed by Alibaba-NLP, specifically designed for sophisticated web navigation and information-seeking tasks. It utilizes a post-training methodology to teach LLM agents complex reasoning for web environments, addressing high uncertainty in vast information landscapes. The model excels at difficult benchmarks like BrowseComp-en and BrowseComp-zh, establishing a new state-of-the-art for open-source agents in web exploration.

Loading preview...

Overview

Alibaba-NLP/WebSailor-32B is a 32.8 billion parameter model developed by Alibaba-NLP, focusing on advanced web navigation and information-seeking. It introduces a comprehensive post-training methodology called WebSailor to equip LLM agents with sophisticated reasoning capabilities for complex web tasks, particularly those involving high uncertainty and non-linear solution paths. This approach aims to close the performance gap between open-source models and proprietary systems in web agent applications.

Key Capabilities

  • Sophisticated Web Reasoning: Designed to handle extreme uncertainty in vast information landscapes, enabling complex web navigation and information retrieval.
  • Novel Task Generation: Utilizes SailorFog-QA, a data synthesis pipeline, to create challenging Level 3 information-seeking tasks with high initial uncertainty and complex knowledge graphs.
  • Efficient Training Methodology: Employs a multi-stage training process including rejection sampling fine-tuning (RFT) for a "cold start" and Duplicating Sampling Policy Optimization (DUPO) for refining exploratory strategies.
  • State-of-the-Art Performance: Achieves outstanding results on difficult benchmarks such as BrowseComp-en and BrowseComp-zh, outperforming larger models and matching proprietary systems like Doubao-Search.

Good For

  • Developing LLM agents for complex web navigation.
  • Applications requiring advanced information-seeking in uncertain web environments.
  • Research and development of open-source web agents that compete with proprietary solutions.