Fugu-MT 論文翻訳(概要): WebResearcher: Unleashing unbounded reasoning capability in Long-Horizon Agents

論文の概要: WebResearcher: Unleashing unbounded reasoning capability in Long-Horizon Agents

arxiv url: http://arxiv.org/abs/2509.13309v2
Date: Sat, 20 Sep 2025 14:27:52 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-23 12:27:33.48367
Title: WebResearcher: Unleashing unbounded reasoning capability in Long-Horizon Agents
Title（参考訳）: WebResearcher:Long-Horizon Agentsにおける非有界推論能力の開放
Authors: Zile Qiao, Guoxin Chen, Xuanzhong Chen, Donglei Yu, Wenbiao Yin, Xinyu Wang, Zhen Zhang, Baixuan Li, Huifeng Yin, Kuan Li, Rui Min, Minpeng Liao, Yong Jiang, Pengjun Xie, Fei Huang, Jingren Zhou,
Abstract要約: WebResearcherは、マルコフ決定プロセスとしてディープリサーチを再構築する反復的なディープリサーチパラダイムである。 WebResearcherは最先端のパフォーマンスを実現し、フロンティアのプロプライエタリシステムを超えています。
参考スコア（独自算出の注目度）: 72.28593628378991
License: http://creativecommons.org/publicdomain/zero/1.0/
Abstract: Recent advances in deep-research systems have demonstrated the potential for AI agents to autonomously discover and synthesize knowledge from external sources. In this paper, we introduce WebResearcher, a novel framework for building such agents through two key components: (1) WebResearcher, an iterative deep-research paradigm that reformulates deep research as a Markov Decision Process, where agents periodically consolidate findings into evolving reports while maintaining focused workspaces, overcoming the context suffocation and noise contamination that plague existing mono-contextual approaches; and (2) WebFrontier, a scalable data synthesis engine that generates high-quality training data through tool-augmented complexity escalation, enabling systematic creation of research tasks that bridge the gap between passive knowledge recall and active knowledge construction. Notably, we find that the training data from our paradigm significantly enhances tool-use capabilities even for traditional mono-contextual methods. Furthermore, our paradigm naturally scales through parallel thinking, enabling concurrent multi-agent exploration for more comprehensive conclusions. Extensive experiments across 6 challenging benchmarks demonstrate that WebResearcher achieves state-of-the-art performance, even surpassing frontier proprietary systems.
Abstract（参考訳）: 近年のディープラーニングシステムの進歩は、AIエージェントが外部からの知識を自律的に発見し、合成する可能性を実証している。本稿では,これらのエージェントを構築するための新しいフレームワークであるWebResearcherを紹介する。(1)深い研究をマルコフ決定プロセスとして再編成する反復的ディープリサーチのパラダイムであるWebResearcherを導入し,焦点を絞ったワークスペースを維持しながら発見を進化レポートに定期的に統合し,既存のモノコンテクスチュアルアプローチを悩ませるコンテキスト妨害やノイズ汚染を克服し,(2)ツール強化された複雑性エスカレーションを通じて高品質なトレーニングデータを生成するスケーラブルなデータ合成エンジンであるWebFrontierを,知能知識のリコールとアクティブな知識構築のギャップを埋める研究タスクの体系的作成を可能にする。特に,従来のモノコンテクスト手法であっても,我々のパラダイムからのトレーニングデータがツール利用能力を大幅に向上させることがわかった。さらに,本パラダイムは並列思考を通じて自然にスケールし,より包括的な結論を得るための同時マルチエージェント探索を可能にする。 6つの挑戦的なベンチマークにわたる大規模な実験は、WebResearcherが最先端のパフォーマンスを達成し、フロンティアプロプライエタリなシステムを超えていることを示している。

論文の概要: WebResearcher: Unleashing unbounded reasoning capability in Long-Horizon Agents

関連論文リスト