Fugu-MT 論文翻訳(概要): OpenSeeker-v2: Pushing the Limits of Search Agents with Informative and High-Difficulty Trajectories

論文の概要: OpenSeeker-v2: Pushing the Limits of Search Agents with Informative and High-Difficulty Trajectories

arxiv url: http://arxiv.org/abs/2605.04036v1
Date: Tue, 05 May 2026 17:55:25 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-06 19:35:44.073785
Title: OpenSeeker-v2: Pushing the Limits of Search Agents with Informative and High-Difficulty Trajectories
Title（参考訳）: OpenSeeker-v2: Informative and High-Diffulty Trajectoryによる検索エージェントの限界を押し上げる
Authors: Yuwen Du, Rui Ye, Shuo Tang, Keduan Huang, Xinyu Zhu, Yuzhu Cai, Siheng Chen,
Abstract要約: 簡単な教師付き微調整アプローチが、フロンティア検索エージェントの訓練に驚くほど強力であることを示す。 OpenSeeker-v2は、4つのベンチマーク(30BサイズのReActパラダイムを持つエージェント)で最先端のパフォーマンスを実現しています。BrowseCompで46.4%、BrowseComp-ZHで58.1%、HumanityのLast Examで34.6%、xbenchで78.0%です。 OpenSeeker-v2モデルの重み付けをオープンソースとして公開し、フロンティア検索エージェントの研究をよりコミュニティに利用できるようにするための、シンプルで効果的な結果を共有することを楽しみにしています。
参考スコア（独自算出の注目度）: 43.841018840819494
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Deep search capabilities have become an indispensable competency for frontier Large Language Model (LLM) agents, yet their development remains dominated by industrial giants. The typical industry recipe involves a highly resource-intensive pipeline spanning pre-training, continual pre-training (CPT), supervised fine-tuning (SFT), and reinforcement learning (RL). In this report, we show that when fueled with informative and high-difficulty trajectories, a simple SFT approach could be surprisingly powerful for training frontier search agents. By introducing three simple data synthesis modifications: scaling knowledge graph size for richer exploration, expanding the tool set size for broader functionality, and strict low-step filtering, we establish a stronger baseline. Trained on merely 10.6k data points, our OpenSeeker-v2 achieves state-of-the-art performance across 4 benchmarks (30B-sized agents with ReAct paradigm): 46.0% on BrowseComp, 58.1% on BrowseComp-ZH, 34.6% on Humanity's Last Exam, and 78.0% on xbench, surpassing even Tongyi DeepResearch trained with heavy CPT+SFT+RL pipeline, which achieves 43.4%, 46.7%, 32.9%, and 75.0%, respectively. Notably, OpenSeeker-v2 represents the first state-of-the-art search agent within its model scale and paradigm to be developed by a purely academic team using only SFT. We are excited to open-source the OpenSeeker-v2 model weights and share our simple yet effective findings to make frontier search agent research more accessible to the community.
Abstract（参考訳）: 深層探索能力は、フロンティア大言語モデル(LLM)エージェントにとって欠かせない能力となっているが、その開発はいまだ産業巨人に支配されている。典型的な産業レシピは、事前訓練、継続事前訓練(CPT)、教師付き微調整(SFT)、強化学習(RL)にまたがる非常にリソース集約的なパイプラインである。本報告では,情報および高拡散性軌跡を併用することで,フロンティアサーチエージェントの訓練において,単純なSFTアプローチが驚くほど強力である可能性が示唆された。よりリッチな探索のために知識グラフのサイズを拡大し、より広い機能のためにツールセットのサイズを拡大し、厳密な低ステップフィルタリングという3つの単純なデータ合成修正を導入することで、より強力なベースラインを確立します。たった10.6kのデータポイントでトレーニングされたOpenSeeker-v2は、4つのベンチマーク(30BサイズのReActパラダイムを持つエージェント)で最先端のパフォーマンスを達成した。BrowseCompでは46.0%、BrowseComp-ZHでは58.1%、HumanityのLast Examでは34.0%、xbenchでは78.0%、それぞれ43.4%、46.7%、32.9%、75.0%である。特にOpenSeeker-v2は、SFTのみを使用した純粋に学術的なチームによって開発された、最初の最先端の検索エージェントである。 OpenSeeker-v2モデルの重み付けをオープンソースとして公開し、フロンティア検索エージェントの研究をよりコミュニティに利用できるようにするための、シンプルで効果的な結果を共有することを楽しみにしています。

論文の概要: OpenSeeker-v2: Pushing the Limits of Search Agents with Informative and High-Difficulty Trajectories

関連論文リスト