Fugu-MT 論文翻訳(概要): OpenResearcher: A Fully Open Pipeline for Long-Horizon Deep Research Trajectory Synthesis

論文の概要: OpenResearcher: A Fully Open Pipeline for Long-Horizon Deep Research Trajectory Synthesis

arxiv url: http://arxiv.org/abs/2603.20278v1
Date: Tue, 17 Mar 2026 20:10:12 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-24 19:11:38.793776
Title: OpenResearcher: A Fully Open Pipeline for Long-Horizon Deep Research Trajectory Synthesis
Title（参考訳）: OpenResearcher: 長距離深部研究軌道合成のための完全なオープンパイプライン
Authors: Zhuofeng Li, Dongfu Jiang, Xueguang Ma, Haoxiang Zhang, Ping Nie, Yuyu Zhang, Kai Zou, Jianwen Xie, Yu Zhang, Wenhu Chen,
Abstract要約: 深層研究エージェントの訓練には、探索、証拠収集、多段階推論をインターリーブする長い水平軌道が必要である。既存のデータ収集パイプラインは、一般的にプロプライエタリなWeb APIに依存しており、大規模な軌道合成をコストが高く、不安定で、再現が難しい。再生可能なパイプラインであるOpenResearcherは,複数ターン軌道合成から1回のコーパスブートストラップを分離する。
参考スコア（独自算出の注目度）: 63.8655724265611
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Training deep research agents requires long-horizon trajectories that interleave search, evidence aggregation, and multi-step reasoning. However, existing data collection pipelines typically rely on proprietary web APIs, making large-scale trajectory synthesis costly, unstable, and difficult to reproduce. We present OpenResearcher, a reproducible pipeline that decouples one-time corpus bootstrapping from multi-turn trajectory synthesis and executes the search-and-browse loop entirely offline using three explicit browser primitives: search, open, and find, over a 15M-document corpus. Using GPT-OSS-120B as the teacher model, we synthesize over 97K trajectories, including a substantial long-horizon tail with 100+ tool calls. Supervised fine-tuning a 30B-A3B backbone on these trajectories achieves 54.8\% accuracy on BrowseComp-Plus, a +34.0 point improvement over the base model, while remaining competitive on BrowseComp, GAIA, and xbench-DeepSearch. Because the environment is offline and fully instrumented, it also enables controlled analysis, where our study reveals practical insights into deep research pipeline design, including data filtering strategies, agent configuration choices, and how retrieval success relates to final answer accuracy. We release the pipeline, synthesized trajectories, model checkpoints, and the offline search environment at https://github.com/TIGER-AI-Lab/OpenResearcher.
Abstract（参考訳）: 深層研究エージェントの訓練には、探索、証拠収集、多段階推論をインターリーブする長い水平軌道が必要である。しかし、既存のデータ収集パイプラインは一般的にプロプライエタリなWeb APIに依存しており、大規模な軌道合成をコストが高く、不安定で、再現が難しい。複数行の軌道合成から1回のブートストラップを分離し,3つの明示的なブラウザプリミティブ(検索,オープン,検索)を用いて,検索とブラウズループを完全にオフラインで実行する,再現可能なパイプラインであるOpenResearcherを提案する。 GPT-OSS-120Bを教師モデルとして使用し,100以上のツールコールを備えた長方形テールを含む97K以上の軌道を合成する。これらの軌道上で30B-A3Bのバックボーンを微調整することで、BrowseComp-Plusでは54.8\%の精度が達成され、ベースモデルでは+34.0ポイント改善され、BrowseComp、GAIA、xbench-DeepSearchでは競争力を維持した。そこで本研究では,データフィルタリング戦略,エージェント設定の選択,検索成功が最終回答の正確性にどのように関係するかなど,深層研究パイプライン設計に関する実践的な知見を明らかにする。パイプライン、合成トラジェクトリ、モデルチェックポイント、オフライン検索環境をhttps://github.com/TIGER-AI-Lab/OpenResearcherでリリースします。

論文の概要: OpenResearcher: A Fully Open Pipeline for Long-Horizon Deep Research Trajectory Synthesis

関連論文リスト