Fugu-MT 論文翻訳(概要): Deep Researcher with Sequential Plan Reflection and Candidates Crossover (Deep Researcher Reflect Evolve)

論文の概要: Deep Researcher with Sequential Plan Reflection and Candidates Crossover (Deep Researcher Reflect Evolve)

arxiv url: http://arxiv.org/abs/2601.20843v1
Date: Wed, 28 Jan 2026 18:45:39 GMT
ステータス: 翻訳完了
システム内更新日: 2026-01-29 15:46:07.098875
Title: Deep Researcher with Sequential Plan Reflection and Candidates Crossover (Deep Researcher Reflect Evolve)
Title（参考訳）: 逐次計画反射と候補交叉による深部研究員(深部研究員反射運動)
Authors: Saurav Prateek,
Abstract要約: 本稿では、複雑なPhDレベルのトピックに関する詳細な研究レポートを作成するために設計された、新しいDeep Researcherアーキテクチャを提案する。本システムでは,リフレクションによる逐次研究計画修正と候補クロスオーバーアルゴリズムという,2つの重要なイノベーションを活用している。我々の建築は46.21点を達成し、先進的な深層研究エージェントを超越して優れた性能を示した。
参考スコア（独自算出の注目度）: 0.0
License: http://creativecommons.org/licenses/by/4.0/
Abstract: This paper introduces a novel Deep Researcher architecture designed to generate detailed research reports on complex PhD level topics by addressing the inherent limitations of the Parallel Scaling paradigm. Our system utilizes two key innovations: Sequential Research Plan Refinement via Reflection and a Candidates Crossover algorithm. The sequential refinement process is demonstrated as an efficient method that allows the agent to maintain a centralized Global Research Context, enabling it to look back at current progress, reason about the research plan, and intelligently make changes at runtime. This dynamic adaptation contrasts with parallel approaches, which often suffer from siloed knowledge. The Candidates Crossover algorithm further enhances search efficiency by deploying multiple LLM candidates with varied parameters to explore a larger search space, with their findings synthesized to curate a comprehensive final research response. The process concludes with One Shot Report Generation, ensuring the final document is informed by a unified narrative and high fact density. Powered by the Gemini 2.5 Pro model, our Deep Researcher was evaluated on the DeepResearch Bench, a globally recognized benchmark of 100 doctoral level research tasks. Our architecture achieved an overall score of 46.21, demonstrating superior performance by surpassing leading deep research agents such as Claude Researcher, Nvidia AIQ Research Assistant, Perplexity Research, Kimi Researcher and Grok Deeper Search present on the DeepResearch Bench actively running leaderboard. This performance marginally exceeds our previous work, Static DRA, and reinforces the finding that sequential scaling consistently outperforms the parallel self consistency paradigm.
Abstract（参考訳）: 本稿では,Parallel Scalingパラダイムの本質的な制約に対処して,複雑なPhDレベルのトピックに関する詳細な研究レポートを生成するために設計された,新しいDeep Researcherアーキテクチャを提案する。本システムでは,リフレクションによる逐次研究計画修正と候補クロスオーバーアルゴリズムという,2つの重要なイノベーションを活用している。シーケンシャルリファインメントプロセスは、エージェントが集中したグローバルリサーチコンテキストを維持でき、現在の進捗を振り返り、研究計画について推論し、実行時に知的に変更を加えることができる効率的な方法として実証される。この動的適応は、しばしばサイロ知識に悩まされる平行アプローチとは対照的である。 The Candidates Crossover algorithm has further further search efficiency by deploy multiple LLM candidate with various parameters to explore a larger search space, their findings synthesised to curate a comprehensive final research response。このプロセスは、最終文書が統一された物語と高い事実密度によって通知されることを保証するOne Shot Report Generationで締めくくられる。 Gemini 2.5 Proモデルによって、私たちのDeep Researcherは、世界中の100の博士レベルの研究タスクのベンチマークであるDeepResearch Benchで評価されました。私たちのアーキテクチャは46.21という総合的なスコアを獲得し、Claude Researcher、Nvidia AIQ Research Assistant、Perplexity Research、Kim Researcher、Grok Deeper Searchといった、DeepResearch Benchがアクティブに運営しているリーダーボードを上回り、優れたパフォーマンスを示している。このパフォーマンスは、これまでの作業であるStatic DRAをわずかに上回り、シーケンシャルなスケーリングが並列的な自己整合パラダイムを一貫して上回っていることの発見を強化します。

論文の概要: Deep Researcher with Sequential Plan Reflection and Candidates Crossover (Deep Researcher Reflect Evolve)

関連論文リスト