Fugu-MT 論文翻訳(概要): DuMate-DeepResearch: An Auditable Multi-Agent System with Recursive Search and Rubric-Grounded Reasoning

論文の概要: DuMate-DeepResearch: An Auditable Multi-Agent System with Recursive Search and Rubric-Grounded Reasoning

arxiv url: http://arxiv.org/abs/2606.07299v1
Date: Fri, 05 Jun 2026 14:10:48 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-08 14:33:29.771317
Title: DuMate-DeepResearch: An Auditable Multi-Agent System with Recursive Search and Rubric-Grounded Reasoning
Title（参考訳）: DuMate-DeepResearch:Recursive SearchとRubric-Grounded Reasoningを用いた聴取可能なマルチエージェントシステム
Authors: Lingyong Yan, Can Xu, Yukun Zhao, Wenxuan Li, Qingyang Chen, Jiulong Wu, Wenli Song, Xiangnan Li, Weixian Shi, Yiqun Chen, Xuchen Ma, Yuchen Li, Jiashu Zhao, Shuaiqiang Wang, Jianmin Wu, Dawei Yin,
Abstract要約: Deep Research(DR)は、複雑でオープンな研究課題に取り組むための新しいエージェントパラダイムとして登場した。本稿では,Qianfan Agent Foundry上に構築されたマルチエージェントDRフレームワークであるDuMate-DeepResearchを紹介する。
参考スコア（独自算出の注目度）: 58.09293442211248
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: Deep Research (DR) has emerged as a new agentic paradigm to tackle complex, open-ended research tasks, demanding systems that can iteratively frame problems, acquire evidence, verify sources, and synthesize long-form reports. In practice, however, current DR systems are constrained by four interrelated limitations: long-horizon planning over an underspecified scope, the bottleneck of decomposing and scheduling such tasks within a single agent, hallucination risk in long-form synthesis, and limited process auditability. This technical report presents DuMate-DeepResearch, a multi-agent DR framework built on the Qianfan Agent Foundry. The framework decouples the Agent Core, which handles task understanding, planning, and scheduling, from an extensible Tool Ecosystem for retrieval, evidence acquisition, and report rendering, making every intermediate decision and tool invocation explicitly traceable. Building on this infrastructure, DuMate-DeepResearch further introduces three mechanisms: (i) a graph-based dynamic planning strategy expands the research roadmap coarse-to-fine and continuously revises it through reflection, re-planning, backtracking, and parallel branching; (ii) a recursive two-level execution design delegates each complex search sub-task to an inner Search Agent that runs its own planning loop, isolating noisy retrieval and stabilizing long-horizon execution; (iii) a rubric-based test-time optimization mechanism dynamically generates task-specific quality criteria and uses them as live reasoning scaffolds for evidence-grounded synthesis and adaptive stopping. Across two deep research benchmarks, DuMate-DeepResearch establishes new state-of-the-art results: the best overall score (58.03%) on DeepResearch Bench, and the best overall score (61.95%) on DeepResearch Bench II while ranking first in information recall and analysis.
Abstract（参考訳）: Deep Research(DR)は、複雑でオープンな研究課題に取り組むための新しいエージェントパラダイムとして登場し、問題を反復的にフレーム化し、証拠を取得し、情報源を確認し、ロングフォームなレポートを合成するシステムを要求している。しかし、実際には、現在のDRシステムは、不特定範囲における長期計画、単一エージェント内でのタスクの分解とスケジューリングのボトルネック、長期合成における幻覚リスク、プロセス監査可能性の4つの制限によって制限されている。この技術レポートでは、Qianfan Agent Foundry上に構築されたマルチエージェントDRフレームワークであるDuMate-DeepResearchを紹介します。このフレームワークは、タスク理解、計画、スケジューリングを処理するエージェントコアを、検索、エビデンス取得、レポートレンダリングのための拡張可能なツールエコシステムから切り離し、中間的な決定とツールの実行を明示的にトレース可能にする。このインフラストラクチャ上に構築されたDuMate-DeepResearchはさらに3つのメカニズムを紹介している。 i) グラフベースの動的計画戦略は、研究ロードマップを粗大に拡大し、リフレクション、再計画、バックトラック、並列ブランチを通じて継続的に更新する。 2) 再帰的二段階実行設計は、各複合検索サブタスクを、独自の計画ループを実行し、ノイズ検索を分離し、長期実行を安定化する内部検索エージェントに委譲する。三ルーブリックベースのテストタイム最適化機構は、タスク固有の品質基準を動的に生成し、エビデンス基底合成と適応停止のためのライブ推論足場として利用する。 2つのディープリサーチベンチマークで、DuMate-DeepResearchは、DeepResearch Benchの最高スコア(58.03%)とDeepResearch Bench IIの最高スコア(61.95%)という、新しい最先端の結果を確立している。

論文の概要: DuMate-DeepResearch: An Auditable Multi-Agent System with Recursive Search and Rubric-Grounded Reasoning

関連論文リスト