Fugu-MT 論文翻訳(概要): DualSpec: Accelerating Deep Research Agents via Dual-Process Action Speculation

論文の概要: DualSpec: Accelerating Deep Research Agents via Dual-Process Action Speculation

arxiv url: http://arxiv.org/abs/2603.07416v1
Date: Sun, 08 Mar 2026 02:03:58 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-10 15:13:14.497683
Title: DualSpec: Accelerating Deep Research Agents via Dual-Process Action Speculation
Title（参考訳）: DualSpec:Dual-Process Action Speculationによるディープリサーチエージェントの高速化
Authors: Shuzhang Zhong, Baotong Lu, Qi Chen, Chuanjie Liu, Fan Yang, Meng Li,
Abstract要約: 推論フレームワークは、推論でアクションの実行を重複させることでレイテンシを低減することを目的としている。既存のアプローチは通常、一様投機戦略と厳密なアクションマッチングに依存している。軽量な信頼性に基づくセマンティック検証機能を備えた異種投機フレームワークであるDualSpecを提案する。
参考スコア（独自算出の注目度）: 10.076688905947103
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Large language model-based deep research agents have been increasingly popular for addressing long-horizon information-seeking tasks, but they often incur high end-to-end latency due to extensive reasoning and frequent tool use. Speculation frameworks aim to reduce latency by overlapping action execution with reasoning; however, existing approaches typically rely on uniform speculation strategies and strict action matching, which limits inference speedups and robustness. In this work, we revisit the speculate-verify paradigm for deep research agents through the lens of action heterogeneity. We show that \textit{Search} and \textit{Visit} actions exhibit fundamentally different reasoning and model capacity requirements: entropy-based analysis reveals that Search decisions have higher uncertainty and benefit significantly from explicit reasoning, whereas Visit decisions have lower entropy and depend primarily on model capacity. Motivated by this dual-process characteristic, we propose DualSpec, a heterogeneous speculation framework equipped with a lightweight, confidence-based semantic verifier. Experiments across multiple models and benchmarks demonstrate that DualSpec achieves up to 3.28$\times$ end-to-end speedup while maintaining accuracy comparable to fully reasoning agents.
Abstract（参考訳）: 大規模言語モデルに基づくディープリサーチエージェントは、長い水平情報探索タスクに対処するために人気が高まっているが、広範囲な推論と頻繁なツール使用のために、しばしばエンドツーエンドの遅延が発生する。推論フレームワークは、推論と重なり合うアクション実行によってレイテンシを低減することを目的としているが、既存のアプローチでは、推論のスピードアップとロバスト性を制限する、一様の投機戦略と厳密なアクションマッチングに依存するのが一般的である。本研究では, 深部研究エージェントに対する投機的検証パラダイムを, アクションの不均一性のレンズを通して再検討する。エントロピーに基づく分析では、検索決定は明確な推論から高い不確実性を持ち、利点は大きいが、訪問決定はエントロピーが低く、主にモデルキャパシティに依存している。このデュアルプロセス特性に触発されたDualSpecは、軽量な信頼性に基づくセマンティック検証機能を備えた異種投機フレームワークである。複数のモデルやベンチマークでの実験では、DualSpecは最大3.28$\times$ end-to-endのスピードアップを実現し、完全推論エージェントに匹敵する精度を維持している。

論文の概要: DualSpec: Accelerating Deep Research Agents via Dual-Process Action Speculation

関連論文リスト