Fugu-MT 論文翻訳(概要): STAR: Failure-Aware Markovian Routing for Multi-Agent Spatiotemporal Reasoning

論文の概要: STAR: Failure-Aware Markovian Routing for Multi-Agent Spatiotemporal Reasoning

arxiv url: http://arxiv.org/abs/2605.10057v2
Date: Tue, 12 May 2026 04:38:03 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-13 18:21:07.04837
Title: STAR: Failure-Aware Markovian Routing for Multi-Agent Spatiotemporal Reasoning
Title（参考訳）: STAR:マルチエージェント時空間推論のためのフェールアウェアマルコフ型ルーティング
Authors: Ruiyi Yang, Lihuan Li, Hao Xue, Flora D. Salim,
Abstract要約: 本稿では,現在のエージェント,タスクタイプ,型付き実行状態に対する状態条件付きポリシとして,エージェント間制御を外部化するフェールアウェアルーティングフレームワークSTARを提案する。障害状態のトレースは異なるため、ルータは不正な出力、依存関係の欠如、ツール-グラウンド抽出-- 計算-デポジットプロトコルに異なる応答をし、中間結果を共有ブラックボードに書き込み、下流融合を行う。その結果、トレーニング中にトレースが失敗すると、ルーティングポリシのエラー状態のサポートが拡大し、成功のみのトレーニングでは表現できないリカバリ移行が可能になることが証明された。
参考スコア（独自算出の注目度）: 13.329339723676147
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: Compositional spatiotemporal reasoning often requires a system to invoke multiple heterogeneous specialists, such as geometric, temporal, topological, and trajectory agents. A central question is how such a system should route among specialists when execution does not simply succeed or fail, but fails in qualitatively different ways. Existing tool-augmented and multi-agent LLM systems typically leave this routing decision implicit in language generation, making recovery ad hoc, difficult to interpret, and hard to optimize. This paper presents STAR (Spatio-Temporal Agent Router), a failure-aware routing framework that externalizes inter-agent control as a state-conditioned transition policy over the current agent, task type, and typed execution status. At the center of STARis an agent routing matrix that combines expert-specified nominal routes with recovery transitions learned from execution traces. Because the matrix conditions on distinct failure states, the router can respond differently to malformed outputs, missing dependencies, and tool--query mismatches, rather than collapsing them into a generic retry signal. Specialists execute through a tool-grounded extract--compute--deposit protocol and write intermediate results to a shared blackboard for downstream fusion. Results prove that retaining unsuccessful traces during training enlarges the support of the routing policy on error states, enabling recovery transitions that success-only training cannot represent. Across three spatiotemporal benchmarks and eight backbone LLMs, STAR improves over multiple baselines with the clearest gains on queries whose execution deviates from the nominal routing path. Router-specific ablations and recovery analyses further show that typed failure-aware routing, rather than specialist composition alone, is a key factor for these improvements.
Abstract（参考訳）: 構成時空間推論は、幾何学的、時間的、トポロジカル、軌跡的エージェントなど、複数の異種の専門家を呼び出すシステムを必要とすることが多い。中心的な問題は、そのようなシステムが単に成功したり失敗したりするのではなく、質的に異なる方法で失敗した場合、専門家の間でどのようにルートするべきかである。既存のツール拡張およびマルチエージェントLLMシステムは、言語生成においてこのルーティング決定を暗黙に残し、リカバリをアドホックにし、解釈が難しく、最適化が難しい。本稿では,現在のエージェント,タスクタイプ,型付き実行状態に対する状態条件遷移ポリシとして,エージェント間制御を外部化する障害対応ルーティングフレームワークSTAR(Spatio-Temporal Agent Router)を提案する。 STARisの中央には、専門家が特定した名目ルートと、実行トレースから学んだリカバリトランジションを組み合わせたエージェントルーティング行列がある。異なる障害状態の行列条件のため、ルータは一般的な再試行信号に分解するのではなく、不正な出力、依存関係の欠如、ツールクエリミスマッチに対して異なる応答をすることができる。専門家はツールグラウンドの抽出--compute--depositプロトコルを通じて実行し、中間結果を共有ブラックボードに書き込み、下流の核融合を行う。その結果、トレーニング中にトレースが失敗すると、ルーティングポリシのエラー状態のサポートが拡大し、成功のみのトレーニングでは表現できないリカバリ移行が可能になることが証明された。 3つの時空間ベンチマークと8つのバックボーンLDMで、STARは、名前付きルーティングパスから逸脱したクエリに対して、最も明確なゲインで、複数のベースラインを改善する。ルータ固有のアブリゲーションとリカバリ分析により、これらの改善の鍵となる要因は、専門的な構成ではなく、型付き障害対応ルーティングであることが示された。

論文の概要: STAR: Failure-Aware Markovian Routing for Multi-Agent Spatiotemporal Reasoning

関連論文リスト