Fugu-MT 論文翻訳(概要): From Assumptions to Actions: Turning LLM Reasoning into Uncertainty-Aware Planning for Embodied Agents

論文の概要: From Assumptions to Actions: Turning LLM Reasoning into Uncertainty-Aware Planning for Embodied Agents

arxiv url: http://arxiv.org/abs/2602.04326v1
Date: Wed, 04 Feb 2026 08:43:39 GMT
ステータス: 翻訳完了
システム内更新日: 2026-02-05 19:45:11.438203
Title: From Assumptions to Actions: Turning LLM Reasoning into Uncertainty-Aware Planning for Embodied Agents
Title（参考訳）: 想定から行動へ:LLM推論を不確実性を考慮したエージェントの計画に転換する
Authors: SeungWon Seo, SooBin Lim, SeongRae Noh, Haneul Kim, HyeongYeop Kang,
Abstract要約: マルチエージェントで活動し、部分的に観察可能で、分散化された環境では、広範囲にわたる不確実性にもかかわらず計画し行動しなければならない。我々は,大規模言語モデルで潜在する断片化仮定を構造化決定木に変換するPlanner-Composer-EvaluatorフレームワークであるPCEを紹介する。また, PCEは, トークン使用率とタスク効率において, コミュニケーション中心のベースラインを一貫して上回り, トークン使用率と同等であることを示す。
参考スコア（独自算出の注目度）: 5.817643726988822
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Embodied agents operating in multi-agent, partially observable, and decentralized environments must plan and act despite pervasive uncertainty about hidden objects and collaborators' intentions. Recent advances in applying Large Language Models (LLMs) to embodied agents have addressed many long-standing challenges, such as high-level goal decomposition and online adaptation. Yet, uncertainty is still primarily mitigated through frequent inter-agent communication. This incurs substantial token and time costs, and can disrupt established workflows, when human partners are involved. We introduce PCE, a Planner-Composer-Evaluator framework that converts the fragmented assumptions latent in LLM reasoning traces into a structured decision tree. Internal nodes encode environment assumptions and leaves map to actions; each path is then scored by scenario likelihood, goal-directed gain, and execution cost to guide rational action selection without heavy communication. Across two challenging multi-agent benchmarks (C-WAH and TDW-MAT) and three diverse LLM backbones, PCE consistently outperforms communication-centric baselines in success rate and task efficiency while showing comparable token usage. Ablation results indicate that the performance gains obtained by scaling model capacity or reasoning depth persist even when PCE is applied, while PCE consistently raises the baseline across both capacity and reasoning-depth scales, confirming that structured uncertainty handling complements both forms of scaling. A user study further demonstrates that PCE produces communication patterns that human partners perceive as more efficient and trustworthy. Together, these results establish a principled route for turning latent LLM assumptions into reliable strategies for uncertainty-aware planning.
Abstract（参考訳）: マルチエージェント、部分的に観察可能、分散化された環境で活動する身体エージェントは、隠された物体や協力者の意図に対する広範囲な不確実性にもかかわらず、計画し行動しなければならない。エンボディエージェントにLarge Language Models(LLM)を適用する最近の進歩は、高レベルの目標分解やオンライン適応など、長年にわたる課題に対処してきた。しかし、不確実性は、多くの場合、エージェント間通信によって緩和される。これは相当なトークンと時間的コストをもたらし、人間のパートナーが関与する場合、確立したワークフローを混乱させる可能性がある。我々は,LLM推論に潜む断片化された仮定を構造化決定木に変換するPlanner-Composer-EvaluatorフレームワークであるPCEを紹介する。内部ノードは環境の仮定を符号化してアクションにマップし、それぞれのパスはシナリオ可能性、目標指向のゲイン、そして実行コストによってスコアされ、重いコミュニケーションなしに合理的なアクション選択を導く。 2つの挑戦的なマルチエージェントベンチマーク(C-WAHとTDW-MAT)と3つの多様なLCMバックボーンにおいて、PCEは、同等のトークン使用率を示しながら、成功率とタスク効率においてコミュニケーション中心のベースラインを一貫して上回っている。その結果,PCEが適用してもモデルキャパシティのスケーリングや推論深度が持続する一方で,PCEはキャパシティと推論深度の両方のベースラインを一貫して引き上げ,構造的不確実性処理が両方のスケーリング形式を補完することを確認した。ユーザー研究により、PCEは人間のパートナーがより効率的で信頼できると考えるコミュニケーションパターンを生み出すことが示されている。これらの結果と合わせて、潜伏LLM仮定を不確実性を考慮した計画のための信頼性の高い戦略に変換するための原則的経路を確立した。

論文の概要: From Assumptions to Actions: Turning LLM Reasoning into Uncertainty-Aware Planning for Embodied Agents

関連論文リスト