Fugu-MT 論文翻訳(概要): Information Seeking for Robust Decision Making under Partial Observability

論文の概要: Information Seeking for Robust Decision Making under Partial Observability

arxiv url: http://arxiv.org/abs/2510.01531v1
Date: Thu, 02 Oct 2025 00:06:32 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-03 16:59:20.911833
Title: Information Seeking for Robust Decision Making under Partial Observability
Title（参考訳）: 部分観測可能性を考慮したロバスト意思決定のための情報探索
Authors: Djengo Cyun-Jyun Fang, Tsung-Wei Ke,
Abstract要約: InfoSeekerは、タスク指向の計画と内部のダイナミクスを整合させ、不確実性の下で最適な決定を行う情報を統合する計画フレームワークである。 InfoSeekerは、サンプル効率を犠牲にすることなく、以前のメソッドよりも74%の絶対的なパフォーマンス向上を実現している。これらの知見は、部分的に観測可能な環境での堅牢な行動を求める計画と情報の統合の重要性を浮き彫りにした。
参考スコア（独自算出の注目度）: 4.722684644310843
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Explicit information seeking is essential to human problem-solving in practical environments characterized by incomplete information and noisy dynamics. When the true environmental state is not directly observable, humans seek information to update their internal dynamics and inform future decision-making. Although existing Large Language Model (LLM) planning agents have addressed observational uncertainty, they often overlook discrepancies between their internal dynamics and the actual environment. We introduce Information Seeking Decision Planner (InfoSeeker), an LLM decision-making framework that integrates task-oriented planning with information seeking to align internal dynamics and make optimal decisions under uncertainty in both agent observations and environmental dynamics. InfoSeeker prompts an LLM to actively gather information by planning actions to validate its understanding, detect environmental changes, or test hypotheses before generating or revising task-oriented plans. To evaluate InfoSeeker, we introduce a novel benchmark suite featuring partially observable environments with incomplete observations and uncertain dynamics. Experiments demonstrate that InfoSeeker achieves a 74% absolute performance gain over prior methods without sacrificing sample efficiency. Moreover, InfoSeeker generalizes across LLMs and outperforms baselines on established benchmarks such as robotic manipulation and web navigation. These findings underscore the importance of tightly integrating planning and information seeking for robust behavior in partially observable environments. The project page is available at https://infoseekerllm.github.io
Abstract（参考訳）: 不完全な情報と雑音のダイナミクスを特徴とする実用環境では、人間の問題解決には明示的な情報探索が不可欠である。真の環境状態が直接観察できない場合、人間は内部のダイナミクスを更新し、将来の意思決定を知らせるために情報を求めます。既存のLarge Language Model (LLM) 計画エージェントは観測の不確実性に対処してきたが、しばしば内部力学と実際の環境との相違点を見落としている。本稿では,情報探索決定計画(Information Seeking Decision Planner,InfoSeeker)について紹介する。内部力学の整合性を求める情報とタスク指向計画を統合し,エージェント観測と環境力学の両面において不確実性の下で最適な意思決定を行う。 InfoSeeker は LLM に対して,その理解の検証,環境変化の検出,あるいは仮説の検証を行うための計画行動による情報収集を積極的に行うように促す。 InfoSeekerを評価するために、不完全な観測と不確実なダイナミクスを備えた部分的に観測可能な環境を備えた新しいベンチマークスイートを提案する。実験では、InfoSeekerは、サンプル効率を犠牲にすることなく、以前のメソッドよりも74%の絶対的なパフォーマンス向上を実現している。さらに、InfoSeekerはLLMをまたいで一般化し、ロボット操作やWebナビゲーションといった確立したベンチマークでベースラインを上回ります。これらの知見は、部分的に観測可能な環境での堅牢な行動を求める計画と情報の統合の重要性を浮き彫りにした。プロジェクトページはhttps://infoseekerllm.github.ioで公開されている。

論文の概要: Information Seeking for Robust Decision Making under Partial Observability

関連論文リスト