Fugu-MT 論文翻訳(概要): Optimizing Trajectory-Trees in Belief Space: An Application from Model Predictive Control to Task and Motion Planning

論文の概要: Optimizing Trajectory-Trees in Belief Space: An Application from Model Predictive Control to Task and Motion Planning

arxiv url: http://arxiv.org/abs/2605.01860v1
Date: Sun, 03 May 2026 13:06:20 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-05 20:33:49.969978
Title: Optimizing Trajectory-Trees in Belief Space: An Application from Model Predictive Control to Task and Motion Planning
Title（参考訳）: 空間における軌道軌道の最適化:モデル予測制御からタスク・運動計画への応用
Authors: Camille Phiquepal, Marc Toussaint,
Abstract要約: 本稿では,部分的に観測可能なロボット計画問題に対して,一般的な逐次的トラジェクトリではなく,アーボラキシートラジェクトリ(トラジェクトリ木)の利点を考察する。システムの1つの前方進化をモデル化するシーケンシャルなトラジェクトリとは異なり、トラジェクトリツリーは複数の可能なコンティニュエーションをキャプチャする。
参考スコア（独自算出の注目度）: 12.236092368066332
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: This paper explores the benefits of computing arborescent trajectories (trajectory-trees) instead of commonly used sequential trajectories for partially observable robotic planning problems. In such environments, a robot infers knowledge from observations, and the optimal course of action depends on these observations. \revise{Trajectory-trees, optimized in belief space, naturally capture this dependency by branching where the belief state is expected to evolve into multiple distinct scenarios, such as upon receiving an observation. Unlike sequential trajectories, which model a single forward evolution of the system, trajectory-trees capture multiple possible contingencies.} First, we focus on Model Predictive Control (MPC) and demonstrate the benefits of planning tree-like trajectories. We formulate the control problem as the optimization of a tree with a single branching (PO-MPC). This improves performance by reducing control costs through more informed planning. To satisfy the real-time constraints of MPC, we develop an optimization algorithm called Distributed Augmented Lagrangian (D-AuLa), which leverages the decomposability of the PO-MPC formulation to parallelize and accelerate the optimization. We apply the method to both linear and non-linear MPC problems using autonomous driving examples. Second, we address Task And Motion Planning (TAMP), and introduce a planner (PO-LGP) reasoning on decision trees at task level, and trajectory-trees at motion-planning level. This approach builds upon the Logic-Geometric-Programming Framework (LGP) and extends it to partially observable problems. The experiments show the method's applicability to problems with a small belief state size, and scales to larger problems by optimizing explorative policies, which are used as macro-actions in an overarching task plan.
Abstract（参考訳）: 本稿では,部分的に観測可能なロボット計画問題に対して,一般的な逐次的トラジェクトリではなく,アーボラキシートラジェクトリ(トラジェクトリ木)の利点を考察する。このような環境では、ロボットは観察から知識を推測し、最適な行動経路はこれらの観察に依存する。信念空間に最適化された \revise{Trajectory-trees は、信念状態が観察を受けるなど、複数の異なるシナリオに進化することが期待される分岐によって、この依存を自然に捉えている。システムの1つの前方進化をモデル化するシーケンシャルなトラジェクトリとは異なり、トラジェクトリツリーは複数の可能なコンティニュエーションをキャプチャする。まず,モデル予測制御(MPC)に注目し,木のような軌道計画の利点を実証する。一つの分岐(PO-MPC)を持つ木の最適化として制御問題を定式化する。これにより、より詳細な計画を立てることで、制御コストを削減することでパフォーマンスが向上する。 MPCのリアルタイム制約を満たすために,PO-MPC定式化の分解性を活用し,最適化の並列化と高速化を図る分散拡張ラグランジアン (D-AuLa) と呼ばれる最適化アルゴリズムを開発した。本手法を自律運転例を用いて線形および非線形MPC問題に適用する。第2に、タスク・アンド・モーション・プランニング(TAMP)に取り組み、タスクレベルでの決定木を推論するプランナー(PO-LGP)と、動作計画レベルでの軌道木を導入する。このアプローチは、Logic-Geometric-Programming Framework (LGP)の上に構築され、部分的に観察可能な問題にまで拡張される。実験では,提案手法の適用可能性を示すとともに,大規模タスク計画におけるマクロアクションとして使用される爆発的ポリシを最適化することにより,より大きな問題にスケールする。

論文の概要: Optimizing Trajectory-Trees in Belief Space: An Application from Model Predictive Control to Task and Motion Planning

関連論文リスト