Fugu-MT 論文翻訳(概要): Neuro-Symbolic Learning for Long-Horizon Task Planning Under Complex Logical Constraints

論文の概要: Neuro-Symbolic Learning for Long-Horizon Task Planning Under Complex Logical Constraints

arxiv url: http://arxiv.org/abs/2606.06877v1
Date: Fri, 05 Jun 2026 03:44:48 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-08 14:33:29.550906
Title: Neuro-Symbolic Learning for Long-Horizon Task Planning Under Complex Logical Constraints
Title（参考訳）: 複雑な論理的制約下での長期タスク計画のためのニューロ・シンボリック学習
Authors: Qiwei Du, Zitong Zhan, Shaoshu Su, Bowen Li, Yi Du, Zhipeng Zhao, Taimeng Fu, Sebastian Scherer, Jiaoyang Li, Chen Wang,
Abstract要約: 最近のニューロシンボリックな手法は、課題非関連オブジェクトに対する対象重要度スコアを学習することで、計画効率を向上させる。並列リカバリ,再スタート,ロールバックを用いて,上位レベルの学習に信頼性と適応的なフィードバックを提供する3R戦略を低レベル計画に導入する。 3つの挑戦的なベンチマークの実験では、失敗率80.04%、計画時間57.14%の削減など、最先端のパフォーマンスが示されている。
参考スコア（独自算出の注目度）: 12.864903019173106
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Task planning often suffers from severe efficiency bottlenecks when robots must reason over long-horizon action sequences under complex logical constraints, including object affordances, spatial relationships, and sequential action dependencies. Recent neuro-symbolic methods improve planning efficiency by learning object-importance scores to prune task-irrelevant objects, but they typically rely on fixed offline supervision generated from full search spaces. This creates a train-test mismatch: at deployment, the planner operates in pruned search spaces induced by the model's own imperfect predictions, leading to exposure bias and degraded planning performance. To address this challenge, we formulate object-importance learning for task planning as an imperative learning-based bilevel optimization problem. The upper level optimizes a neural scorer, while the lower level solves a symbolic planning problem in the score-pruned search space. To stabilize this learning process, we introduce a 3R strategy into the lower-level planning, using parallel Repair, Restart, and Rollback recovery to provide reliable and adaptive feedback for upper-level learning. Experiments on three challenging benchmarks demonstrate state-of-the-art performance, including an 80.04% reduction in failure rate and a 57.14% reduction in planning time. We further validate the framework on a quadruped-based mobile manipulator in simulation and the real world, demonstrating its potential for efficient and deployable neuro-symbolic task planning.
Abstract（参考訳）: タスクプランニングは、オブジェクトの空き時間、空間的関係、シーケンシャルなアクション依存を含む複雑な論理的制約の下で、ロボットが長い水平アクションシーケンスを推論しなければならない場合、深刻な効率のボトルネックに悩まされることが多い。近年のニューロシンボリックな手法は、タスク非関連オブジェクトに対して、オブジェクト重要度スコアを学習することで計画効率を向上させるが、それらは通常、完全な検索空間から生成される固定されたオフライン監視に依存している。デプロイ時に、プランナーはモデル自身の不完全な予測によって誘導されるプルーニングされた検索スペースで動作し、露出バイアスと計画性能が低下する。この課題に対処するために,課題計画のためのオブジェクト指向学習を命令型学習に基づく二段階最適化問題として定式化する。上位レベルはニューラルスコアラを最適化し、下位レベルはスコア処理された検索空間におけるシンボリックプランニング問題を解く。この学習プロセスを安定させるために,並列修復,再スタート,ロールバックによる3R戦略を導入し,高次学習に信頼性と適応的なフィードバックを提供する。 3つの挑戦的なベンチマークの実験では、失敗率80.04%、計画時間57.14%の削減など、最先端のパフォーマンスが示されている。さらに、シミュレーションと実世界における四足歩行型移動マニピュレータの枠組みを検証し、効率よく展開可能なニューロシンボリックタスクプランニングの可能性を示す。

論文の概要: Neuro-Symbolic Learning for Long-Horizon Task Planning Under Complex Logical Constraints

関連論文リスト