Fugu-MT 論文翻訳(概要): PALM: Progress-Aware Policy Learning via Affordance Reasoning for Long-Horizon Robotic Manipulation

論文の概要: PALM: Progress-Aware Policy Learning via Affordance Reasoning for Long-Horizon Robotic Manipulation

arxiv url: http://arxiv.org/abs/2601.07060v1
Date: Sun, 11 Jan 2026 21:00:58 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-23 08:17:40.734977
Title: PALM: Progress-Aware Policy Learning via Affordance Reasoning for Long-Horizon Robotic Manipulation
Title（参考訳）: PALM:長軸ロボットマニピュレーションのためのAffordance Reasoningによるプログレッシブ・アウェア・ポリシー学習
Authors: Yuanzhe Liu, Jingyuan Zhu, Yuchen Mo, Gen Li, Xu Cao, Jin Jin, Yifan Shen, Zhengyuan Li, Tianjiao Yu, Wenzhen Yuan, Fangqiang Ding, Ismini Lourentzou,
Abstract要約: PALMは、インタラクション中心のアベイランス推論とサブタスクプログレスキューに関するポリシー学習を構築する。 Palmはシミュレーションや実世界の実験において、一貫してベースラインを上回っている。
参考スコア（独自算出の注目度）: 27.791908160098625
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Recent advancements in vision-language-action (VLA) models have shown promise in robotic manipulation, yet they continue to struggle with long-horizon, multi-step tasks. Existing methods lack internal reasoning mechanisms that can identify task-relevant interaction cues or track progress within a subtask, leading to critical execution errors such as repeated actions, missed steps, and premature termination. To address these challenges, we introduce PALM, a VLA framework that structures policy learning around interaction-centric affordance reasoning and subtask progress cues. PALM distills complementary affordance representations that capture object relevance, contact geometry, spatial placements, and motion dynamics, and serve as task-relevant anchors for visuomotor control. To further stabilize long-horizon execution, PALM predicts continuous within-subtask progress, enabling seamless subtask transitions. Across extensive simulation and real-world experiments, PALM consistently outperforms baselines, achieving a 91.8% success rate on LIBERO-LONG, a 12.5% improvement in average length on CALVIN ABC->D, and a 2x improvement over real-world baselines across three long-horizon generalization settings.
Abstract（参考訳）: 視覚言語アクション(VLA)モデルの最近の進歩はロボット操作において有望であることを示しているが、長い水平・多段階のタスクに苦戦し続けている。既存のメソッドには、タスク関連インタラクションのキューを識別したり、サブタスク内で進捗を追跡できる内部推論機構が欠如しており、繰り返しアクションや失敗ステップ、早期終了といった致命的な実行エラーにつながる。これらの課題に対処するために,対話中心のアベイランス推論とサブタスクプログレスキューを中心とした政策学習を構築するVLAフレームワークであるPALMを紹介した。 PALMは、オブジェクトの関連性、接触幾何学、空間配置、運動力学をキャプチャする補完的な空白表現を蒸留し、ビジュモータ制御のためのタスク関連アンカーとして機能する。長期実行をさらに安定させるために、PALMは連続的なサブタスク内進行を予測し、シームレスなサブタスク遷移を可能にする。 CALVIN ABC->Dの平均長を12.5%改善し、3つの長距離一般化設定で現実世界のベースラインを2倍改善した。

論文の概要: PALM: Progress-Aware Policy Learning via Affordance Reasoning for Long-Horizon Robotic Manipulation

関連論文リスト