Fugu-MT paper translation (abstract): Learning Diffusion Policy from Primitive Skills for Robot Manipulation

Paper summary: Learning Diffusion Policy from Primitive Skills for Robot Manipulation

arxiv url: http://arxiv.org/abs/2601.01948v1
Date: Mon, 05 Jan 2026 09:56:24 GMT
Status: translation complete
Last updated in system: 2026-03-23 08:17:40.632827
Title: Learning Diffusion Policy from Primitive Skills for Robot Manipulation
Title (reference translation): Learning a Diffusion Policy from Primitive Skills for Robot Manipulation
Authors: Zhihao Gu, Ming Yang, Difan Zou, Dong Xu
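Abstract summary: Diffusion policies (DP) have recently shown great promise for generating actions in robotic manipulation. This paper proposes SDP, a skill-conditioned DP that integrates interpretable skill learning with conditional action planning.
Reference score (independently computed attention score): 36.95867683028485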
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Diffusion policies (DP) have recently shown great promise for generating actions in robotic manipulation. However, existing approaches often rely on global instructions to produce short-term control signals, which can result in misalignment in action generation. We conjecture that primitive skills, that is, fine-grained, short-horizon manipulations such as "move up" and "open the gripper", provide a more intuitive and effective interface for robot learning. To bridge this gap, we propose SDP, a skill-conditioned DP that integrates interpretable skill learning with conditional action planning. SDP abstracts eight reusable primitive skills across tasks and employs a vision-language model to extract discrete representations from visual observations and language instructions. Based on these representations, a lightweight router network assigns a desired primitive skill to each state, which helps construct a single-skill policy that generates skill-aligned actions. By decomposing complex tasks into a sequence of primitive skills and selecting a single-skill policy, SDP ensures skill-consistent behavior across diverse tasks. Extensive experiments on two challenging simulation benchmarks and real-world robot deployments demonstrate that SDP consistently outperforms state-of-the-art (SOTA) methods, providing a new paradigm for skill-based robot learning with diffusion policies.
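Abstract (reference translation): Diffusion policies (DP) have recently shown great promise for generating actions in robotic manipulation. However, existing approaches often depend on global instructions to generate short-term control signals, and this can lead to misaligned action generation. We conjecture that primitive skills, meaning fine-grained, short-horizon manipulations such as "move up" and "open the gripper", offer a more intuitive and effective interface for robot learning. To bridge this gap, we propose SDP, a skill-conditioned DP that integrates interpretable skill learning with conditional action planning. SDP abstracts eight primitive skills that are reusable across tasks and uses a vision-language model to extract discrete representations from visual observations and language instructions. Based on these, a lightweight router network is designed to assign the desired primitive skill to each state. By decomposing complex tasks into a sequence of primitive skills and selecting a single-skill policy, SDP ensures skill-consistent behavior across diverse tasks. Extensive experiments on two challenging simulation benchmarks and on real-world robot deployments show that SDP consistently outperforms SOTA methods, providing a new paradigm for skill-based robot learning with diffusion policies.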
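
The routing idea in the abstract (one primitive skill assigned per state, then a single-skill policy producing the action) can be illustrated with a small toy sketch. This is not the authors' implementation. The linear `Router`, the feature vector, and the `skill_conditioned_action` stand-in are all assumptions made for illustration, and only "move up" and "open the gripper" are skill names taken from the abstract; the remaining six are placeholders because the abstract does not list them.

```python
import math
import random

# "move up" and "open the gripper" come from the abstract; the other six
# names are placeholders, since the abstract does not list all eight skills.
SKILLS = ["move up", "open the gripper"] + [
    f"placeholder_skill_{i}" for i in range(2, 8)
]


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


class Router:
    """Hypothetical linear router: state features -> one skill index."""

    def __init__(self, feature_dim, num_skills, seed=0):
        rng = random.Random(seed)
        # Random weights stand in for a trained router network.
        self.weights = [
            [rng.uniform(-1.0, 1.0) for _ in range(feature_dim)]
            for _ in range(num_skills)
        ]

    def probabilities(self, features):
        logits = [sum(w * f for w, f in zip(row, features)) for row in self.weights]
        return softmax(logits)

    def assign(self, features):
        # Pick exactly one skill per state (argmax over skill probabilities).
        probs = self.probabilities(features)
        return max(range(len(probs)), key=probs.__getitem__)


def skill_conditioned_action(skill_index, action_dim=7, steps=10, seed=0):
    """Stand-in for a skill-conditioned denoiser (not a real diffusion model).

    Starts from Gaussian noise and pulls it step by step toward a fixed
    per-skill target, only to show where the skill index enters.
    """
    rng = random.Random(seed)
    target = [math.sin(skill_index + d) for d in range(action_dim)]
    action = [rng.gauss(0.0, 1.0) for _ in range(action_dim)]
    for _ in range(steps):
        action = [a + 0.5 * (t - a) for a, t in zip(action, target)]
    return action


router = Router(feature_dim=4, num_skills=len(SKILLS))
features = [0.2, -0.1, 0.7, 0.05]  # made-up stand-in for VLM-derived features
skill = router.assign(features)
action = skill_conditioned_action(skill)
print(SKILLS[skill], [round(a, 3) for a in action])
```

In this sketch the hard argmax is what makes the downstream policy "single-skill": the action generator only ever sees one skill index per state. A trained system would replace the random weights and the toy update loop with learned networks.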

Related papers