Fugu-MT 論文翻訳(概要): RoTri-Diff: A Spatial Robot-Object Triadic Interaction-Guided Diffusion Model for Bimanual Manipulation

論文の概要: RoTri-Diff: A Spatial Robot-Object Triadic Interaction-Guided Diffusion Model for Bimanual Manipulation

arxiv url: http://arxiv.org/abs/2603.07165v1
Date: Sat, 07 Mar 2026 12:12:35 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-23 08:17:42.04283
Title: RoTri-Diff: A Spatial Robot-Object Triadic Interaction-Guided Diffusion Model for Bimanual Manipulation
Title（参考訳）: RoTri-Diff:双方向マニピュレーションのための空間ロボット-物体相互作用誘導拡散モデル
Authors: Zixuan Chen, Nga Teng Chan, Yiwen Hou, Chenrui Tie, Zixuan Liu, Haonan Chen, Junting Chen, Jieqi Shi, Yang Gao, Jing Huo, Lin Shao,
Abstract要約: 両腕操作は、両腕間の連続的かつ正確な調整を必要とする基本的なロボット技術である。既存のアプローチでは、ロボット中心かオブジェクト中心かは、しばしば2つの腕と操作対象の間の動的幾何学的関係を見落としている。この制限は、しばしば兵器間衝突、不安定な把握、複雑なタスクにおける性能低下につながる。本稿では,RoTri制約とロボットのキーポーズとオブジェクトの動きを組み合わせた拡散型模倣学習フレームワークであるRoTri-Diffを紹介する。
参考スコア（独自算出の注目度）: 44.32374761780659
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Bimanual manipulation is a fundamental robotic skill that requires continuous and precise coordination between two arms. While imitation learning (IL) is the dominant paradigm for acquiring this capability, existing approaches, whether robot-centric or object-centric, often overlook the dynamic geometric relationship among the two arms and the manipulated object. This limitation frequently leads to inter-arm collisions, unstable grasps, and degraded performance in complex tasks. To address this, in this paper we explicitly models the Robot-Object Triadic Interaction (RoTri) representation in bimanual systems, by encoding the relative 6D poses between the two arms and the object to capture their spatial triadic relationship and establish continuous triangular geometric constraints. Building on this, we further introduce RoTri-Diff, a diffusion-based imitation learning framework that combines RoTri constraints with robot keyposes and object motion in a hierarchical diffusion process. This enables the generation of stable, coordinated trajectories and robust execution across different modes of bimanual manipulation. Extensive experiments show that our approach outperforms state-of-the-art baselines by 10.2% on 11 representative RLBench2 tasks and achieves stable performance on 4 challenging real-world bimanual tasks. Project website: https://rotri-diff.github.io/.
Abstract（参考訳）: 両腕操作は、両腕間の連続的かつ正確な調整を必要とする基本的なロボット技術である。模倣学習(IL)は、この能力を獲得するための主要なパラダイムであるが、ロボット中心であれオブジェクト中心であれ、既存のアプローチは、しばしば2つの腕と操作対象の間の動的幾何学的関係を見落としている。この制限は、しばしば兵器間衝突、不安定な把握、複雑なタスクにおける性能低下につながる。そこで本論文では,両腕と物体間の相対的な6Dポーズを符号化し,それらの空間的3次元関係を捕捉し,連続的な三角形の幾何学的制約を定式化することにより,ロボット・オブジェクトの3次元インタラクション(RoTri)表現をバイマンシステムで明示的にモデル化する。さらに,RoTri-Diffという拡散に基づく模倣学習フレームワークを導入し,ロボットのキーポーズとオブジェクトの動きを階層的拡散プロセスで組み合わせた。これにより、安定な座標軌道の生成と、バイマニュアル操作の異なるモードでの堅牢な実行が可能になる。大規模な実験により,11のRLBench2タスクに対して,最先端のベースラインを10.2%向上させ,実世界の4つの課題に対して安定した性能を実現することができた。プロジェクトウェブサイト: https://rotri-diff.github.io/.com

論文の概要: RoTri-Diff: A Spatial Robot-Object Triadic Interaction-Guided Diffusion Model for Bimanual Manipulation

関連論文リスト