Fugu-MT Paper Translation (Abstract): MonoDuo: Using One Robot Arm to Learn Bimanual Policies

Paper overview: MonoDuo: Using One Robot Arm to Learn Bimanual Policies

arxiv url: http://arxiv.org/abs/2605.29298v1
Date: Thu, 28 May 2026 03:27:38 GMT
Status: Translation complete
Last updated in system: 2026-05-30 02:45:55.629441
Title: MonoDuo: Using One Robot Arm to Learn Bimanual Policies
Authors: Sandeep Bajamahal, Lawrence Yunliang Chen, Toru Lin, Zehan Ma, Jitendra Malik, Ken Goldberg
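Abstract summary: We present MonoDuo, a framework for learning bimanual manipulation policies using single-arm robot demonstrations paired with human collaboration. MonoDuo teleoperates a single-arm robot to perform one side of a bimanual task. We evaluate MonoDuo on five tasks: box lifting, backpack packing, cloth folding, jacket zipping, and plate handover.
Reference score (independently computed attention score): 40.1404286878975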
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Bimanual coordination is essential for many real-world manipulation tasks, yet learning bimanual robot policies is limited by the scarcity of bimanual robots and datasets. Single-arm robots, however, are widely available in research labs. Can we leverage them to train bimanual robot policies? We present MonoDuo, a framework for learning bimanual manipulation policies using single-arm robot demonstrations paired with human collaboration. MonoDuo collects data by teleoperating a single-arm robot to perform one side of a bimanual task while a human performs the other, then swapping roles to cover both sides. RGB-D observations from a wrist-mounted and fixed camera are augmented into synthetic demonstrations for target bimanual robots using state-of-the-art hand pose estimation, image and point cloud segmentation, and inpainting. These synthetic demonstrations, grounded in real robot kinematics, are used to train bimanual policies. We evaluate MonoDuo on five tasks: box lifting, backpack packing, cloth folding, jacket zipping, and plate handover. Compared to approaches relying solely on human bimanual videos, MonoDuo enables zero-shot deployment on unseen bimanual robot configurations, achieving success rates up to 70%. With only 25 target robot demonstrations, few-shot finetuning further boosts success rates by 65-70% over training from scratch, demonstrating MonoDuo's effectiveness in efficiently transferring knowledge from single-arm robot data to bimanual robot policies.

Related papers