Fugu-MT 論文翻訳(概要): SuperSuit: An Isomorphic Bimodal Interface for Scalable Mobile Manipulation

論文の概要: SuperSuit: An Isomorphic Bimodal Interface for Scalable Mobile Manipulation

arxiv url: http://arxiv.org/abs/2603.06280v1
Date: Fri, 06 Mar 2026 13:40:30 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-23 08:17:41.992946
Title: SuperSuit: An Isomorphic Bimodal Interface for Scalable Mobile Manipulation
Title（参考訳）: SuperSuit: スケーラブルなモバイル操作のための同型バイモーダルインタフェース
Authors: Tongqing Chen, Hang Wu, Jiasen Wang, Xiaotao Li, Zhu Jin, Lu Fang,
Abstract要約: ロボット・イン・ザ・ループの遠隔操作とアクティブなデモンストレーションの両方をサポートするバイモーダルデータ取得フレームワークである textbfSuperSuit を,共有キネマティックインタフェース下で提供する。長距離移動操作タスクにおける実世界実験では、遠隔操作ベースラインと比較してアクティブモードでの2.6$times$高いデモスループット、固定データセットサイズでのアクティブなデモンストレーションによる遠隔操作データ置換時のポリシー性能、アクティブなデータボリュームの増加に伴うモノトニックパフォーマンスの向上が示されている。
参考スコア（独自算出の注目度）: 8.367600706539774
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: High-quality, long-horizon demonstrations are essential for embodied AI, yet acquiring such data for tightly coupled wheeled mobile manipulators remains a fundamental bottleneck. Unlike fixed-base systems, mobile manipulators require continuous coordination between $SE(2)$ locomotion and precise manipulation, exposing limitations in existing teleoperation and wearable interfaces. We present \textbf{SuperSuit}, a bimodal data acquisition framework that supports both robot-in-the-loop teleoperation and active demonstration under a shared kinematic interface. Both modalities produce structurally identical joint-space trajectories, enabling direct data mixing without modifying downstream policies. For locomotion, SuperSuit maps natural human stepping to continuous planar base velocities, eliminating discrete command switches. For manipulation, it employs a strictly isomorphic wearable arm in both modes, while policy training is formulated in a shift-invariant delta-joint representation to mitigate calibration offsets and structural compliance without inverse kinematics. Real-world experiments on long-horizon mobile manipulation tasks show 2.6$\times$ higher demonstration throughput in active mode compared to a teleoperation baseline, comparable policy performance when substituting teleoperation data with active demonstrations at fixed dataset size, and monotonic performance improvement as active data volume increases. These results indicate that consistent kinematic representations across collection modalities enable scalable data acquisition for long-horizon mobile manipulation.
Abstract（参考訳）: 高品質で長期にわたるデモンストレーションは、AIを具現化する上で不可欠だが、タイトに結合した移動マニピュレータのためにそのようなデータを取得することは、依然として基本的なボトルネックである。固定ベースシステムとは異なり、移動マニピュレータは$SE(2)$の移動と正確な操作を連続的に調整する必要がある。ロボット・イン・ザ・ループの遠隔操作とアクティブなデモンストレーションの両方をサポートするバイモーダルデータ取得フレームワークである \textbf{SuperSuit} を,共有キネマティックインタフェース下で提供する。両方のモダリティは構造的に同一の結合空間軌道を生成し、下流のポリシーを変更することなく直接データ混合を可能にする。移動のために、SuperSuitは自然の人間の歩数を連続した平面基底速度にマッピングし、個別のコマンドスイッチを除去する。操作には両モードで厳密な同型ウェアラブルアームを使用し、ポリシートレーニングはシフト不変のデルタ接合表現で定式化され、キャリブレーションオフセットと構造コンプライアンスを逆運動学なしで緩和する。長距離移動操作タスクにおける実世界実験では、遠隔操作ベースラインと比較してアクティブモードでの2.6$\times$高いデモスループット、固定データセットサイズでのアクティブなデモンストレーションによる遠隔操作データ置換時のポリシー性能、アクティブなデータボリュームの増加に伴うモノトニックパフォーマンスの向上が示されている。これらの結果は,コレクションモダリティ間の一貫したキネマティック表現が,長期移動操作のためのスケーラブルなデータ取得を可能にすることを示唆している。

論文の概要: SuperSuit: An Isomorphic Bimodal Interface for Scalable Mobile Manipulation

関連論文リスト