Fugu-MT paper translation (abstract): Scalable Trajectory Generation for Whole-Body Mobile Manipulation

Paper overview: Scalable Trajectory Generation for Whole-Body Mobile Manipulation

arxiv url: http://arxiv.org/abs/2604.12565v1
Date: Tue, 14 Apr 2026 10:47:06 GMT
Status: translation complete
Last updated in system: 2026-04-15 19:11:32.398006
Title: Scalable Trajectory Generation for Whole-Body Mobile Manipulation
Title (reference translation): Scalable Trajectory Generation for Whole-Body Mobile Manipulation
Authors: Yida Niu, Xinhai Chang, Xin Liu, Ziyuan Jiao, Yixin Zhu,
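Abstract summary: We introduce AutoMoMa, a GPU-accelerated framework that unifies AKR modeling, which consolidates base, arm, and object kinematics into a single chain, with parallelized trajectory optimization. AutoMoMa produces a dataset of over 500k physically valid trajectories spanning 330 scenes, diverse articulated objects, and multiple robot embodiments.
Reference score (independently computed attention measure): 10.909540204939598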
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: Robots deployed in unstructured environments must coordinate whole-body motion -- simultaneously moving a mobile base and arm -- to interact with the physical world. This coupled mobility and dexterity yields a state space that grows combinatorially with scene and object diversity, demanding datasets far larger than those sufficient for fixed-base manipulation. Yet existing acquisition methods, including teleoperation and planning, are either labor-intensive or computationally prohibitive at scale. The core bottleneck is the lack of a scalable pipeline for generating large-scale, physically valid, coordinated trajectory data across diverse embodiments and environments. Here we introduce AutoMoMa, a GPU-accelerated framework that unifies AKR modeling, which consolidates base, arm, and object kinematics into a single chain, with parallelized trajectory optimization. AutoMoMa achieves 5,000 episodes per GPU-hour (over $80\times$ faster than CPU-based baselines), producing a dataset of over 500k physically valid trajectories spanning 330 scenes, diverse articulated objects, and multiple robot embodiments. Prior datasets were forced to compromise on scale, diversity, or kinematic fidelity; AutoMoMa addresses all three simultaneously. Training downstream IL policies further reveals that even a single articulated-object task requires tens of thousands of demonstrations for SOTA methods to reach $\approx 80\%$ success, confirming that data scarcity -- not algorithmic limitations -- has been the binding constraint. AutoMoMa thus bridges high-performance planning and reliable IL-based control, providing the infrastructure previously missing for coordinated mobile manipulation research. By making large-scale, kinematically valid training data practical, AutoMoMa showcases generalizable whole-body robot policies capable of operating in the diverse, unstructured settings of the real world.
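Abstract (reference translation): Robots deployed in unstructured environments must coordinate whole-body motion (moving a mobile base and an arm at the same time) to interact with the physical world. This coupling of mobility and dexterity yields a state space that grows combinatorially with scene and object diversity, and it demands datasets far larger than those that suffice for fixed-base manipulation. Existing acquisition methods, including teleoperation and planning, are either labor-intensive or computationally prohibitive at scale. The core bottleneck is the lack of a scalable pipeline for generating large-scale, physically valid, coordinated trajectory data across diverse embodiments and environments. This paper introduces AutoMoMa, a GPU-accelerated framework that unifies AKR modeling, which consolidates base, arm, and object kinematics into a single chain, with parallelized trajectory optimization. AutoMoMa achieves 5,000 episodes per GPU-hour (over $80\times$ faster than CPU-based baselines) and produces a dataset of over 500k physically valid trajectories spanning 330 scenes, diverse articulated objects, and multiple robot embodiments. Prior datasets had to compromise on scale, diversity, or kinematic fidelity; AutoMoMa addresses all three at once. Training downstream IL policies further shows that even a single articulated-object task requires tens of thousands of demonstrations for SOTA methods to reach $\approx 80\%$ success, confirming that data scarcity, not algorithmic limitations, has been the binding constraint. AutoMoMa therefore bridges high-performance planning and reliable IL-based control, providing infrastructure that coordinated mobile manipulation research previously lacked. By making large-scale, kinematically valid training data practical, AutoMoMa showcases generalizable whole-body robot policies capable of operating in the diverse, unstructured settings of the real world.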
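
The throughput figures quoted in the abstract can be turned into a rough compute budget. The sketch below is only back-of-the-envelope arithmetic, not anything taken from the paper: it assumes that one episode yields one dataset trajectory, that throughput stays constant, and that a baseline CPU hour is directly comparable to a GPU-hour. The function and constant names are hypothetical.

```python
# Figures quoted in the abstract.
EPISODES_PER_GPU_HOUR = 5_000      # "5,000 episodes per GPU-hour"
SPEEDUP_LOWER_BOUND = 80           # "over 80x faster than CPU-based baselines"
DATASET_TRAJECTORIES = 500_000     # "over 500k physically valid trajectories"


def gpu_hours_needed(trajectories: int, rate: float = EPISODES_PER_GPU_HOUR) -> float:
    """GPU-hours to generate `trajectories`, ASSUMING one trajectory per episode."""
    return trajectories / rate


def baseline_rate_upper_bound(rate: float = EPISODES_PER_GPU_HOUR,
                              speedup: float = SPEEDUP_LOWER_BOUND) -> float:
    """Episodes per hour the CPU baseline can reach at most, given the stated speedup."""
    return rate / speedup


def baseline_hours_lower_bound(trajectories: int) -> float:
    """Hours the CPU baseline would need at least for the same number of trajectories."""
    return trajectories / baseline_rate_upper_bound()


if __name__ == "__main__":
    print(gpu_hours_needed(DATASET_TRAJECTORIES))            # 100.0
    print(baseline_rate_upper_bound())                       # 62.5
    print(baseline_hours_lower_bound(DATASET_TRAJECTORIES))  # 8000.0
```

Under these assumptions, 500k trajectories correspond to about 100 GPU-hours, while a baseline that is 80 times slower would need about 8,000 hours. Because the abstract says "over 500k" and "over 80x", both numbers are lower bounds.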
