Fugu-MT 論文翻訳(概要): Scaling Cross-Embodied Learning: One Policy for Manipulation, Navigation, Locomotion and Aviation

論文の概要: Scaling Cross-Embodied Learning: One Policy for Manipulation, Navigation, Locomotion and Aviation

arxiv url: http://arxiv.org/abs/2408.11812v1
Date: Wed, 21 Aug 2024 17:57:51 GMT
ステータス: 翻訳完了
システム内更新日: 2024-08-22 15:58:35.754968
Title: Scaling Cross-Embodied Learning: One Policy for Manipulation, Navigation, Locomotion and Aviation
Title（参考訳）: クロス・エボディード・ラーニングのスケーリング: 操作・ナビゲーション・ロコモーション・航空のための一つのポリシー
Authors: Ria Doshi, Homer Walke, Oier Mees, Sudeep Dasari, Sergey Levine,
Abstract要約: さまざまな種類のロボットにまたがって単一のポリシーを訓練することによって、ロボット学習はより広範囲で多様なデータセットを活用することができる。そこで我々はCrossFormerを提案する。CrossFormerはスケーラブルでフレキシブルなトランスフォーマーベースのポリシーで、どんな実施形態からでもデータを消費できる。我々は、同じネットワークウェイトがシングルアームとデュアルアームの操作システム、車輪付きロボット、クワッドコプター、四足歩行など、非常に異なるロボットを制御できることを実証した。
参考スコア（独自算出の注目度）: 49.03165169369552
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Modern machine learning systems rely on large datasets to attain broad generalization, and this often poses a challenge in robot learning, where each robotic platform and task might have only a small dataset. By training a single policy across many different kinds of robots, a robot learning method can leverage much broader and more diverse datasets, which in turn can lead to better generalization and robustness. However, training a single policy on multi-robot data is challenging because robots can have widely varying sensors, actuators, and control frequencies. We propose CrossFormer, a scalable and flexible transformer-based policy that can consume data from any embodiment. We train CrossFormer on the largest and most diverse dataset to date, 900K trajectories across 20 different robot embodiments. We demonstrate that the same network weights can control vastly different robots, including single and dual arm manipulation systems, wheeled robots, quadcopters, and quadrupeds. Unlike prior work, our model does not require manual alignment of the observation or action spaces. Extensive experiments in the real world show that our method matches the performance of specialist policies tailored for each embodiment, while also significantly outperforming the prior state of the art in cross-embodiment learning.
Abstract（参考訳）: 現代の機械学習システムは、広範な一般化を達成するために大規模なデータセットに依存しており、ロボット学習において、各ロボットプラットフォームとタスクが小さなデータセットしか持たないという課題をしばしば生じさせる。さまざまな種類のロボットにまたがって単一のポリシーを訓練することによって、ロボット学習はより広範囲で多様なデータセットを活用することができる。しかし、ロボットは広い範囲のセンサー、アクチュエータ、制御周波数を持つことができるため、マルチロボットデータに対する単一ポリシーのトレーニングは困難である。そこで我々はCrossFormerを提案する。CrossFormerはスケーラブルでフレキシブルなトランスフォーマーベースのポリシーで、どんな実施形態からでもデータを消費できる。これまでで最大かつ最も多様なデータセットでCrossFormerをトレーニングしています。我々は、同じネットワークウェイトがシングルアームとデュアルアームの操作システム、車輪付きロボット、クワッドコプター、四足歩行など、非常に異なるロボットを制御できることを実証した。従来の作業とは異なり、我々のモデルは観察空間や行動空間を手動でアライメントする必要がない。実世界における大規模な実験により,本手法は各実施形態に合わせて調整された専門的政策のパフォーマンスと一致し,また,クロス・エボデーメント・ラーニングにおける先行技術よりも著しく優れていた。

論文の概要: Scaling Cross-Embodied Learning: One Policy for Manipulation, Navigation, Locomotion and Aviation

関連論文リスト