Fugu-MT 論文翻訳(概要): CLAW: Composable Language-Annotated Whole-body Motion Generation

論文の概要: CLAW: Composable Language-Annotated Whole-body Motion Generation

arxiv url: http://arxiv.org/abs/2604.11251v2
Date: Tue, 14 Apr 2026 04:29:42 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-15 14:01:13.437006
Title: CLAW: Composable Language-Annotated Whole-body Motion Generation
Title（参考訳）: CLAW: 構成可能な言語アノテーション付き全身運動生成
Authors: Jianuo Cao, Yuxin Chen, Masayoshi Tomizuka,
Abstract要約: ヒューマノイドロボットのための言語条件付き全身制御装置の訓練には、動き軌跡と自然言語記述とをペアリングする大規模なデータセットが必要である。我々は,Unitree G1ヒューマノイドロボットのための対話型WebベースパイプラインであるCLAWを提案する。
参考スコア（独自算出の注目度）: 55.99805728566105
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Training language-conditioned whole-body controllers for humanoid robots requires large-scale datasets pairing motion trajectories with natural-language descriptions. Existing approaches based on motion capture are costly and limited in diversity, while text-to-motion generative models produce purely kinematic outputs that are not guaranteed to be physically feasible. Therefore, we present CLAW, an interactive web-based pipeline for scalable generation of language-annotated whole-body motion data for the Unitree G1 humanoid robot. CLAW treats the motion modes of a kinematic planner as composable building blocks, each parameterized by movement, heading, speed, pelvis height and duration, and provides two browser-based interfaces -- a real-time keyboard mode and a timeline-based sequence editor -- for exploratory and batch data collection. A low-level whole-body controller tracks the planner's kinematic references in MuJoCo simulation, producing physically grounded trajectories recorded at 50Hz. Simultaneously, a deterministic template-based annotation engine generates diverse natural-language descriptions at multiple stylistic registers for every segment and for the full trajectory. We release the system as open source to support scalable generation of language-motion paired data for humanoid robot learning.
Abstract（参考訳）: ヒューマノイドロボットのための言語条件付き全身制御装置の訓練には、動き軌跡と自然言語記述とをペアリングする大規模なデータセットが必要である。既存のモーションキャプチャーに基づくアプローチは、コストがかかり多様性が制限される一方、テキスト・トゥ・モーション生成モデルは、物理的に実現可能であることが保証されていない純粋にキネマティックな出力を生成する。そこで本研究では,Unitree G1ヒューマノイドロボットのための対話型WebベースパイプラインであるCLAWを提案する。 CLAWは、キネマティックプランナーの動作モードを構成可能なビルディングブロックとして扱い、それぞれが動き、方向、速度、骨盤の高さと持続時間によってパラメータ化され、2つのブラウザベースのインターフェース – リアルタイムキーボードモードとタイムラインベースのシーケンスエディタ – を探索データとバッチデータ収集用に提供する。低レベルのボディコントローラは、MuJoCoシミュレーションでプランナーのキネマティック参照を追跡し、50Hzで記録された物理的に接地された軌道を生成する。同時に、決定論的テンプレートベースのアノテーションエンジンは、各セグメントと全行に対して複数のスタイリスティックレジスタで様々な自然言語記述を生成する。我々はこのシステムをオープンソースとしてリリースし、ヒューマノイドロボット学習のためのスケーラブルな言語運動ペアデータ生成をサポートする。

論文の概要: CLAW: Composable Language-Annotated Whole-body Motion Generation

関連論文リスト