Fugu-MT 論文翻訳(概要): HiWET: Hierarchical World-Frame End-Effector Tracking for Long-Horizon Humanoid Loco-Manipulation

論文の概要: HiWET: Hierarchical World-Frame End-Effector Tracking for Long-Horizon Humanoid Loco-Manipulation

arxiv url: http://arxiv.org/abs/2602.06341v1
Date: Fri, 06 Feb 2026 03:11:56 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-23 08:17:41.219373
Title: HiWET: Hierarchical World-Frame End-Effector Tracking for Long-Horizon Humanoid Loco-Manipulation
Title（参考訳）: HiWET: 長軸ヒューマノイドロコマニピュレーションのための階層的世界フレームエンドエフェクタトラッキング
Authors: Zhanxiang Cao, Liyun Yan, Yang Zhang, Sirui Chen, Jianming Ma, Tianyue Zhan, Shengcheng Fu, Yufei Jia, Cewu Lu, Yue Gao,
Abstract要約: 我々は、ヒューマノイドのロコ操作におけるエンドエフェクタトラッキングとしてこの問題を再考する。動的実行からグローバルな推論を分離する階層的強化学習フレームワークであるHiWETを提案する。我々は,HuWETが長期のワールドフレームタスクにおいて,高精度かつ安定したエンドエフェクタトラッキングを実現することを示す。
参考スコア（独自算出の注目度）: 43.43128572722804
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Humanoid loco-manipulation requires executing precise manipulation tasks while maintaining dynamic stability amid base motion and impacts. Existing approaches typically formulate commands in body-centric frames, fail to inherently correct cumulative world-frame drift induced by legged locomotion. We reformulate the problem as world-frame end-effector tracking and propose HiWET, a hierarchical reinforcement learning framework that decouples global reasoning from dynamic execution. The high-level policy generates subgoals that jointly optimize end-effector accuracy and base positioning in the world frame, while the low-level policy executes these commands under stability constraints. We introduce a Kinematic Manifold Prior (KMP) that embeds the manipulation manifold into the action space via residual learning, reducing exploration dimensionality and mitigating kinematically invalid behaviors. Extensive simulation and ablation studies demonstrate that HiWET achieves precise and stable end-effector tracking in long-horizon world-frame tasks. We validate zero-shot sim-to-real transfer of the low-level policy on a physical humanoid, demonstrating stable locomotion under diverse manipulation commands. These results indicate that explicit world-frame reasoning combined with hierarchical control provides an effective and scalable solution for long-horizon humanoid loco-manipulation.
Abstract（参考訳）: ヒューマノイド・ロコ・マニピュレーションは、基礎運動と衝撃の中で動的安定性を維持しながら、正確な操作タスクを実行する必要がある。既存のアプローチは通常、体中心のフレームでコマンドを定式化するが、足の移動によって引き起こされる累積的な世界フレームのドリフトを本質的に補正することができない。本研究では,グローバルな推論を動的実行から切り離す階層的な強化学習フレームワークであるHiWETを提案する。高レベルポリシーは、世界のフレームにおけるエンドエフェクタの精度とベース位置を共同で最適化するサブゴールを生成し、低レベルポリシーは安定性の制約の下でこれらのコマンドを実行する。 KMP(Kinematic Manifold Prior)を導入し、操作多様体を残留学習により動作空間に埋め込み、探索次元を減らし、運動論的に無効な振る舞いを緩和する。大規模なシミュレーションとアブレーション研究により、HiWETは長距離世界フレームタスクにおいて高精度で安定したエンドエフェクタトラッキングを実現することが示された。低レベルのポリシーを物理的ヒューマノイド上でゼロショット・シム・トゥ・リアルに転送し、多様な操作コマンドの下で安定な移動を示す。これらの結果から, 階層制御を併用した明示的世界枠推論が, 長軸ヒューマノイドのロコ操作に有効かつスケーラブルな解をもたらすことが示唆された。

論文の概要: HiWET: Hierarchical World-Frame End-Effector Tracking for Long-Horizon Humanoid Loco-Manipulation

関連論文リスト