Fugu-MT 論文翻訳(概要): HumanoidArena: Benchmarking Egocentric Hierarchical Whole-body Learning

論文の概要: HumanoidArena: Benchmarking Egocentric Hierarchical Whole-body Learning

arxiv url: http://arxiv.org/abs/2606.17833v1
Date: Tue, 16 Jun 2026 12:01:31 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-17 17:15:32.416975
Title: HumanoidArena: Benchmarking Egocentric Hierarchical Whole-body Learning
Title（参考訳）: HumanoidArena:Egocentric Hierarchical Whole-body Learningのベンチマーク
Authors: Taowen Wang, Zikang Xie, Bin Yang, Yunheng Wang, Zizhao Yuan, Yuetong Fang, Yixiao Feng, Yichi Wang, Xingyu Chen, Haodong Chen, Qiwei Wu, Weisheng Xu, Lihan Chen, Lusong Li, Zecui Zeng, Renjing Xu,
Abstract要約: 我々は,自己中心型階層型全身学習のためのシミュレーションファーストベンチマークであるHumanoidArenaを紹介する。高レベルのポリシーは、エゴセントリックなビジョン、プロプレセプション、命令をコンパクトな全身アクションに変換し、その後低レベルのGMTによって実行される。実験により、階層制御により、学習したポリシーが多様な脚クリティカルな相互作用を解決できることが示されているが、性能はトラッカー条件が強く、GMT間移動は脆弱である。
参考スコア（独自算出の注目度）: 38.43380992093738
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: Humanoid robots promise whole-body interaction in human-centered environments, but scalable policy learning remains difficult because task-level decision-making and whole-body dynamic execution are tightly coupled. A practical solution is hierarchical control, where a high-level policy predicts intermediate whole-body actions and low-level general motion trackers (GMTs) execute them as stable humanoid motion. However, existing benchmarks rarely evaluate the policy-tracker interface itself, leaving open whether intermediate whole-body actions are executable, robust under task distribution shifts, and transferable across different GMT backends. We introduce HumanoidArena, a simulation-first benchmark for egocentric hierarchical whole-body learning. The benchmark formulates policy learning as a hierarchical decision making problem: a high-level policy converts egocentric vision, proprioception, and instructions into a compact whole-body action, which is subsequently executed by a low-level GMT. Instead of treating the legs as planar transport tools, HumanoidArena emphasizes interactions where lower-body coordination is structurally necessary in task completion. We therefore design 7 leg-critical HOI/HSI tasks in which success requires foot placement, balance maintenance, posture adjustment, and whole-body reorientation. To further diagnose the hierarchical system, we evaluate policies from two complementary perspectives: perturbation-conditioned generalization and GMT-conditioned transfer. Experiments show that hierarchical control enables learned policies to solve diverse leg-critical interactions, but performance is strongly tracker-conditioned and cross-GMT transfer remains fragile. These results position HumanoidArena as a benchmark for studying transferable intermediate action representations and scalable egocentric whole-body policy learning.
Abstract（参考訳）: ヒューマノイドロボットは、人間中心環境における全身の相互作用を約束するが、タスクレベルの意思決定と全身の動的実行が密結合しているため、スケーラブルなポリシー学習は依然として困難である。現実的な解決策は階層的制御であり、高レベルポリシーは中間的な全身動作を予測し、低レベル一般モーショントラッカー(GMT)は安定なヒューマノイド運動としてそれらを実行する。しかし、既存のベンチマークではポリシートラッカーインターフェース自体を評価することは滅多になく、中間のボディアクションが実行可能か、タスクの分散シフト下で堅牢か、異なるGMTバックエンド間で転送可能であるかは未定である。我々は,自己中心型階層型全身学習のためのシミュレーションファーストベンチマークであるHumanoidArenaを紹介する。このベンチマークは、ポリシー学習を階層的な意思決定問題として定式化し、高レベルのポリシーは、エゴセントリックなビジョン、プロプレセプション、命令をコンパクトな全身アクションに変換し、その後低レベルのGMTによって実行される。脚を平面輸送ツールとして扱う代わりに、HumanoidArenaは、作業完了時に下半身調整が構造的に必要となる相互作用を強調している。そこで我々は,足の配置,バランス維持,姿勢調整,体全体の順応を必要とする7つの脚クリティカルなHOI/HSIタスクを設計する。階層的なシステムをさらに診断するために、摂動条件の一般化とGMT条件の移行という2つの相補的な観点からポリシーを評価する。実験により,階層的制御により多様な脚クリティカルな相互作用を解決することができるが,性能はトラッカー条件が強く,GMT間移動は脆弱であることがわかった。これらの結果から,HumanoidArenaは移動可能な中間動作表現とスケーラブルなエゴセントリックな全身政策学習のベンチマークとして位置づけられた。

論文の概要: HumanoidArena: Benchmarking Egocentric Hierarchical Whole-body Learning

関連論文リスト