Fugu-MT 論文翻訳(概要): LessMimic: Long-Horizon Humanoid Interaction with Unified Distance Field Representations

論文の概要: LessMimic: Long-Horizon Humanoid Interaction with Unified Distance Field Representations

arxiv url: http://arxiv.org/abs/2602.21723v1
Date: Wed, 25 Feb 2026 09:31:28 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-23 08:17:41.70427
Title: LessMimic: Long-Horizon Humanoid Interaction with Unified Distance Field Representations
Title（参考訳）: LessMimic: Unified Distance Field Representationを用いた長距離ヒューマノイドインタラクション
Authors: Yutang Lin, Jieming Cui, Yixuan Li, Baoxiong Jia, Yixin Zhu, Siyuan Huang,
Abstract要約: 距離場(Distance Field)は、基準のない推論、幾何学的一般化、および1つのポリシーにおける長距離スキル構成を表す。 LessMimicは、非構造環境の障害を一般化し、スキルを構築し、回復するヒューマノイドロボットへのスケーラブルなパスを提供する。
参考スコア（独自算出の注目度）: 36.40622644347312
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Humanoid robots that autonomously interact with physical environments over extended horizons represent a central goal of embodied intelligence. Existing approaches rely on reference motions or task-specific rewards, tightly coupling policies to particular object geometries and precluding multi-skill generalization within a single framework. A unified interaction representation enabling reference-free inference, geometric generalization, and long-horizon skill composition within one policy remains an open challenge. Here we show that Distance Field (DF) provides such a representation: LessMimic conditions a single whole-body policy on DF-derived geometric cues--surface distances, gradients, and velocity decompositions--removing the need for motion references, with interaction latents encoded via a Variational Auto-Encoder (VAE) and post-trained using Adversarial Interaction Priors (AIP) under Reinforcement Learning (RL). Through DAgger-style distillation that aligns DF latents with egocentric depth features, LessMimic further transfers seamlessly to vision-only deployment without motion capture (MoCap) infrastructure. A single LessMimic policy achieves 80--100% success across object scales from 0.4x to 1.6x on PickUp and SitStand where baselines degrade sharply, attains 62.1% success on 5 task instances trajectories, and remains viable up to 40 sequentially composed tasks. By grounding interaction in local geometry rather than demonstrations, LessMimic offers a scalable path toward humanoid robots that generalize, compose skills, and recover from failures in unstructured environments.
Abstract（参考訳）: 地平線を越えて物理的環境と自律的に対話するヒューマノイドロボットは、インテリジェンスを具現化する中心的な目標である。既存のアプローチでは、参照動作やタスク固有の報酬、特定のオブジェクトジオメトリへの密結合ポリシー、単一のフレームワーク内でのマルチスキルの一般化を前提としています。参照のない推論、幾何学的一般化、一つのポリシー内での長距離スキル構成を可能にする統一された相互作用表現は、未解決の課題である。ここでは、距離場(DF)がこのような表現を提供することを示す: 最小限の条件は、DF由来の幾何学的手がかり(表面距離、勾配、速度分解)に対する単一の全身ポリシーであり、動き参照の必要性を排除し、ラバーショナルオートエンコーダ(VAE)を介してコード化され、Reinforcement Learning(RL)の下で対数的相互作用優先(AIP)を用いて後訓練される。 DF潜水剤をエゴセントリックな深度特徴と整合させるDAggerスタイルの蒸留を通じて、LesMimicはモーションキャプチャー(MoCap)インフラストラクチャーを使わずに、視覚のみのデプロイメントにシームレスに移行する。 LessMimicのポリシーは、PickUpとSitStandで0.4xから1.6倍のオブジェクトスケールで80-100%成功し、ベースラインは急激に低下し、5つのタスクインスタンスのトラジェクトリで62.1%成功し、最大40のシーケンシャルな構成タスクが実行可能である。デモではなく、局所幾何学の相互作用を基盤にすることで、LesMimicは、非構造化環境での失敗を一般化し、スキルを構成し、回復するヒューマノイドロボットへのスケーラブルな道を提供する。

論文の概要: LessMimic: Long-Horizon Humanoid Interaction with Unified Distance Field Representations

関連論文リスト