Fugu-MT 論文翻訳(概要): Learning to Evolve: Multi-modal Interactive Fields for Robust Humanoid Navigation in Dynamic Environments

論文の概要: Learning to Evolve: Multi-modal Interactive Fields for Robust Humanoid Navigation in Dynamic Environments

arxiv url: http://arxiv.org/abs/2605.21935v1
Date: Thu, 21 May 2026 03:11:43 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-22 16:35:42.070349
Title: Learning to Evolve: Multi-modal Interactive Fields for Robust Humanoid Navigation in Dynamic Environments
Title（参考訳）: 進化への学習:動的環境におけるロバストなヒューマノイドナビゲーションのためのマルチモーダルインタラクティブフィールド
Authors: Peifeng Jiang, Hong Liu, Jin Jin, Wenshuai Wang, Xia Li,
Abstract要約: マルチモーダル・インタラクティブ・フィールド(MIF)は、信頼を意識したセマンティック3Dガウス・スプラッティング、離散性トリガー付き空間記憶更新、およびクローズドループ認識適応パイプライン内でのタスク駆動幾何再構成を統合したヒューマノイド指向システムである。実際のダイナミックオフィスのUnitree-G1ヒューマノイドでは、MIFは静的なシーングラフメモリに比べて12%から94%の非静的環境における再配置の成功を改善し、実用的なオンライン操作のための機能蒸留によってセマンティックメモリのフットプリントを91.4%削減した。
参考スコア（独自算出の注目度）: 10.149525023566712
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Safe manipulation-oriented navigation for humanoid robots requires scene memory that remains reliable under locomotion-induced perceptual distortion, environmental changes, and interaction-level geometric safety constraints. Existing semantic mapping and scene-graph systems are difficult to deploy directly in this setting because they often assume stable camera trajectories, static environments, or coarse object geometry. We introduce the Multi-modal Interactive Field (MIF), a humanoid-oriented system that integrates confidence-aware semantic 3D Gaussian Splatting, discrepancy-triggered spatial memory updates, and task-driven geometric reconstruction within a closed-loop perception-adaptation pipeline. MIF couples three fields: an uncertainty-aware 3DGS Appearance Field that suppresses gait-induced blur, a Spatial Field that maintains topological memory, and a Geometry Field that supports Interaction Pose Safety (IPS) before manipulation. A discrepancy detection score is introduced to separate locomotion-induced false-positive changes from persistent changes and updates only locally inconsistent regions. On a Unitree-G1 humanoid in a real dynamic office, MIF improves relocation success in non-static environments from 12% to 94% compared with static scene-graph memory, while reducing semantic memory footprint by 91.4% through feature distillation for practical online operation. Project page and code: https://ziya-jiang.github.io/MIF-homepage/
Abstract（参考訳）: ヒューマノイドロボットの安全な操作指向ナビゲーションには、ロコモーションによる知覚歪み、環境変化、相互作用レベルの幾何学的安全制約の下で信頼性の高いシーンメモリが必要である。既存のセマンティックマッピングやシーングラフシステムは、安定なカメラ軌跡、静的環境、粗いオブジェクト形状を前提としていることが多いため、この設定で直接デプロイすることは困難である。そこで我々は,Multi-modal Interactive Field (MIF)を導入し,信頼度を考慮したセマンティック3次元ガウス分割,離散性トリガー空間メモリ更新,クローズドループ認識適応パイプライン内のタスク駆動幾何再構成を統合したヒューマノイド指向システムを提案する。 MIFは、歩行によって引き起こされるぼかしを抑制する不確実性を認識した3DGS外見場、トポロジカルメモリを維持する空間場、操作前のインタラクション・ポーズ・セーフティ(IPS)をサポートする幾何学場という3つの分野を結合している。相違検出スコアは、移動によって引き起こされた持続的な変化から偽陽性の変化を分離し、局所的に矛盾する領域のみを更新するために導入される。実際のダイナミックオフィスのUnitree-G1ヒューマノイドでは、MIFは静的なシーングラフメモリに比べて12%から94%の非静的環境における再配置の成功を改善し、実用的なオンライン操作のための機能蒸留によってセマンティックメモリのフットプリントを91.4%削減した。プロジェクトページとコード:https://ziya-jiang.github.io/MIF-homepage/

論文の概要: Learning to Evolve: Multi-modal Interactive Fields for Robust Humanoid Navigation in Dynamic Environments

関連論文リスト