Fugu-MT 論文翻訳(概要): Breaking the Passive Learning Trap: An Active Perception Strategy for Human Motion Prediction

論文の概要: Breaking the Passive Learning Trap: An Active Perception Strategy for Human Motion Prediction

arxiv url: http://arxiv.org/abs/2511.14237v1
Date: Tue, 18 Nov 2025 08:15:23 GMT
ステータス: 翻訳完了
システム内更新日: 2025-11-19 16:23:53.006158
Title: Breaking the Passive Learning Trap: An Active Perception Strategy for Human Motion Prediction
Title（参考訳）: 受動的学習トラップを打破する:人間の動作予測のためのアクティブな知覚戦略
Authors: Juncheng Hu, Zijian Zhang, Zeyu Wang, Guoyu Wang, Yingji Li, Kedi Lyu,
Abstract要約: 人間の動作予測のための能動知覚戦略(APS)を提案する。 APSは補助学習目標を導入しながら、運動特性を明示的に符号化する。実験により,本手法は新たな最先端の既存手法を大きなマージンで実現することが示された。
参考スコア（独自算出の注目度）: 17.106453744027217
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Forecasting 3D human motion is an important embodiment of fine-grained understanding and cognition of human behavior by artificial agents. Current approaches excessively rely on implicit network modeling of spatiotemporal relationships and motion characteristics, falling into the passive learning trap that results in redundant and monotonous 3D coordinate information acquisition while lacking actively guided explicit learning mechanisms. To overcome these issues, we propose an Active Perceptual Strategy (APS) for human motion prediction, leveraging quotient space representations to explicitly encode motion properties while introducing auxiliary learning objectives to strengthen spatio-temporal modeling. Specifically, we first design a data perception module that projects poses into the quotient space, decoupling motion geometry from coordinate redundancy. By jointly encoding tangent vectors and Grassmann projections, this module simultaneously achieves geometric dimension reduction, semantic decoupling, and dynamic constraint enforcement for effective motion pose characterization. Furthermore, we introduce a network perception module that actively learns spatio-temporal dependencies through restorative learning. This module deliberately masks specific joints or injects noise to construct auxiliary supervision signals. A dedicated auxiliary learning network is designed to actively adapt and learn from perturbed information. Notably, APS is model agnostic and can be integrated with different prediction models to enhance active perceptual. The experimental results demonstrate that our method achieves the new state-of-the-art, outperforming existing methods by large margins: 16.3% on H3.6M, 13.9% on CMU Mocap, and 10.1% on 3DPW.
Abstract（参考訳）: 3次元の人間の動きを予測することは、人工エージェントによる人間の行動のきめ細かい理解と認識の重要な具体化である。現在のアプローチでは、時空間関係と運動特性の暗黙的なネットワークモデリングを過度に頼り、能動的な学習メカニズムを欠きながら、冗長で単調な3D座標情報取得をもたらす受動的学習トラップに陥る。これらの課題を克服するために,人間の動き予測のための能動知覚戦略 (APS) を提案し,空間表現を利用して運動特性を明示的に符号化し,時空間モデルを強化するための補助学習目標を導入する。具体的には、まず、座標の冗長性から運動幾何学を分離し、商空間に投影するデータ知覚モジュールを設計する。接ベクトルとグラスマン射影を共同で符号化することにより、このモジュールは幾何学的次元の減少、意味的な疎結合、そして効果的な動きのポーズ特徴付けのための動的制約強制を同時に達成する。さらに,回復学習を通じて時空間依存を積極的に学習するネットワーク認識モジュールを導入する。このモジュールは、特定の関節を意図的にマスクしたり、ノイズを注入して補助的な監視信号を構成する。専用の補助学習ネットワークは、摂動情報から積極的に適応し、学習するために設計されている。特に、APSはモデル非依存であり、様々な予測モデルと統合して、アクティブな知覚を高めることができる。実験の結果,H3.6Mが16.3%,CMU Mocapが13.9%,3DPWが10.1%,従来の手法よりも高い性能を示した。

論文の概要: Breaking the Passive Learning Trap: An Active Perception Strategy for Human Motion Prediction

関連論文リスト