Fugu-MT 論文翻訳(概要): AtomicMotion: Learning Human Motion From Different Human Parts

論文の概要: AtomicMotion: Learning Human Motion From Different Human Parts

arxiv url: http://arxiv.org/abs/2605.22631v1
Date: Thu, 21 May 2026 15:39:38 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-22 20:14:18.598296
Title: AtomicMotion: Learning Human Motion From Different Human Parts
Title（参考訳）: AtomicMotion: 異なる人間のパーツから人間の動きを学ぶ
Authors: Runzhen Liu, Chuhua Xian, Fa-Ting Hong,
Abstract要約: 私たちは,3つのコアイノベーションを通じて身体のダイナミクスを分離し,再統合するように設計されたフレームワークであるAtomicMotionを紹介します。まず,スケルトンを機能的意図に基づいて5つの異なるクラスタに分解する論理体分割方式を提案する。第二に、スパース入力を高次元ポーズに頑健にマッピングするために、マスク付きフルボディプレコンディショニング戦略を採用する。第3に,バニラ空間的注意の限界に対処し,キネマティック・アテンションを提案する。
参考スコア（独自算出の注目度）: 9.72019653340761
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Accurately reconstructing full-body poses from sparse head and hand trajectories is a foundational challenge for immersive AR/VR telepresence. Current methods often struggle with error accumulation and unnatural joint coordination, primarily because they treat the human body as a monolithic entity, thereby failing to capture the fine-grained ``atomic intents'' embedded in subtle signal variations and overlooking the inherent structural topology. To bridge this gap, we present AtomicMotion, a framework designed to decouple and re-integrate body dynamics through three core innovations. First, we introduce a logical body partitioning scheme that decomposes the skeleton into five distinct clusters based on functional intent; this ensures that each partition preserves internal joint synergies while isolating local motion primitives. Second, to robustly map sparse inputs to high-dimensional poses, we employ a masked full-body pre-conditioning strategy during training, forcing the model to internalize global skeletal topology and latent kinematic constraints. Finally, addressing the limitations of vanilla spatial attention, which often ignores fixed physiological connectivity, we propose Kinematic Attention. By embedding the classical kinematic tree structure into the attention mechanism, we ensure biological plausibility in the synthesized motions. Extensive evaluations on the AMASS dataset demonstrate that AtomicMotion significantly outperforms existing baselines, yielding higher reconstruction fidelity and superior biomechanical realism.
Abstract（参考訳）: 没入型AR/VRテレプレゼンスのための基礎的な課題は、スパースヘッドとハンドトラジェクトリーのフルボディポーズを正確に再構築することである。現在の方法では、主に人体をモノリシックな実体として扱うため、微妙な信号のバリエーションに埋め込まれ、固有の構造的トポロジーを覆い隠すために、エラーの蓄積と不自然な共同調整に苦労することが多い。このギャップを埋めるために、私たちは3つのコアイノベーションを通じてボディダイナミクスを分離し、再統合するように設計されたフレームワークAtomicMotionを紹介します。まず,機能的意図に基づいて骨格を5つの異なるクラスタに分解する論理的体分割方式を導入する。第二に、スパース入力を高次元のポーズに頑健にマッピングするために、トレーニング中にマスク付きフルボディプレコンディショニング戦略を採用し、大域的な骨格トポロジーと潜時的制約を内在化させる。最後に,固定された生理的接続を無視したバニラ空間的注意の限界に対処するため,キネマティック・アテンションを提案する。古典的キネマティックツリー構造を注意機構に埋め込むことで、合成運動における生物学的な可視性を確保できる。 AMASSデータセットの大規模な評価は、AtomicMotionが既存のベースラインを著しく上回り、高い再構築忠実度と優れた生体力学的リアリズムをもたらすことを示している。

論文の概要: AtomicMotion: Learning Human Motion From Different Human Parts

関連論文リスト