Fugu-MT 論文翻訳(概要): D&D: Learning Human Dynamics from Dynamic Camera

論文の概要: D&D: Learning Human Dynamics from Dynamic Camera

arxiv url: http://arxiv.org/abs/2209.08790v1
Date: Mon, 19 Sep 2022 06:51:02 GMT
ステータス: 翻訳完了
システム内更新日: 2022-09-20 16:52:16.472842
Title: D&D: Learning Human Dynamics from Dynamic Camera
Title（参考訳）: d&d:ダイナミックカメラから人間のダイナミクスを学ぶ
Authors: Jiefeng Li, Siyuan Bian, Chao Xu, Gang Liu, Gang Yu, Cewu Lu
Abstract要約: 本稿では、物理の法則を活かしたD&D(Learning Human Dynamics from Dynamic Camera)を紹介する。私たちのアプローチは完全にニューラルネットワークで、物理エンジンのオフライン最適化やシミュレーションなしで動作します。
参考スコア（独自算出の注目度）: 55.60512353465175
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: 3D human pose estimation from a monocular video has recently seen significant improvements. However, most state-of-the-art methods are kinematics-based, which are prone to physically implausible motions with pronounced artifacts. Current dynamics-based methods can predict physically plausible motion but are restricted to simple scenarios with static camera view. In this work, we present D&D (Learning Human Dynamics from Dynamic Camera), which leverages the laws of physics to reconstruct 3D human motion from the in-the-wild videos with a moving camera. D&D introduces inertial force control (IFC) to explain the 3D human motion in the non-inertial local frame by considering the inertial forces of the dynamic camera. To learn the ground contact with limited annotations, we develop probabilistic contact torque (PCT), which is computed by differentiable sampling from contact probabilities and used to generate motions. The contact state can be weakly supervised by encouraging the model to generate correct motions. Furthermore, we propose an attentive PD controller that adjusts target pose states using temporal information to obtain smooth and accurate pose control. Our approach is entirely neural-based and runs without offline optimization or simulation in physics engines. Experiments on large-scale 3D human motion benchmarks demonstrate the effectiveness of D&D, where we exhibit superior performance against both state-of-the-art kinematics-based and dynamics-based methods. Code is available at https://github.com/Jeffsjtu/DnD
Abstract（参考訳）: 単眼ビデオからの3d人間のポーズ推定は、最近大幅に改善されている。しかし、最先端の手法のほとんどはキネマティックスに基づくもので、目に見える人工物を持つ物理的に目立たない動きの傾向が強い。現在の動的手法は、物理的にもっともらしい動きを予測できるが、静的カメラビューによる単純なシナリオに限定される。本研究では、物理の法則を活かしたD&D(Learning Human Dynamics from Dynamic Camera)を用いて、移動式カメラで撮影した映像から3Dの人間の動きを再現する。 d&dは、動的カメラの慣性力を考慮して、非慣性局所フレームにおける3次元人間の動きを説明する慣性力制御(ifc)を導入する。限られたアノテーションで接地接触を学習するために,接触確率の異なるサンプリングにより計算し,動きを生成する確率的接触トルク(PCT)を開発する。モデルに正しい動きを起こさせるように促すことで、接触状態が弱く監視される。さらに、時間情報を用いて目標ポーズ状態を調整し、スムーズで正確なポーズ制御を実現する注意型PDコントローラを提案する。私たちのアプローチは完全にニューラルネットワークで、物理エンジンのオフライン最適化やシミュレーションなしで動作します。大規模3次元人体運動ベンチマーク実験はD&Dの有効性を実証し, 最先端のキネマティクス法とダイナミックス法の両方に対して優れた性能を示す。コードはhttps://github.com/Jeffsjtu/DnDで入手できる。

論文の概要: D&D: Learning Human Dynamics from Dynamic Camera

関連論文リスト