KinePose: A temporally optimized inverse kinematics technique for 6DOF
human pose estimation with biomechanical constraints
- URL: http://arxiv.org/abs/2207.12841v1
- Date: Tue, 26 Jul 2022 12:17:07 GMT
- Title: KinePose: A temporally optimized inverse kinematics technique for 6DOF
human pose estimation with biomechanical constraints
- Authors: Kevin Gildea, Clara Mercadal-Baudart, Richard Blythman, Aljosa Smolic,
Ciaran Simms
- Abstract summary: We propose a technique to infer joint orientations throughout a biomechanically informed, and subject-specific kinematic chain.
We generate 3D pose motion sequences to assess the IK approach both for general accuracy, and accuracy in boundary cases.
Our temporal algorithm achieves 6DOF pose estimates with low Mean Per Joint Angular Separation (MPJAS) errors.
- Score: 11.912058874826549
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Computer vision/deep learning-based 3D human pose estimation methods aim to
localize human joints from images and videos. Pose representation is normally
limited to 3D joint positional/translational degrees of freedom (3DOFs),
however, a further three rotational DOFs (6DOFs) are required for many
potential biomechanical applications. Positional DOFs are insufficient to
analytically solve for joint rotational DOFs in a 3D human skeletal model.
Therefore, we propose a temporal inverse kinematics (IK) optimization technique
to infer joint orientations throughout a biomechanically informed, and
subject-specific kinematic chain. For this, we prescribe link directions from a
position-based 3D pose estimate. Sequential least squares quadratic programming
is used to solve a minimization problem that involves both frame-based pose
terms, and a temporal term. The solution space is constrained using joint DOFs,
and ranges of motion (ROMs). We generate 3D pose motion sequences to assess the
IK approach both for general accuracy, and accuracy in boundary cases. Our
temporal algorithm achieves 6DOF pose estimates with low Mean Per Joint Angular
Separation (MPJAS) errors (3.7{\deg}/joint overall, & 1.6{\deg}/joint for lower
limbs). With frame-by-frame IK we obtain low errors in the case of bent elbows
and knees, however, motion sequences with phases of extended/straight limbs
results in ambiguity in twist angle. With temporal IK, we reduce ambiguity for
these poses, resulting in lower average errors.
Related papers
- Hybrid 3D Human Pose Estimation with Monocular Video and Sparse IMUs [15.017274891943162]
Temporal 3D human pose estimation from monocular videos is a challenging task in human-centered computer vision.
Inertial sensor has been introduced to provide complementary source of information.
It remains challenging to integrate heterogeneous sensor data for producing physically rational 3D human poses.
arXiv Detail & Related papers (2024-04-27T09:02:42Z) - Platypose: Calibrated Zero-Shot Multi-Hypothesis 3D Human Motion Estimation [6.061037203796638]
Platypose is a framework that uses a diffusion model pretrained on 3D human motion sequences for zero-shot 3D pose sequence estimation.
Platypose achieves state-of-the-art calibration and competitive joint error when tested on static poses from Human3.6M, MPI-INF-3DHP and 3DPW.
arXiv Detail & Related papers (2024-03-10T10:30:34Z) - NIKI: Neural Inverse Kinematics with Invertible Neural Networks for 3D
Human Pose and Shape Estimation [53.25973084799954]
We present NIKI (Neural Inverse Kinematics with Invertible Neural Network), which models bi-directional errors.
NIKI can learn from both the forward and inverse processes with invertible networks.
arXiv Detail & Related papers (2023-05-15T12:13:24Z) - Non-Local Latent Relation Distillation for Self-Adaptive 3D Human Pose
Estimation [63.199549837604444]
3D human pose estimation approaches leverage different forms of strong (2D/3D pose) or weak (multi-view or depth) paired supervision.
We cast 3D pose learning as a self-supervised adaptation problem that aims to transfer the task knowledge from a labeled source domain to a completely unpaired target.
We evaluate different self-adaptation settings and demonstrate state-of-the-art 3D human pose estimation performance on standard benchmarks.
arXiv Detail & Related papers (2022-04-05T03:52:57Z) - Encoder-decoder with Multi-level Attention for 3D Human Shape and Pose
Estimation [61.98690211671168]
We propose a Multi-level Attention-Decoder Network (MAED) to model multi-level attentions in a unified framework.
With the training set of 3DPW, MAED outperforms previous state-of-the-art methods by 6.2, 7.2, and 2.4 mm of PA-MPJPE.
arXiv Detail & Related papers (2021-09-06T09:06:17Z) - Improving Robustness and Accuracy via Relative Information Encoding in
3D Human Pose Estimation [59.94032196768748]
We propose a relative information encoding method that yields positional and temporal enhanced representations.
Our method outperforms state-of-the-art methods on two public datasets.
arXiv Detail & Related papers (2021-07-29T14:12:19Z) - Graph and Temporal Convolutional Networks for 3D Multi-person Pose
Estimation in Monocular Videos [33.974241749058585]
We propose a novel framework integrating graph convolutional networks (GCNs) and temporal convolutional networks (TCNs) to robustly estimate camera-centric multi-person 3D poses.
In particular, we introduce a human-joint GCN, which employs the 2D pose estimator's confidence scores to improve the pose estimation results.
The two GCNs work together to estimate the spatial frame-wise 3D poses and can make use of both visible joint and bone information in the target frame to estimate the occluded or missing human-part information.
arXiv Detail & Related papers (2020-12-22T03:01:19Z) - Beyond Weak Perspective for Monocular 3D Human Pose Estimation [6.883305568568084]
We consider the task of 3D joints location and orientation prediction from a monocular video.
We first infer 2D joints locations with an off-the-shelf pose estimation algorithm.
We then adhere to the SMPLify algorithm which receives those initial parameters.
arXiv Detail & Related papers (2020-09-14T16:23:14Z) - MotioNet: 3D Human Motion Reconstruction from Monocular Video with
Skeleton Consistency [72.82534577726334]
We introduce MotioNet, a deep neural network that directly reconstructs the motion of a 3D human skeleton from monocular video.
Our method is the first data-driven approach that directly outputs a kinematic skeleton, which is a complete, commonly used, motion representation.
arXiv Detail & Related papers (2020-06-22T08:50:09Z) - Fusing Wearable IMUs with Multi-View Images for Human Pose Estimation: A
Geometric Approach [76.10879433430466]
We propose to estimate 3D human pose from multi-view images and a few IMUs attached at person's limbs.
It operates by firstly detecting 2D poses from the two signals, and then lifting them to the 3D space.
The simple two-step approach reduces the error of the state-of-the-art by a large margin on a public dataset.
arXiv Detail & Related papers (2020-03-25T00:26:54Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.