Uncertainty-Aware Testing-Time Optimization for 3D Human Pose Estimation
- URL: http://arxiv.org/abs/2402.02339v1
- Date: Sun, 4 Feb 2024 04:28:02 GMT
- Title: Uncertainty-Aware Testing-Time Optimization for 3D Human Pose Estimation
- Authors: Ti Wang, Mengyuan Liu, Hong Liu, Bin Ren, Yingxuan You, Wenhao Li,
Nicu Sebe, Xia Li
- Abstract summary: We propose an Uncertainty-Aware testing-time optimization framework for 3D human pose estimation.
Our approach outperforms the previous best result by a large margin of 4.5% on Human3.6M.
- Score: 68.75387874066647
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Although data-driven methods have achieved success in 3D human pose
estimation, they often suffer from domain gaps and exhibit limited
generalization. In contrast, optimization-based methods excel in fine-tuning
for specific cases but are generally inferior to data-driven methods in overall
performance. We observe that previous optimization-based methods commonly rely
on projection constraint, which only ensures alignment in 2D space, potentially
leading to the overfitting problem. To address this, we propose an
Uncertainty-Aware testing-time Optimization (UAO) framework, which keeps the
prior information of pre-trained model and alleviates the overfitting problem
using the uncertainty of joints. Specifically, during the training phase, we
design an effective 2D-to-3D network for estimating the corresponding 3D pose
while quantifying the uncertainty of each 3D joint. For optimization during
testing, the proposed optimization framework freezes the pre-trained model and
optimizes only a latent state. Projection loss is then employed to ensure the
generated poses are well aligned in 2D space for high-quality optimization.
Furthermore, we utilize the uncertainty of each joint to determine how much
each joint is allowed for optimization. The effectiveness and superiority of
the proposed framework are validated through extensive experiments on two
challenging datasets: Human3.6M and MPI-INF-3DHP. Notably, our approach
outperforms the previous best result by a large margin of 4.5% on Human3.6M.
Our source code will be open-sourced.
Related papers
- UPose3D: Uncertainty-Aware 3D Human Pose Estimation with Cross-View and Temporal Cues [55.69339788566899]
UPose3D is a novel approach for multi-view 3D human pose estimation.
It improves robustness and flexibility without requiring direct 3D annotations.
arXiv Detail & Related papers (2024-04-23T00:18:00Z) - Back to Optimization: Diffusion-based Zero-Shot 3D Human Pose Estimation [29.037799937729687]
Learning-based methods have dominated the 3D human pose estimation (HPE) tasks with significantly better performance in most benchmarks than traditional optimization-based methods.
We propose textbfZero-shot textbfDiffusion-based textbfOptimization (textbfZeDO) pipeline for 3D HPE.
Our multi-hypothesis textittextbfZeDO achieves state-of-the-art (SOTA) performance on Human3.6M, with minMPJPE $51.4$
arXiv Detail & Related papers (2023-07-07T21:03:18Z) - Uncertainty-Aware Adaptation for Self-Supervised 3D Human Pose
Estimation [70.32536356351706]
We introduce MRP-Net that constitutes a common deep network backbone with two output heads subscribing to two diverse configurations.
We derive suitable measures to quantify prediction uncertainty at both pose and joint level.
We present a comprehensive evaluation of the proposed approach and demonstrate state-of-the-art performance on benchmark datasets.
arXiv Detail & Related papers (2022-03-29T07:14:58Z) - Camera Distortion-aware 3D Human Pose Estimation in Video with
Optimization-based Meta-Learning [23.200130129530653]
Existing 3D human pose estimation algorithms trained on distortion-free datasets suffer performance drop when applied to new scenarios with a specific camera distortion.
We propose a simple yet effective model for 3D human pose estimation in video that can quickly adapt to any distortion environment.
arXiv Detail & Related papers (2021-11-30T01:35:04Z) - Dynamic Iterative Refinement for Efficient 3D Hand Pose Estimation [87.54604263202941]
We propose a tiny deep neural network of which partial layers are iteratively exploited for refining its previous estimations.
We employ learned gating criteria to decide whether to exit from the weight-sharing loop, allowing per-sample adaptation in our model.
Our method consistently outperforms state-of-the-art 2D/3D hand pose estimation approaches in terms of both accuracy and efficiency for widely used benchmarks.
arXiv Detail & Related papers (2021-11-11T23:31:34Z) - Revitalizing Optimization for 3D Human Pose and Shape Estimation: A
Sparse Constrained Formulation [21.710205047008916]
We propose a novel sparse constrained formulation and from it derive a real-time optimization method for 3D human pose and shape estimation.
We show that this computation scales linearly with the number of joints of a complex 3D human model, in contrast to prior work where it scales cubically due to their dense unconstrained formulation.
We present a real-time motion capture framework that estimates 3D human poses and shapes from a single image at over 30 FPS.
arXiv Detail & Related papers (2021-05-28T16:44:56Z) - Deep Optimized Priors for 3D Shape Modeling and Reconstruction [38.79018852887249]
We introduce a new learning framework for 3D modeling and reconstruction.
We show that the proposed strategy effectively breaks the barriers constrained by the pre-trained priors.
arXiv Detail & Related papers (2020-12-14T03:56:31Z) - Human Body Model Fitting by Learned Gradient Descent [48.79414884222403]
We propose a novel algorithm for the fitting of 3D human shape to images.
We show that this algorithm is fast (avg. 120ms convergence), robust to dataset, and achieves state-of-the-art results on public evaluation datasets.
arXiv Detail & Related papers (2020-08-19T14:26:47Z) - Inference Stage Optimization for Cross-scenario 3D Human Pose Estimation [97.93687743378106]
Existing 3D pose estimation models suffer performance drop when applying to new scenarios with unseen poses.
We propose a novel framework, Inference Stage Optimization (ISO), for improving the generalizability of 3D pose models.
Remarkably, it yields new state-of-the-art of 83.6% 3D PCK on MPI-INF-3DHP, improving upon the previous best result by 9.7%.
arXiv Detail & Related papers (2020-07-04T09:45:18Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.