PoseExaminer: Automated Testing of Out-of-Distribution Robustness in
Human Pose and Shape Estimation
- URL: http://arxiv.org/abs/2303.07337v2
- Date: Thu, 30 Mar 2023 04:34:04 GMT
- Title: PoseExaminer: Automated Testing of Out-of-Distribution Robustness in
Human Pose and Shape Estimation
- Authors: Qihao Liu, Adam Kortylewski, Alan Yuille
- Abstract summary: We develop a simulator that can be controlled in a fine-grained manner to explore the manifold of images of human pose.
We introduce a learning-based testing method, termed PoseExaminer, that automatically diagnoses HPS algorithms.
We show that our PoseExaminer discovers a variety of limitations in current state-of-the-art models that are relevant in real-world scenarios.
- Score: 15.432266117706018
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Human pose and shape (HPS) estimation methods achieve remarkable results.
However, current HPS benchmarks are mostly designed to test models in scenarios
that are similar to the training data. This can lead to critical situations in
real-world applications when the observed data differs significantly from the
training data and hence is out-of-distribution (OOD). It is therefore important
to test and improve the OOD robustness of HPS methods. To address this
fundamental problem, we develop a simulator that can be controlled in a
fine-grained manner using interpretable parameters to explore the manifold of
images of human pose, e.g. by varying poses, shapes, and clothes. We introduce
a learning-based testing method, termed PoseExaminer, that automatically
diagnoses HPS algorithms by searching over the parameter space of human pose
images to find the failure modes. Our strategy for exploring this
high-dimensional parameter space is a multi-agent reinforcement learning
system, in which the agents collaborate to explore different parts of the
parameter space. We show that our PoseExaminer discovers a variety of
limitations in current state-of-the-art models that are relevant in real-world
scenarios but are missed by current benchmarks. For example, it finds large
regions of realistic human poses that are not predicted correctly, as well as
reduced performance for humans with skinny and corpulent body shapes. In
addition, we show that fine-tuning HPS methods by exploiting the failure modes
found by PoseExaminer improve their robustness and even their performance on
standard benchmarks by a significant margin. The code are available for
research purposes.
Related papers
- PoseBench: Benchmarking the Robustness of Pose Estimation Models under Corruptions [57.871692507044344]
Pose estimation aims to accurately identify anatomical keypoints in humans and animals using monocular images.
Current models are typically trained and tested on clean data, potentially overlooking the corruption during real-world deployment.
We introduce PoseBench, a benchmark designed to evaluate the robustness of pose estimation models against real-world corruption.
arXiv Detail & Related papers (2024-06-20T14:40:17Z) - Opinion-Unaware Blind Image Quality Assessment using Multi-Scale Deep Feature Statistics [54.08757792080732]
We propose integrating deep features from pre-trained visual models with a statistical analysis model to achieve opinion-unaware BIQA (OU-BIQA)
Our proposed model exhibits superior consistency with human visual perception compared to state-of-the-art BIQA models.
arXiv Detail & Related papers (2024-05-29T06:09:34Z) - SHARE: Single-view Human Adversarial REconstruction [23.655024290047308]
We introduce SHARE, a novel fine-tuning method to enhance the robustness of existing HPS techniques.
We perform a comprehensive analysis on the impact of camera poses on HPS reconstruction outcomes.
Our results illustrate a reduction in mean joint errors across single-view HPS techniques.
arXiv Detail & Related papers (2023-12-30T23:01:31Z) - Adversarial Parametric Pose Prior [106.12437086990853]
We learn a prior that restricts the SMPL parameters to values that produce realistic poses via adversarial training.
We show that our learned prior covers the diversity of the real-data distribution, facilitates optimization for 3D reconstruction from 2D keypoints, and yields better pose estimates when used for regression from images.
arXiv Detail & Related papers (2021-12-08T10:05:32Z) - LatentHuman: Shape-and-Pose Disentangled Latent Representation for Human
Bodies [78.17425779503047]
We propose a novel neural implicit representation for the human body.
It is fully differentiable and optimizable with disentangled shape and pose latent spaces.
Our model can be trained and fine-tuned directly on non-watertight raw data with well-designed losses.
arXiv Detail & Related papers (2021-11-30T04:10:57Z) - Adapted Human Pose: Monocular 3D Human Pose Estimation with Zero Real 3D
Pose Data [14.719976311208502]
Training vs. test data domain gaps often negatively affect model performance.
We present our adapted human pose (AHuP) approach that addresses adaptation problems in both appearance and pose spaces.
AHuP is built around a practical assumption that in real applications, data from target domain could be inaccessible or only limited information can be acquired.
arXiv Detail & Related papers (2021-05-23T01:20:40Z) - Kinematic-Structure-Preserved Representation for Unsupervised 3D Human
Pose Estimation [58.72192168935338]
Generalizability of human pose estimation models developed using supervision on large-scale in-studio datasets remains questionable.
We propose a novel kinematic-structure-preserved unsupervised 3D pose estimation framework, which is not restrained by any paired or unpaired weak supervisions.
Our proposed model employs three consecutive differentiable transformations named as forward-kinematics, camera-projection and spatial-map transformation.
arXiv Detail & Related papers (2020-06-24T23:56:33Z) - Self-Supervised 3D Human Pose Estimation via Part Guided Novel Image
Synthesis [72.34794624243281]
We propose a self-supervised learning framework to disentangle variations from unlabeled video frames.
Our differentiable formalization, bridging the representation gap between the 3D pose and spatial part maps, allows us to operate on videos with diverse camera movements.
arXiv Detail & Related papers (2020-04-09T07:55:01Z) - MirrorNet: A Deep Bayesian Approach to Reflective 2D Pose Estimation
from Human Images [42.27703025887059]
The main problems with the standard supervised approach are that it often yields anatomically implausible poses.
We propose a semi-supervised method that can make effective use of images with and without pose annotations.
The results of experiments show that the proposed reflective architecture makes estimated poses anatomically plausible.
arXiv Detail & Related papers (2020-04-08T05:02:48Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.