Self-Constrained Inference Optimization on Structural Groups for Human
Pose Estimation
- URL: http://arxiv.org/abs/2207.02425v1
- Date: Wed, 6 Jul 2022 03:53:02 GMT
- Title: Self-Constrained Inference Optimization on Structural Groups for Human
Pose Estimation
- Authors: Zhehan Kan, Shuoshuo Chen, Zeng Li, Zhihai He
- Abstract summary: Group-wise structural correlation can be explored to improve the accuracy and robustness of human pose estimation.
We develop a self-constrained prediction-verification network to characterize and learn the structural correlation between keypoints during training.
During the inference stage, the feedback information from the verification network allows us to perform further optimization of pose prediction.
- Score: 19.630070553319506
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We observe that human poses exhibit strong group-wise structural correlation
and spatial coupling between keypoints due to the biological constraints of
different body parts. This group-wise structural correlation can be explored to
improve the accuracy and robustness of human pose estimation. In this work, we
develop a self-constrained prediction-verification network to characterize and
learn the structural correlation between keypoints during training. During the
inference stage, the feedback information from the verification network allows
us to perform further optimization of pose prediction, which significantly
improves the performance of human pose estimation. Specifically, we partition
the keypoints into groups according to the biological structure of human body.
Within each group, the keypoints are further partitioned into two subsets,
high-confidence base keypoints and low-confidence terminal keypoints. We
develop a self-constrained prediction-verification network to perform forward
and backward predictions between these keypoint subsets. One fundamental
challenge in pose estimation, as well as in generic prediction tasks, is that
there is no mechanism for us to verify if the obtained pose estimation or
prediction results are accurate or not, since the ground truth is not
available. Once successfully learned, the verification network serves as an
accuracy verification module for the forward pose prediction. During the
inference stage, it can be used to guide the local optimization of the pose
estimation results of low-confidence keypoints with the self-constrained loss
on high-confidence keypoints as the objective function. Our extensive
experimental results on benchmark MS COCO and CrowdPose datasets demonstrate
that the proposed method can significantly improve the pose estimation results.
Related papers
- Occluded Human Pose Estimation based on Limb Joint Augmentation [14.36131862057872]
We propose an occluded human pose estimation framework based on limb joint augmentation to enhance the generalization ability of the pose estimation model on the occluded human bodies.
To further enhance the localization ability of the model, this paper constructs a dynamic structure loss function based on limb graphs to explore the distribution of occluded joints.
arXiv Detail & Related papers (2024-10-13T15:48:24Z) - Structured Radial Basis Function Network: Modelling Diversity for
Multiple Hypotheses Prediction [51.82628081279621]
Multi-modal regression is important in forecasting nonstationary processes or with a complex mixture of distributions.
A Structured Radial Basis Function Network is presented as an ensemble of multiple hypotheses predictors for regression problems.
It is proved that this structured model can efficiently interpolate this tessellation and approximate the multiple hypotheses target distribution.
arXiv Detail & Related papers (2023-09-02T01:27:53Z) - 2D Human Pose Estimation with Explicit Anatomical Keypoints Structure
Constraints [15.124606575017621]
We present a novel 2D human pose estimation method with explicit anatomical keypoints structure constraints.
Our proposed model can be plugged in the most existing bottom-up or top-down human pose estimation methods.
Our methods perform favorably against the most existing bottom-up and top-down human pose estimation methods.
arXiv Detail & Related papers (2022-12-05T11:01:43Z) - Dynamic Iterative Refinement for Efficient 3D Hand Pose Estimation [87.54604263202941]
We propose a tiny deep neural network of which partial layers are iteratively exploited for refining its previous estimations.
We employ learned gating criteria to decide whether to exit from the weight-sharing loop, allowing per-sample adaptation in our model.
Our method consistently outperforms state-of-the-art 2D/3D hand pose estimation approaches in terms of both accuracy and efficiency for widely used benchmarks.
arXiv Detail & Related papers (2021-11-11T23:31:34Z) - Bottom-Up Human Pose Estimation Via Disentangled Keypoint Regression [81.05772887221333]
We study the dense keypoint regression framework that is previously inferior to the keypoint detection and grouping framework.
We present a simple yet effective approach, named disentangled keypoint regression (DEKR)
We empirically show that the proposed direct regression method outperforms keypoint detection and grouping methods.
arXiv Detail & Related papers (2021-04-06T05:54:46Z) - Confidence Adaptive Anytime Pixel-Level Recognition [86.75784498879354]
Anytime inference requires a model to make a progression of predictions which might be halted at any time.
We propose the first unified and end-to-end model approach for anytime pixel-level recognition.
arXiv Detail & Related papers (2021-04-01T20:01:57Z) - TFPose: Direct Human Pose Estimation with Transformers [83.03424247905869]
We formulate the pose estimation task into a sequence prediction problem that can effectively be solved by transformers.
Our framework is simple and direct, bypassing the drawbacks of the heatmap-based pose estimation.
Experiments on the MS-COCO and MPII datasets demonstrate that our method can significantly improve the state-of-the-art of regression-based pose estimation.
arXiv Detail & Related papers (2021-03-29T04:18:54Z) - Shape Consistent 2D Keypoint Estimation under Domain Shift [35.15266729401601]
We present a novel deep adaptation framework for estimating keypoints under domain shift.
Our method seamlessly combines three different components: feature alignment, adversarial training and self-supervision.
Our approach outperforms state-of-the-art domain adaptation methods in the 2D keypoint prediction task.
arXiv Detail & Related papers (2020-08-04T14:32:06Z) - Bottom-Up Human Pose Estimation by Ranking Heatmap-Guided Adaptive
Keypoint Estimates [76.51095823248104]
We present several schemes that are rarely or unthoroughly studied before for improving keypoint detection and grouping (keypoint regression) performance.
First, we exploit the keypoint heatmaps for pixel-wise keypoint regression instead of separating them for improving keypoint regression.
Second, we adopt a pixel-wise spatial transformer network to learn adaptive representations for handling the scale and orientation variance.
Third, we present a joint shape and heatvalue scoring scheme to promote the estimated poses that are more likely to be true poses.
arXiv Detail & Related papers (2020-06-28T01:14:59Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.