Rethinking the Heatmap Regression for Bottom-up Human Pose Estimation
- URL: http://arxiv.org/abs/2012.15175v4
- Date: Thu, 25 Mar 2021 05:38:37 GMT
- Title: Rethinking the Heatmap Regression for Bottom-up Human Pose Estimation
- Authors: Zhengxiong Luo, Zhicheng Wang, Yan Huang, Tieniu Tan, Erjin Zhou
- Abstract summary: We propose the scale-adaptive heatmap regression (SAHR) method, which can adaptively adjust the standard deviation for each keypoint.
SAHR may aggravate the imbalance between fore-background samples, which potentially hurts the improvement of SAHR.
We also introduce the weight-adaptive heatmap regression (WAHR) to help balance the fore-background samples.
- Score: 63.623787834984206
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Heatmap regression has become the most prevalent choice for nowadays human
pose estimation methods. The ground-truth heatmaps are usually constructed via
covering all skeletal keypoints by 2D gaussian kernels. The standard deviations
of these kernels are fixed. However, for bottom-up methods, which need to
handle a large variance of human scales and labeling ambiguities, the current
practice seems unreasonable. To better cope with these problems, we propose the
scale-adaptive heatmap regression (SAHR) method, which can adaptively adjust
the standard deviation for each keypoint. In this way, SAHR is more tolerant of
various human scales and labeling ambiguities. However, SAHR may aggravate the
imbalance between fore-background samples, which potentially hurts the
improvement of SAHR. Thus, we further introduce the weight-adaptive heatmap
regression (WAHR) to help balance the fore-background samples. Extensive
experiments show that SAHR together with WAHR largely improves the accuracy of
bottom-up human pose estimation. As a result, we finally outperform the
state-of-the-art model by +1.5AP and achieve 72.0AP on COCO test-dev2017, which
is com-arable with the performances of most top-down methods. Source codes are
available at https://github.com/greatlog/SWAHR-HumanPose.
Related papers
- Lightweight Super-Resolution Head for Human Pose Estimation [42.51588635059534]
Heatmap-based methods have become the mainstream method for pose estimation.
However, heatmap-based approaches suffer from significant quantization errors with downscale heatmaps.
We propose SRPose to reduce the quantization error and dependence on further post-processing.
arXiv Detail & Related papers (2023-07-31T15:35:34Z) - Learning Structure-Guided Diffusion Model for 2D Human Pose Estimation [71.24808323646167]
We propose textbfDiffusionPose, a new scheme for learning keypoints heatmaps by a neural network.
During training, the keypoints are diffused to random distribution by adding noises and the diffusion model learns to recover ground-truth heatmaps from noised heatmaps.
Experiments show the prowess of our scheme with improvements of 1.6, 1.2, and 1.2 mAP on widely-used COCO, CrowdPose, and AI Challenge datasets.
arXiv Detail & Related papers (2023-06-29T16:24:32Z) - Bias-Compensated Integral Regression for Human Pose Estimation [21.75308054647583]
This paper uncovers an induced bias from integral regression that results from combining the softmax and the expectation operation.
We propose Bias Compensated Integral Regression (BCIR), an integral regression-based framework that compensates for the bias.
BCIR is faster to train and more accurate than the original integral regression, making it competitive with state-of-the-art detection methods.
arXiv Detail & Related papers (2023-01-25T06:54:04Z) - Bottom-Up 2D Pose Estimation via Dual Anatomical Centers for Small-Scale
Persons [75.86463396561744]
In multi-person 2D pose estimation, the bottom-up methods simultaneously predict poses for all persons.
Our method achieves 38.4% improvement on bounding box precision and 39.1% improvement on bounding box recall over the state of the art (SOTA)
For the human pose AP evaluation, we achieve a new SOTA (71.0 AP) on the COCO test-dev set with the single-scale testing.
arXiv Detail & Related papers (2022-08-25T10:09:10Z) - A Simple Method to Boost Human Pose Estimation Accuracy by Correcting
the Joint Regressor for the Human3.6m Dataset [21.096409769550387]
We show that the most widely used SMPL-to-joint linear layer (joint regressor) is inaccurate.
To achieve a more accurate joint regressor, we propose a method to create pseudo-ground-truth SMPL poses.
We show that our regressor leads to improved pose estimations results on the test set without any need for retraining.
arXiv Detail & Related papers (2022-04-29T20:42:48Z) - Human Pose Regression with Residual Log-likelihood Estimation [48.30425850653223]
We propose a novel regression paradigm with Residual Log-likelihood Estimation (RLE) to capture the underlying output distribution.
RLE learns the change of the distribution instead of the unreferenced underlying distribution to facilitate the training process.
Compared to the conventional regression paradigm, regression with RLE bring 12.4 mAP improvement on MSCOCO without any test-time overhead.
arXiv Detail & Related papers (2021-07-23T15:06:31Z) - Graph-PCNN: Two Stage Human Pose Estimation with Graph Pose Refinement [54.29252286561449]
We propose a two-stage graph-based and model-agnostic framework, called Graph-PCNN.
In the first stage, heatmap regression network is applied to obtain a rough localization result, and a set of proposal keypoints, called guided points, are sampled.
In the second stage, for each guided point, different visual feature is extracted by the localization.
The relationship between guided points is explored by the graph pose refinement module to get more accurate localization results.
arXiv Detail & Related papers (2020-07-21T04:59:15Z) - Bottom-Up Human Pose Estimation by Ranking Heatmap-Guided Adaptive
Keypoint Estimates [76.51095823248104]
We present several schemes that are rarely or unthoroughly studied before for improving keypoint detection and grouping (keypoint regression) performance.
First, we exploit the keypoint heatmaps for pixel-wise keypoint regression instead of separating them for improving keypoint regression.
Second, we adopt a pixel-wise spatial transformer network to learn adaptive representations for handling the scale and orientation variance.
Third, we present a joint shape and heatvalue scoring scheme to promote the estimated poses that are more likely to be true poses.
arXiv Detail & Related papers (2020-06-28T01:14:59Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.