Bottom-Up Human Pose Estimation by Ranking Heatmap-Guided Adaptive
Keypoint Estimates
- URL: http://arxiv.org/abs/2006.15480v1
- Date: Sun, 28 Jun 2020 01:14:59 GMT
- Title: Bottom-Up Human Pose Estimation by Ranking Heatmap-Guided Adaptive
Keypoint Estimates
- Authors: Ke Sun, Zigang Geng, Depu Meng, Bin Xiao, Dong Liu, Zhaoxiang Zhang,
Jingdong Wang
- Abstract summary: We present several schemes that are rarely or unthoroughly studied before for improving keypoint detection and grouping (keypoint regression) performance.
First, we exploit the keypoint heatmaps for pixel-wise keypoint regression instead of separating them for improving keypoint regression.
Second, we adopt a pixel-wise spatial transformer network to learn adaptive representations for handling the scale and orientation variance.
Third, we present a joint shape and heatvalue scoring scheme to promote the estimated poses that are more likely to be true poses.
- Score: 76.51095823248104
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The typical bottom-up human pose estimation framework includes two stages,
keypoint detection and grouping. Most existing works focus on developing
grouping algorithms, e.g., associative embedding, and pixel-wise keypoint
regression that we adopt in our approach. We present several schemes that are
rarely or unthoroughly studied before for improving keypoint detection and
grouping (keypoint regression) performance. First, we exploit the keypoint
heatmaps for pixel-wise keypoint regression instead of separating them for
improving keypoint regression. Second, we adopt a pixel-wise spatial
transformer network to learn adaptive representations for handling the scale
and orientation variance to further improve keypoint regression quality. Last,
we present a joint shape and heatvalue scoring scheme to promote the estimated
poses that are more likely to be true poses. Together with the tradeoff heatmap
estimation loss for balancing the background and keypoint pixels and thus
improving heatmap estimation quality, we get the state-of-the-art bottom-up
human pose estimation result. Code is available at
https://github.com/HRNet/HRNet-Bottom-up-Pose-Estimation.
Related papers
- Hybrid model for Single-Stage Multi-Person Pose Estimation [3.592448408054345]
We propose a hybrid model for single-stage multi-person pose estimation, named HybridPose.
It is capable of not only detecting densely placed keypoints, but also filtering the non-existent keypoints in an image.
arXiv Detail & Related papers (2023-05-02T02:55:29Z) - Poseur: Direct Human Pose Regression with Transformers [119.79232258661995]
We propose a direct, regression-based approach to 2D human pose estimation from single images.
Our framework is end-to-end differentiable, and naturally learns to exploit the dependencies between keypoints.
Ours is the first regression-based approach to perform favorably compared to the best heatmap-based pose estimation methods.
arXiv Detail & Related papers (2022-01-19T04:31:57Z) - Improving Robustness for Pose Estimation via Stable Heatmap Regression [19.108116394510258]
A heatmap regression method is proposed to alleviate network vulnerability to small perturbations.
A maximum stability training loss is used to simplify the optimization difficulty.
The proposed method achieves a significant advance in robustness over state-of-the-art approaches on two benchmark datasets.
arXiv Detail & Related papers (2021-05-08T03:07:05Z) - Bottom-Up Human Pose Estimation Via Disentangled Keypoint Regression [81.05772887221333]
We study the dense keypoint regression framework that is previously inferior to the keypoint detection and grouping framework.
We present a simple yet effective approach, named disentangled keypoint regression (DEKR)
We empirically show that the proposed direct regression method outperforms keypoint detection and grouping methods.
arXiv Detail & Related papers (2021-04-06T05:54:46Z) - Rethinking the Heatmap Regression for Bottom-up Human Pose Estimation [63.623787834984206]
We propose the scale-adaptive heatmap regression (SAHR) method, which can adaptively adjust the standard deviation for each keypoint.
SAHR may aggravate the imbalance between fore-background samples, which potentially hurts the improvement of SAHR.
We also introduce the weight-adaptive heatmap regression (WAHR) to help balance the fore-background samples.
arXiv Detail & Related papers (2020-12-30T14:39:41Z) - Graph-PCNN: Two Stage Human Pose Estimation with Graph Pose Refinement [54.29252286561449]
We propose a two-stage graph-based and model-agnostic framework, called Graph-PCNN.
In the first stage, heatmap regression network is applied to obtain a rough localization result, and a set of proposal keypoints, called guided points, are sampled.
In the second stage, for each guided point, different visual feature is extracted by the localization.
The relationship between guided points is explored by the graph pose refinement module to get more accurate localization results.
arXiv Detail & Related papers (2020-07-21T04:59:15Z) - Point-Set Anchors for Object Detection, Instance Segmentation and Pose
Estimation [85.96410825961966]
We argue that the image features extracted at a central point contain limited information for predicting distant keypoints or bounding box boundaries.
To facilitate inference, we propose to instead perform regression from a set of points placed at more advantageous positions.
We apply this proposed framework, called Point-Set Anchors, to object detection, instance segmentation, and human pose estimation.
arXiv Detail & Related papers (2020-07-06T15:59:56Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.