Graph-PCNN: Two Stage Human Pose Estimation with Graph Pose Refinement
- URL: http://arxiv.org/abs/2007.10599v1
- Date: Tue, 21 Jul 2020 04:59:15 GMT
- Title: Graph-PCNN: Two Stage Human Pose Estimation with Graph Pose Refinement
- Authors: Jian Wang, Xiang Long, Yuan Gao, Errui Ding, Shilei Wen
- Abstract summary: We propose a two-stage, graph-based, model-agnostic framework called Graph-PCNN.
In the first stage, a heatmap regression network is applied to obtain a rough localization result, and a set of proposal keypoints, called guided points, is sampled.
In the second stage, a different visual feature is extracted for each guided point by the localization subnet.
The relationship between guided points is explored by the graph pose refinement module to get more accurate localization results.
- Score: 54.29252286561449
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Recently, most of the state-of-the-art human pose estimation methods are
based on heatmap regression. The final coordinates of keypoints are obtained by
decoding the heatmap directly. In this paper, we aim to find a better approach to
get more accurate localization results. We mainly put forward two suggestions
for improvement: 1) different features and methods should be applied for rough
and accurate localization, and 2) the relationship between keypoints should be
considered. Specifically, we propose a two-stage graph-based and model-agnostic
framework, called Graph-PCNN, with a localization subnet and a graph pose
refinement module added onto the original heatmap regression network. In the
first stage, a heatmap regression network is applied to obtain a rough
localization result, and a set of proposal keypoints, called guided points, is
sampled. In the second stage, for each guided point, a different visual feature
is extracted by the localization subnet. The relationship between guided points
is explored by the graph pose refinement module to get more accurate
localization results. Experiments show that Graph-PCNN can be used in various
backbones to boost the performance by a large margin. Without bells and
whistles, our best model can achieve a new state-of-the-art 76.8% AP on COCO
test-dev split.
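
The first stage described in the abstract (decode a heatmap into a rough location, then sample guided points around it) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `decode_heatmap` and `sample_guided_points`, the Gaussian sampling scheme, the `sigma` value, and the toy heatmap are all assumptions chosen for illustration, and the abstract does not specify how guided points are actually sampled.

```python
import random


def decode_heatmap(heatmap):
    """Return the (x, y) index of the maximum response in a 2D heatmap.

    `heatmap` is a list of rows; this is plain argmax decoding.
    """
    best_x, best_y, best_val = 0, 0, float("-inf")
    for y, row in enumerate(heatmap):
        for x, val in enumerate(row):
            if val > best_val:
                best_x, best_y, best_val = x, y, val
    return best_x, best_y


def sample_guided_points(center, num_points, sigma, width, height, rng):
    """Draw proposal points around a rough keypoint location.

    Assumption: points are drawn from an isotropic Gaussian centred on the
    rough location and clipped to the heatmap bounds. The paper may use a
    different sampling rule.
    """
    cx, cy = center
    points = []
    for _ in range(num_points):
        x = min(max(rng.gauss(cx, sigma), 0.0), width - 1.0)
        y = min(max(rng.gauss(cy, sigma), 0.0), height - 1.0)
        points.append((x, y))
    return points


# Toy 3x4 heatmap for a single keypoint (hypothetical values).
heatmap = [
    [0.0, 0.1, 0.0, 0.0],
    [0.1, 0.3, 0.9, 0.2],
    [0.0, 0.2, 0.4, 0.1],
]

rough = decode_heatmap(heatmap)  # (2, 1): column 2, row 1 holds the peak 0.9
guided = sample_guided_points(
    rough, num_points=5, sigma=0.5, width=4, height=3, rng=random.Random(0)
)
```

In the framework the abstract describes, each guided point would then receive its own visual feature from the localization subnet, and the graph pose refinement module would relate the points to one another; neither step is sketched here because the abstract gives no further detail.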
Related papers
- KGpose: Keypoint-Graph Driven End-to-End Multi-Object 6D Pose Estimation via Point-Wise Pose Voting [0.0]
KGpose is an end-to-end framework for 6D pose estimation of multiple objects.
Our approach combines a keypoint-based method with learnable pose regression through a keypoint-graph.
arXiv Detail & Related papers (2024-07-12T01:06:00Z)
- Graph R-CNN: Towards Accurate 3D Object Detection with Semantic-Decorated Local Graph [26.226885108862735]
Two-stage detectors have gained much popularity in 3D object detection.
Most two-stage 3D detectors utilize grid points, voxel grids, or sampled keypoints for RoI feature extraction in the second stage.
This paper solves this problem in three aspects.
arXiv Detail & Related papers (2022-08-07T02:56:56Z)
- Poseur: Direct Human Pose Regression with Transformers [119.79232258661995]
We propose a direct, regression-based approach to 2D human pose estimation from single images.
Our framework is end-to-end differentiable, and naturally learns to exploit the dependencies between keypoints.
Ours is the first regression-based approach to perform favorably compared to the best heatmap-based pose estimation methods.
arXiv Detail & Related papers (2022-01-19T04:31:57Z)
- Is 2D Heatmap Representation Even Necessary for Human Pose Estimation? [44.313782042852246]
We propose a Simple yet promising Disentangled Representation for keypoint coordinates (SimDR).
In detail, we propose to disentangle the representation of horizontal and vertical coordinates for keypoint location, leading to a more efficient scheme without extra upsampling and refinement.
arXiv Detail & Related papers (2021-07-07T16:20:12Z)
- Bottom-Up Human Pose Estimation Via Disentangled Keypoint Regression [81.05772887221333]
We study the dense keypoint regression framework, which was previously inferior to the keypoint detection and grouping framework.
We present a simple yet effective approach, named disentangled keypoint regression (DEKR).
We empirically show that the proposed direct regression method outperforms keypoint detection and grouping methods.
arXiv Detail & Related papers (2021-04-06T05:54:46Z)
- Heatmap Regression via Randomized Rounding [105.75014893647538]
We propose a simple yet effective quantization system to address the sub-pixel localization problem.
The proposed system encodes the fractional part of numerical coordinates into the ground truth heatmap using a probabilistic approach during training.
arXiv Detail & Related papers (2020-09-01T04:54:22Z)
- Bottom-Up Human Pose Estimation by Ranking Heatmap-Guided Adaptive Keypoint Estimates [76.51095823248104]
We present several schemes, rarely or not thoroughly studied before, for improving keypoint detection and grouping (keypoint regression) performance.
First, we exploit the keypoint heatmaps for pixel-wise keypoint regression instead of separating them for improving keypoint regression.
Second, we adopt a pixel-wise spatial transformer network to learn adaptive representations for handling the scale and orientation variance.
Third, we present a joint shape and heatvalue scoring scheme to promote the estimated poses that are more likely to be true poses.
arXiv Detail & Related papers (2020-06-28T01:14:59Z)
- Pixel-in-Pixel Net: Towards Efficient Facial Landmark Detection in the Wild [104.61677518999976]
We propose Pixel-in-Pixel Net (PIPNet) for facial landmark detection.
The proposed model is equipped with a novel detection head based on heatmap regression.
To further improve the cross-domain generalization capability of PIPNet, we propose self-training with curriculum.
arXiv Detail & Related papers (2020-03-08T12:23:42Z)