Improving Robustness for Pose Estimation via Stable Heatmap Regression
- URL: http://arxiv.org/abs/2105.03569v1
- Date: Sat, 8 May 2021 03:07:05 GMT
- Title: Improving Robustness for Pose Estimation via Stable Heatmap Regression
- Authors: Yumeng Zhang, Li Chen, Yufeng Liu, Xiaoyan Guo, Wen Zheng, Junhai Yong
- Abstract summary: A heatmap regression method is proposed to alleviate network vulnerability to small perturbations.
A maximum stability training loss is used to simplify the optimization difficulty.
The proposed method achieves a significant advance in robustness over state-of-the-art approaches on two benchmark datasets.
- Score: 19.108116394510258
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Deep learning methods have achieved excellent performance in pose estimation,
but the lack of robustness causes the keypoints to change drastically between
similar images. In view of this problem, a stable heatmap regression method is
proposed to alleviate network vulnerability to small perturbations. We utilize
the correlation between different rows and columns in a heatmap to alleviate
the multi-peaks problem, and design a highly differentiated heatmap regression
to make a keypoint discriminative from surrounding points. A maximum stability
training loss is used to simplify the optimization difficulty when minimizing
the prediction gap of two similar images. The proposed method achieves a
significant advance in robustness over state-of-the-art approaches on two
benchmark datasets and maintains high performance.
Related papers
- Hybrid model for Single-Stage Multi-Person Pose Estimation [3.592448408054345]
We propose a hybrid model for single-stage multi-person pose estimation, named HybridPose.
It is capable of not only detecting densely placed keypoints, but also filtering the non-existent keypoints in an image.
arXiv Detail & Related papers (2023-05-02T02:55:29Z) - Heatmap Distribution Matching for Human Pose Estimation [12.524484316923333]
We show that optimizing the heatmap prediction in such a way, the model performance of body joint localization may not be consistently improved.
We propose to formulate the optimization of the heatmap prediction as a distribution matching problem between the predicted heatmap and the dot annotation of the body joint.
We show the effectiveness of our proposed method through extensive experiments on the COCO dataset and the MPII dataset.
arXiv Detail & Related papers (2022-10-03T07:21:30Z) - Poseur: Direct Human Pose Regression with Transformers [119.79232258661995]
We propose a direct, regression-based approach to 2D human pose estimation from single images.
Our framework is end-to-end differentiable, and naturally learns to exploit the dependencies between keypoints.
Ours is the first regression-based approach to perform favorably compared to the best heatmap-based pose estimation methods.
arXiv Detail & Related papers (2022-01-19T04:31:57Z) - Subpixel Heatmap Regression for Facial Landmark Localization [65.41270740933656]
Heatmap regression approaches suffer from discretization-induced errors related to both the heatmap encoding and decoding process.
We propose a new approach for the heatmap encoding and decoding process by leveraging the underlying continuous distribution.
Our approach offers noticeable gains across multiple datasets setting a new state-of-the-art result in facial landmark localization.
arXiv Detail & Related papers (2021-11-03T17:21:28Z) - Low-resolution Human Pose Estimation [49.531572116079026]
We propose a novel Confidence-Aware Learning (CAL) method for low-resolution pose estimation.
CAL addresses two fundamental limitations of existing offset learning methods: inconsistent training and testing, decoupled heatmap and offset learning.
Our method outperforms significantly the state-of-the-art methods for low-resolution human pose estimation.
arXiv Detail & Related papers (2021-09-19T09:13:57Z) - TFPose: Direct Human Pose Estimation with Transformers [83.03424247905869]
We formulate the pose estimation task into a sequence prediction problem that can effectively be solved by transformers.
Our framework is simple and direct, bypassing the drawbacks of the heatmap-based pose estimation.
Experiments on the MS-COCO and MPII datasets demonstrate that our method can significantly improve the state-of-the-art of regression-based pose estimation.
arXiv Detail & Related papers (2021-03-29T04:18:54Z) - Bottom-Up Human Pose Estimation by Ranking Heatmap-Guided Adaptive
Keypoint Estimates [76.51095823248104]
We present several schemes that are rarely or unthoroughly studied before for improving keypoint detection and grouping (keypoint regression) performance.
First, we exploit the keypoint heatmaps for pixel-wise keypoint regression instead of separating them for improving keypoint regression.
Second, we adopt a pixel-wise spatial transformer network to learn adaptive representations for handling the scale and orientation variance.
Third, we present a joint shape and heatvalue scoring scheme to promote the estimated poses that are more likely to be true poses.
arXiv Detail & Related papers (2020-06-28T01:14:59Z) - Deep Semantic Matching with Foreground Detection and Cycle-Consistency [103.22976097225457]
We address weakly supervised semantic matching based on a deep network.
We explicitly estimate the foreground regions to suppress the effect of background clutter.
We develop cycle-consistent losses to enforce the predicted transformations across multiple images to be geometrically plausible and consistent.
arXiv Detail & Related papers (2020-03-31T22:38:09Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.