Subpixel Heatmap Regression for Facial Landmark Localization
- URL: http://arxiv.org/abs/2111.02360v1
- Date: Wed, 3 Nov 2021 17:21:28 GMT
- Title: Subpixel Heatmap Regression for Facial Landmark Localization
- Authors: Adrian Bulat and Enrique Sanchez and Georgios Tzimiropoulos
- Abstract summary: Heatmap regression approaches suffer from discretization-induced errors related to both the heatmap encoding and decoding process.
We propose a new approach for the heatmap encoding and decoding process by leveraging the underlying continuous distribution.
Our approach offers noticeable gains across multiple datasets setting a new state-of-the-art result in facial landmark localization.
- Score: 65.41270740933656
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Deep Learning models based on heatmap regression have revolutionized the task
of facial landmark localization with existing models working robustly under
large poses, non-uniform illumination and shadows, occlusions and
self-occlusions, low resolution and blur. However, despite their wide adoption,
heatmap regression approaches suffer from discretization-induced errors related
to both the heatmap encoding and decoding process. In this work we show that
these errors have a surprisingly large negative impact on facial alignment
accuracy. To alleviate this problem, we propose a new approach for the heatmap
encoding and decoding process by leveraging the underlying continuous
distribution. To take full advantage of the newly proposed encoding-decoding
mechanism, we also introduce a Siamese-based training that enforces heatmap
consistency across various geometric image transformations. Our approach offers
noticeable gains across multiple datasets setting a new state-of-the-art result
in facial landmark localization. Code alongside the pretrained models will be
made available at https://www.adrianbulat.com/face-alignment
Related papers
- CAMERAS: Enhanced Resolution And Sanity preserving Class Activation
Mapping for image saliency [61.40511574314069]
Backpropagation image saliency aims at explaining model predictions by estimating model-centric importance of individual pixels in the input.
We propose CAMERAS, a technique to compute high-fidelity backpropagation saliency maps without requiring any external priors.
arXiv Detail & Related papers (2021-06-20T08:20:56Z) - Sub-pixel face landmarks using heatmaps and a bag of tricks [0.0]
We present our heatmap regression approach to accurately localize face landmarks.
We use five na"ive face landmarks from a publicly available face detector to position and align the face.
We also show that it is possible to reduce the upscaling complexity by using a mixture of deconvolution and pixel-shuffle layers.
arXiv Detail & Related papers (2021-03-04T14:34:20Z) - Heatmap Regression via Randomized Rounding [105.75014893647538]
We propose a simple yet effective quantization system to address the sub-pixel localization problem.
The proposed system encodes the fractional part of numerical coordinates into the ground truth heatmap using a probabilistic approach during training.
arXiv Detail & Related papers (2020-09-01T04:54:22Z) - A Transfer Learning approach to Heatmap Regression for Action Unit
intensity estimation [50.261472059743845]
Action Units (AUs) are geometrically-based atomic facial muscle movements.
We propose a novel AU modelling problem that consists of jointly estimating their localisation and intensity.
A Heatmap models whether an AU occurs or not at a given spatial location.
arXiv Detail & Related papers (2020-04-14T16:51:13Z) - Attentive One-Dimensional Heatmap Regression for Facial Landmark
Detection and Tracking [73.35078496883125]
We propose a novel attentive one-dimensional heatmap regression method for facial landmark localization.
First, we predict two groups of 1D heatmaps to represent the marginal distributions of the x and y coordinates.
Second, a co-attention mechanism is adopted to model the inherent spatial patterns existing in x and y coordinates.
Third, based on the 1D heatmap structures, we propose a facial landmark detector capturing spatial patterns for landmark detection on an image.
arXiv Detail & Related papers (2020-04-05T06:51:22Z) - Image Fine-grained Inpainting [89.17316318927621]
We present a one-stage model that utilizes dense combinations of dilated convolutions to obtain larger and more effective receptive fields.
To better train this efficient generator, except for frequently-used VGG feature matching loss, we design a novel self-guided regression loss.
We also employ a discriminator with local and global branches to ensure local-global contents consistency.
arXiv Detail & Related papers (2020-02-07T03:45:25Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.