Attentive One-Dimensional Heatmap Regression for Facial Landmark
Detection and Tracking
- URL: http://arxiv.org/abs/2004.02108v7
- Date: Thu, 27 Aug 2020 13:54:22 GMT
- Title: Attentive One-Dimensional Heatmap Regression for Facial Landmark
Detection and Tracking
- Authors: Shi Yin, Shangfei Wang, Xiaoping Chen, Enhong Chen
- Abstract summary: We propose a novel attentive one-dimensional heatmap regression method for facial landmark localization.
First, we predict two groups of 1D heatmaps to represent the marginal distributions of the x and y coordinates.
Second, a co-attention mechanism is adopted to model the inherent spatial patterns existing in x and y coordinates.
Third, based on the 1D heatmap structures, we propose a facial landmark detector capturing spatial patterns for landmark detection on an image.
- Score: 73.35078496883125
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Although heatmap regression is considered a state-of-the-art method to locate
facial landmarks, it suffers from huge spatial complexity and is prone to
quantization error. To address this, we propose a novel attentive
one-dimensional heatmap regression method for facial landmark localization.
First, we predict two groups of 1D heatmaps to represent the marginal
distributions of the x and y coordinates. These 1D heatmaps reduce spatial
complexity significantly compared to current heatmap regression methods, which
use 2D heatmaps to represent the joint distributions of x and y coordinates.
With much lower spatial complexity, the proposed method can output
high-resolution 1D heatmaps despite limited GPU memory, significantly
alleviating the quantization error. Second, a co-attention mechanism is adopted
to model the inherent spatial patterns existing in x and y coordinates, and
therefore the joint distributions on the x and y axes are also captured. Third,
based on the 1D heatmap structures, we propose a facial landmark detector
capturing spatial patterns for landmark detection on an image; and a tracker
further capturing temporal patterns with a temporal refinement mechanism for
landmark tracking. Experimental results on four benchmark databases demonstrate
the superiority of our method.
Related papers
- Precise Facial Landmark Detection by Reference Heatmap Transformer [52.417964103227696]
We propose a novel Reference Heatmap Transformer (RHT) for more precise facial landmark detection.
The experimental results from challenging benchmark datasets demonstrate that our proposed method outperforms the state-of-the-art methods in the literature.
arXiv Detail & Related papers (2023-03-14T12:26:48Z) - Subpixel Heatmap Regression for Facial Landmark Localization [65.41270740933656]
Heatmap regression approaches suffer from discretization-induced errors related to both the heatmap encoding and decoding process.
We propose a new approach for the heatmap encoding and decoding process by leveraging the underlying continuous distribution.
Our approach offers noticeable gains across multiple datasets setting a new state-of-the-art result in facial landmark localization.
arXiv Detail & Related papers (2021-11-03T17:21:28Z) - Is 2D Heatmap Representation Even Necessary for Human Pose Estimation? [44.313782042852246]
We propose a textbfSimple yet promising textbfDisentangled textbfRepresentation for keypoint coordinate (emphSimDR)
In detail, we propose to disentangle the representation of horizontal and vertical coordinates for keypoint location, leading to a more efficient scheme without extra upsampling and refinement.
arXiv Detail & Related papers (2021-07-07T16:20:12Z) - Heatmap Regression via Randomized Rounding [105.75014893647538]
We propose a simple yet effective quantization system to address the sub-pixel localization problem.
The proposed system encodes the fractional part of numerical coordinates into the ground truth heatmap using a probabilistic approach during training.
arXiv Detail & Related papers (2020-09-01T04:54:22Z) - Graph-PCNN: Two Stage Human Pose Estimation with Graph Pose Refinement [54.29252286561449]
We propose a two-stage graph-based and model-agnostic framework, called Graph-PCNN.
In the first stage, heatmap regression network is applied to obtain a rough localization result, and a set of proposal keypoints, called guided points, are sampled.
In the second stage, for each guided point, different visual feature is extracted by the localization.
The relationship between guided points is explored by the graph pose refinement module to get more accurate localization results.
arXiv Detail & Related papers (2020-07-21T04:59:15Z) - A Transfer Learning approach to Heatmap Regression for Action Unit
intensity estimation [50.261472059743845]
Action Units (AUs) are geometrically-based atomic facial muscle movements.
We propose a novel AU modelling problem that consists of jointly estimating their localisation and intensity.
A Heatmap models whether an AU occurs or not at a given spatial location.
arXiv Detail & Related papers (2020-04-14T16:51:13Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.