A Transfer Learning approach to Heatmap Regression for Action Unit
intensity estimation
- URL: http://arxiv.org/abs/2004.06657v1
- Date: Tue, 14 Apr 2020 16:51:13 GMT
- Title: A Transfer Learning approach to Heatmap Regression for Action Unit
intensity estimation
- Authors: Ioanna Ntinou and Enrique Sanchez and Adrian Bulat and Michel Valstar
and Georgios Tzimiropoulos
- Abstract summary: Action Units (AUs) are geometrically-based atomic facial muscle movements.
We propose a novel AU modelling problem that consists of jointly estimating their localisation and intensity.
A Heatmap models whether an AU occurs or not at a given spatial location.
- Score: 50.261472059743845
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Action Units (AUs) are geometrically-based atomic facial muscle movements
known to produce appearance changes at specific facial locations. Motivated by
this observation we propose a novel AU modelling problem that consists of
jointly estimating their localisation and intensity. To this end, we propose a
simple yet efficient approach based on Heatmap Regression that merges both
problems into a single task. A Heatmap models whether an AU occurs or not at a
given spatial location. To accommodate the joint modelling of AUs intensity, we
propose variable size heatmaps, with their amplitude and size varying according
to the labelled intensity. Using Heatmap Regression, we can inherit from the
progress recently witnessed in facial landmark localisation. Building upon the
similarities between both problems, we devise a transfer learning approach
where we exploit the knowledge of a network trained on large-scale facial
landmark datasets. In particular, we explore different alternatives for
transfer learning through a) fine-tuning, b) adaptation layers, c) attention
maps, and d) reparametrisation. Our approach effectively inherits the rich
facial features produced by a strong face alignment network, with minimal extra
computational cost. We empirically validate that our system sets a new
state-of-the-art on three popular datasets, namely BP4D, DISFA, and FERA2017.
Related papers
- Precise Facial Landmark Detection by Reference Heatmap Transformer [52.417964103227696]
We propose a novel Reference Heatmap Transformer (RHT) for more precise facial landmark detection.
The experimental results from challenging benchmark datasets demonstrate that our proposed method outperforms the state-of-the-art methods in the literature.
arXiv Detail & Related papers (2023-03-14T12:26:48Z) - Sparse Local Patch Transformer for Robust Face Alignment and Landmarks
Inherent Relation Learning [11.150290581561725]
We propose a Sparse Local Patch Transformer (S) for learning the inherent relation.
The proposed method works at the state-of-the-art level with much less computational complexity.
arXiv Detail & Related papers (2022-03-13T01:15:23Z) - Subpixel Heatmap Regression for Facial Landmark Localization [65.41270740933656]
Heatmap regression approaches suffer from discretization-induced errors related to both the heatmap encoding and decoding process.
We propose a new approach for the heatmap encoding and decoding process by leveraging the underlying continuous distribution.
Our approach offers noticeable gains across multiple datasets setting a new state-of-the-art result in facial landmark localization.
arXiv Detail & Related papers (2021-11-03T17:21:28Z) - Multi-Branch Deep Radial Basis Function Networks for Facial Emotion
Recognition [80.35852245488043]
We propose a CNN based architecture enhanced with multiple branches formed by radial basis function (RBF) units.
RBF units capture local patterns shared by similar instances using an intermediate representation.
We show it is the incorporation of local information what makes the proposed model competitive.
arXiv Detail & Related papers (2021-09-07T21:05:56Z) - CAMERAS: Enhanced Resolution And Sanity preserving Class Activation
Mapping for image saliency [61.40511574314069]
Backpropagation image saliency aims at explaining model predictions by estimating model-centric importance of individual pixels in the input.
We propose CAMERAS, a technique to compute high-fidelity backpropagation saliency maps without requiring any external priors.
arXiv Detail & Related papers (2021-06-20T08:20:56Z) - Facial Action Unit Intensity Estimation via Semantic Correspondence
Learning with Dynamic Graph Convolution [27.48620879003556]
We present a new learning framework that automatically learns the latent relationships of AUs via establishing semantic correspondences between feature maps.
In the heatmap regression-based network, feature maps preserve rich semantic information associated with AU intensities and locations.
This motivates us to model the correlation among feature channels, which implicitly represents the co-occurrence relationship of AU intensity levels.
arXiv Detail & Related papers (2020-04-20T23:55:30Z) - Attentive One-Dimensional Heatmap Regression for Facial Landmark
Detection and Tracking [73.35078496883125]
We propose a novel attentive one-dimensional heatmap regression method for facial landmark localization.
First, we predict two groups of 1D heatmaps to represent the marginal distributions of the x and y coordinates.
Second, a co-attention mechanism is adopted to model the inherent spatial patterns existing in x and y coordinates.
Third, based on the 1D heatmap structures, we propose a facial landmark detector capturing spatial patterns for landmark detection on an image.
arXiv Detail & Related papers (2020-04-05T06:51:22Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.