Feature Decomposition and Reconstruction Learning for Effective Facial
Expression Recognition
- URL: http://arxiv.org/abs/2104.05160v1
- Date: Mon, 12 Apr 2021 02:22:45 GMT
- Title: Feature Decomposition and Reconstruction Learning for Effective Facial
Expression Recognition
- Authors: Delian Ruan and YanYan and Shenqi Lai and Zhenhua Chai and Chunhua
Shen and Hanzi Wang
- Abstract summary: We propose a novel Feature Decomposition and Reconstruction Learning (FDRL) method for effective facial expression recognition.
FDRL consists of two crucial networks: a Feature Decomposition Network (FDN) and a Feature Reconstruction Network (FRN)
- Score: 80.17419621762866
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In this paper, we propose a novel Feature Decomposition and Reconstruction
Learning (FDRL) method for effective facial expression recognition. We view the
expression information as the combination of the shared information (expression
similarities) across different expressions and the unique information
(expression-specific variations) for each expression. More specifically, FDRL
mainly consists of two crucial networks: a Feature Decomposition Network (FDN)
and a Feature Reconstruction Network (FRN). In particular, FDN first decomposes
the basic features extracted from a backbone network into a set of facial
action-aware latent features to model expression similarities. Then, FRN
captures the intra-feature and inter-feature relationships for latent features
to characterize expression-specific variations, and reconstructs the expression
feature. To this end, two modules including an intra-feature relation modeling
module and an inter-feature relation modeling module are developed in FRN.
Experimental results on both the in-the-lab databases (including CK+, MMI, and
Oulu-CASIA) and the in-the-wild databases (including RAF-DB and SFEW) show that
the proposed FDRL method consistently achieves higher recognition accuracy than
several state-of-the-art methods. This clearly highlights the benefit of
feature decomposition and reconstruction for classifying expressions.
Related papers
- Compound Expression Recognition via Multi Model Ensemble for the ABAW7 Challenge [6.26485278174662]
Compound Expression Recognition (CER) is vital for effective interpersonal interactions.
In this paper, we propose an ensemble learning-based solution to address this complexity.
Our method demonstrates high accuracy on the RAF-DB datasets and is capable of recognizing expressions in certain portions of the C-EXPR-DB through zero-shot learning.
arXiv Detail & Related papers (2024-07-17T01:59:34Z) - Compound Expression Recognition via Multi Model Ensemble [8.529105068848828]
Compound Expression Recognition plays a crucial role in interpersonal interactions.
We propose a solution based on ensemble learning methods for Compound Expression Recognition.
Our method achieves high accuracy on RAF-DB and is able to recognize expressions through zero-shot on certain portions of C-EXPR-DB.
arXiv Detail & Related papers (2024-03-19T09:30:56Z) - Feature Decoupling-Recycling Network for Fast Interactive Segmentation [79.22497777645806]
Recent interactive segmentation methods iteratively take source image, user guidance and previously predicted mask as the input.
We propose the Feature Decoupling-Recycling Network (FDRN), which decouples the modeling components based on their intrinsic discrepancies.
arXiv Detail & Related papers (2023-08-07T12:26:34Z) - Prior Information based Decomposition and Reconstruction Learning for
Micro-Expression Recognition [45.46357824529522]
Prior information can guide the model to learn discriminative ME features effectively.
This paper proposes a Decomposition and Reconstruction-based Graph Representation Learning model to learn ME movement features.
arXiv Detail & Related papers (2023-03-03T08:34:28Z) - Learning Diversified Feature Representations for Facial Expression
Recognition in the Wild [97.14064057840089]
We propose a mechanism to diversify the features extracted by CNN layers of state-of-the-art facial expression recognition architectures.
Experimental results on three well-known facial expression recognition in-the-wild datasets, AffectNet, FER+, and RAF-DB, show the effectiveness of our method.
arXiv Detail & Related papers (2022-10-17T19:25:28Z) - Spatio-Temporal Representation Factorization for Video-based Person
Re-Identification [55.01276167336187]
We propose Spatio-Temporal Representation Factorization module (STRF) for re-ID.
STRF is a flexible new computational unit that can be used in conjunction with most existing 3D convolutional neural network architectures for re-ID.
We empirically show that STRF improves performance of various existing baseline architectures while demonstrating new state-of-the-art results.
arXiv Detail & Related papers (2021-07-25T19:29:37Z) - Out-of-distribution Generalization via Partial Feature Decorrelation [72.96261704851683]
We present a novel Partial Feature Decorrelation Learning (PFDL) algorithm, which jointly optimize a feature decomposition network and the target image classification model.
The experiments on real-world datasets demonstrate that our method can improve the backbone model's accuracy on OOD image classification datasets.
arXiv Detail & Related papers (2020-07-30T05:48:48Z) - Relational Deep Feature Learning for Heterogeneous Face Recognition [17.494718795454055]
We propose a graph-structured module called Graph Module (NIR) that extracts global relational information in addition to general facial features.
The proposed method outperforms other state-of-the-art methods on five Heterogeneous Face Recognition (HFR) databases.
arXiv Detail & Related papers (2020-03-02T07:35:23Z) - Joint Deep Learning of Facial Expression Synthesis and Recognition [97.19528464266824]
We propose a novel joint deep learning of facial expression synthesis and recognition method for effective FER.
The proposed method involves a two-stage learning procedure. Firstly, a facial expression synthesis generative adversarial network (FESGAN) is pre-trained to generate facial images with different facial expressions.
In order to alleviate the problem of data bias between the real images and the synthetic images, we propose an intra-class loss with a novel real data-guided back-propagation (RDBP) algorithm.
arXiv Detail & Related papers (2020-02-06T10:56:00Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.