Suppressing Uncertainties for Large-Scale Facial Expression Recognition
- URL: http://arxiv.org/abs/2002.10392v2
- Date: Fri, 6 Mar 2020 09:57:28 GMT
- Title: Suppressing Uncertainties for Large-Scale Facial Expression Recognition
- Authors: Kai Wang, Xiaojiang Peng, Jianfei Yang, Shijian Lu, Yu Qiao
- Abstract summary: This paper proposes a simple yet efficient Self-Cure Network (SCN) which suppresses the uncertainties efficiently and prevents deep networks from over-fitting uncertain facial images.
Results on public benchmarks demonstrate that our SCN outperforms current state-of-the-art methods with textbf88.14% on RAF-DB, textbf60.23% on AffectNet, and textbf89.35% on FERPlus.
- Score: 81.51495681011404
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Annotating a qualitative large-scale facial expression dataset is extremely
difficult due to the uncertainties caused by ambiguous facial expressions,
low-quality facial images, and the subjectiveness of annotators. These
uncertainties lead to a key challenge of large-scale Facial Expression
Recognition (FER) in deep learning era. To address this problem, this paper
proposes a simple yet efficient Self-Cure Network (SCN) which suppresses the
uncertainties efficiently and prevents deep networks from over-fitting
uncertain facial images. Specifically, SCN suppresses the uncertainty from two
different aspects: 1) a self-attention mechanism over mini-batch to weight each
training sample with a ranking regularization, and 2) a careful relabeling
mechanism to modify the labels of these samples in the lowest-ranked group.
Experiments on synthetic FER datasets and our collected WebEmotion dataset
validate the effectiveness of our method. Results on public benchmarks
demonstrate that our SCN outperforms current state-of-the-art methods with
\textbf{88.14}\% on RAF-DB, \textbf{60.23}\% on AffectNet, and \textbf{89.35}\%
on FERPlus. The code will be available at
\href{https://github.com/kaiwang960112/Self-Cure-Network}{https://github.com/kaiwang960112/Self-Cure-Network}.
Related papers
- Noisy Correspondence Learning with Self-Reinforcing Errors Mitigation [63.180725016463974]
Cross-modal retrieval relies on well-matched large-scale datasets that are laborious in practice.
We introduce a novel noisy correspondence learning framework, namely textbfSelf-textbfReinforcing textbfErrors textbfMitigation (SREM)
arXiv Detail & Related papers (2023-12-27T09:03:43Z) - RARE: Robust Masked Graph Autoencoder [45.485891794905946]
Masked graph autoencoder (MGAE) has emerged as a promising self-supervised graph pre-training (SGP) paradigm.
We propose a novel SGP method termed Robust mAsked gRaph autoEncoder (RARE) to improve the certainty in inferring masked data.
arXiv Detail & Related papers (2023-04-04T03:35:29Z) - Attribute-preserving Face Dataset Anonymization via Latent Code
Optimization [64.4569739006591]
We present a task-agnostic anonymization procedure that directly optimize the images' latent representation in the latent space of a pre-trained GAN.
We demonstrate through a series of experiments that our method is capable of anonymizing the identity of the images whilst -- crucially -- better-preserving the facial attributes.
arXiv Detail & Related papers (2023-03-20T17:34:05Z) - Cluster-level pseudo-labelling for source-free cross-domain facial
expression recognition [94.56304526014875]
We propose the first Source-Free Unsupervised Domain Adaptation (SFUDA) method for Facial Expression Recognition (FER)
Our method exploits self-supervised pretraining to learn good feature representations from the target data.
We validate the effectiveness of our method in four adaptation setups, proving that it consistently outperforms existing SFUDA methods when applied to FER.
arXiv Detail & Related papers (2022-10-11T08:24:50Z) - Robust and Precise Facial Landmark Detection by Self-Calibrated Pose
Attention Network [73.56802915291917]
We propose a semi-supervised framework to achieve more robust and precise facial landmark detection.
A Boundary-Aware Landmark Intensity (BALI) field is proposed to model more effective facial shape constraints.
A Self-Calibrated Pose Attention (SCPA) model is designed to provide a self-learned objective function that enforces intermediate supervision.
arXiv Detail & Related papers (2021-12-23T02:51:08Z) - Towards robustness under occlusion for face recognition [0.0]
In this paper, we evaluate the effects of occlusions in the performance of a face recognition pipeline that uses a ResNet backbone.
We designed 8 different occlusion masks which were applied to the input images.
In order to increase robustness under occlusions, we followed two approaches. The first is image inpainting using the pre-trained pluralistic image completion network.
The second is Cutmix, a regularization strategy consisting of mixing training images and their labels using rectangular patches.
arXiv Detail & Related papers (2021-09-19T08:27:57Z) - Consensual Collaborative Training And Knowledge Distillation Based
Facial Expression Recognition Under Noisy Annotations [2.538209532048867]
This work proposes an effective training strategy in the presence of noisy labels, called as Consensual Collaborative Training (CCT) framework.
CCT co-trains three networks jointly using a convex combination of supervision loss and consistency loss.
State-of-the-art performance is reported on the benchmark FER datasets RAFDB (90.84%), FERPlus (89.99%) and AffectNet (66%)
arXiv Detail & Related papers (2021-07-10T03:37:06Z) - Robust Self-Ensembling Network for Hyperspectral Image Classification [38.84831094095329]
We propose a robust self-ensembling network (RSEN) to address this problem.
The proposed RSEN consists of twoworks including a base network and an ensemble network.
We show that the proposed algorithm can yield competitive performance compared with the state-of-the-art methods.
arXiv Detail & Related papers (2021-04-08T13:33:14Z) - The FaceChannel: A Fast & Furious Deep Neural Network for Facial
Expression Recognition [71.24825724518847]
Current state-of-the-art models for automatic Facial Expression Recognition (FER) are based on very deep neural networks that are effective but rather expensive to train.
We formalize the FaceChannel, a light-weight neural network that has much fewer parameters than common deep neural networks.
We demonstrate how our model achieves a comparable, if not better, performance to the current state-of-the-art in FER.
arXiv Detail & Related papers (2020-09-15T09:25:37Z) - Semantic Neighborhood-Aware Deep Facial Expression Recognition [14.219890078312536]
A novel method is proposed to formulate semantic perturbation and select unreliable samples during training.
Experiments show the effectiveness of the proposed method and state-of-the-art results are reported.
arXiv Detail & Related papers (2020-04-27T11:48:17Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.