Locality-aware Channel-wise Dropout for Occluded Face Recognition
- URL: http://arxiv.org/abs/2107.09270v1
- Date: Tue, 20 Jul 2021 05:53:14 GMT
- Title: Locality-aware Channel-wise Dropout for Occluded Face Recognition
- Authors: Mingjie He, Jie Zhang, Shiguang Shan, Xiao Liu, Zhongqin Wu, Xilin
Chen
- Abstract summary: Face recognition is a challenging task in unconstrained scenarios, especially when faces are partially occluded.
We propose a novel and elegant occlusion-simulation method via dropping the activations of a group of neurons in some elaborately selected channel.
Experiments on various benchmarks show that the proposed method outperforms state-of-the-art methods with a remarkable improvement.
- Score: 116.2355331029041
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Face recognition remains a challenging task in unconstrained scenarios,
especially when faces are partially occluded. To improve the robustness against
occlusion, augmenting the training images with artificial occlusions has been
proved as a useful approach. However, these artificial occlusions are commonly
generated by adding a black rectangle or several object templates including
sunglasses, scarfs and phones, which cannot well simulate the realistic
occlusions. In this paper, based on the argument that the occlusion essentially
damages a group of neurons, we propose a novel and elegant occlusion-simulation
method via dropping the activations of a group of neurons in some elaborately
selected channel. Specifically, we first employ a spatial regularization to
encourage each feature channel to respond to local and different face regions.
In this way, the activations affected by an occlusion in a local region are
more likely to be located in a single feature channel. Then, the locality-aware
channel-wise dropout (LCD) is designed to simulate the occlusion by dropping
out the entire feature channel. Furthermore, by randomly dropping out several
feature channels, our method can well simulate the occlusion of larger area.
The proposed LCD can encourage its succeeding layers to minimize the
intra-class feature variance caused by occlusions, thus leading to improved
robustness against occlusion. In addition, we design an auxiliary spatial
attention module by learning a channel-wise attention vector to reweight the
feature channels, which improves the contributions of non-occluded regions.
Extensive experiments on various benchmarks show that the proposed method
outperforms state-of-the-art methods with a remarkable improvement.
Related papers
- Robust Network Learning via Inverse Scale Variational Sparsification [55.64935887249435]
We introduce an inverse scale variational sparsification framework within a time-continuous inverse scale space formulation.
Unlike frequency-based methods, our approach not only removes noise by smoothing small-scale features.
We show the efficacy of our approach through enhanced robustness against various noise types.
arXiv Detail & Related papers (2024-09-27T03:17:35Z) - Improving Apple Object Detection with Occlusion-Enhanced Distillation [1.0049237739132246]
Apples growing in natural environments often face severe visual obstructions from leaves and branches.
We introduce a technique called "Occlusion-Enhanced Distillation" (OED) to regularize the learning of semantically aligned features on occluded datasets.
Our method significantly outperforms current state-of-the-art techniques through extensive comparative experiments.
arXiv Detail & Related papers (2024-09-03T03:11:48Z) - DPMesh: Exploiting Diffusion Prior for Occluded Human Mesh Recovery [71.6345505427213]
DPMesh is an innovative framework for occluded human mesh recovery.
It capitalizes on the profound diffusion prior about object structure and spatial relationships embedded in a pre-trained text-to-image diffusion model.
arXiv Detail & Related papers (2024-04-01T18:59:13Z) - CLR-Face: Conditional Latent Refinement for Blind Face Restoration Using
Score-Based Diffusion Models [57.9771859175664]
Recent generative-prior-based methods have shown promising blind face restoration performance.
Generating fine-grained facial details faithful to inputs remains a challenging problem.
We introduce a diffusion-based-prior inside a VQGAN architecture that focuses on learning the distribution over uncorrupted latent embeddings.
arXiv Detail & Related papers (2024-02-08T23:51:49Z) - Occlusion-Robust FAU Recognition by Mining Latent Space of Masked
Autoencoders [23.39566752915331]
Facial action units (FAUs) are critical for fine-grained facial expression analysis.
New approach takes advantage of rich information from the latent space of masked autoencoder (MAE) and transforms it into FAU features.
FAUs can achieve comparable performance as state-of-the-art methods under normal conditions.
arXiv Detail & Related papers (2022-12-08T01:57:48Z) - Self-Supervised Robustifying Guidance for Monocular 3D Face
Reconstruction [14.203990541030445]
We propose a Self-Supervised RObustifying GUidancE (ROGUE) framework to obtain robustness against occlusions and noise in the face images.
The proposed network contains 1) the Guidance Pipeline to obtain the 3D face coefficients for the clean faces, and 2) the Robustification Pipeline to acquire the consistency between the estimated coefficients for occluded or noisy images and the clean counterpart.
On the three variations of the test dataset of CelebA, our method outperforms the current state-of-the-art method by large margins.
arXiv Detail & Related papers (2021-12-29T03:30:50Z) - Dynamic Facial Expression Recognition under Partial Occlusion with
Optical Flow Reconstruction [20.28462460359439]
We propose a new solution based on an auto-encoder with skip connections to reconstruct the occluded part of the face in the optical flow domain.
Our experiments show that the proposed method reduce significantly the gap, in terms of recognition accuracy, between occluded and non-occluded situations.
arXiv Detail & Related papers (2020-12-24T12:28:47Z) - OccInpFlow: Occlusion-Inpainting Optical Flow Estimation by Unsupervised
Learning [29.802404790103665]
Occlusion is an inevitable and critical problem in unsupervised optical flow learning.
We present OccInpFlow, an occlusion-inpainting framework to make full use of Occlusion regions.
We conduct experiments on leading flow benchmark data sets such as Flying Chairs, KITTI and MPI-Sintel.
arXiv Detail & Related papers (2020-06-30T10:01:32Z) - Occlusion-Adaptive Deep Network for Robust Facial Expression Recognition [56.11054589916299]
We propose a landmark-guided attention branch to find and discard corrupted features from occluded regions.
An attention map is first generated to indicate if a specific facial part is occluded and guide our model to attend to non-occluded regions.
This results in more diverse and discriminative features, enabling the expression recognition system to recover even though the face is partially occluded.
arXiv Detail & Related papers (2020-05-12T20:42:55Z) - Non-Local Part-Aware Point Cloud Denoising [55.50360085086123]
This paper presents a novel non-local part-aware deep neural network to denoise point clouds.
We design the non-local learning unit (NLU) customized with a graph attention module to adaptively capture non-local semantically-related features.
To enhance the denoising performance, we cascade a series of NLUs to progressively distill the noise features from the noisy inputs.
arXiv Detail & Related papers (2020-03-14T13:51:50Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.