Adaptively Enhancing Facial Expression Crucial Regions via Local
Non-Local Joint Network
- URL: http://arxiv.org/abs/2203.14045v2
- Date: Wed, 28 Feb 2024 03:13:16 GMT
- Title: Adaptively Enhancing Facial Expression Crucial Regions via Local
Non-Local Joint Network
- Authors: Guanghui Shi, Shasha Mao, Shuiping Gou, Dandan Yan, Licheng Jiao, Lin
Xiong
- Abstract summary: A local non-local joint network is proposed to adaptively light up the facial crucial regions in feature learning of Facial expression recognition.
The proposed method achieves more competitive performance compared with several state-of-the art methods on five benchmark datasets.
- Score: 37.665344656227624
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Facial expression recognition (FER) is still one challenging research due to
the small inter-class discrepancy in the facial expression data. In view of the
significance of facial crucial regions for FER, many existing researches
utilize the prior information from some annotated crucial points to improve the
performance of FER. However, it is complicated and time-consuming to manually
annotate facial crucial points, especially for vast wild expression images.
Based on this, a local non-local joint network is proposed to adaptively light
up the facial crucial regions in feature learning of FER in this paper. In the
proposed method, two parts are constructed based on facial local and non-local
information respectively, where an ensemble of multiple local networks are
proposed to extract local features corresponding to multiple facial local
regions and a non-local attention network is addressed to explore the
significance of each local region. Especially, the attention weights obtained
by the non-local network is fed into the local part to achieve the interactive
feedback between the facial global and local information. Interestingly, the
non-local weights corresponding to local regions are gradually updated and
higher weights are given to more crucial regions. Moreover, U-Net is employed
to extract the integrated features of deep semantic information and low
hierarchical detail information of expression images. Finally, experimental
results illustrate that the proposed method achieves more competitive
performance compared with several state-of-the art methods on five benchmark
datasets. Noticeably, the analyses of the non-local weights corresponding to
local regions demonstrate that the proposed method can automatically enhance
some crucial regions in the process of feature learning without any facial
landmark information.
Related papers
- Local and Global Feature Attention Fusion Network for Face Recognition [6.479154287764899]
We propose a Local and Global Feature Attention Fusion (LGAF) network based on feature quality.
The network adaptively allocates attention between local and global features according to feature quality and obtains more discriminative and high-quality face features.
Experimental results demonstrate that the LGAF achieves the best average performance on $4$ validation sets.
arXiv Detail & Related papers (2024-11-25T07:55:57Z) - Region-aware Distribution Contrast: A Novel Approach to Multi-Task Partially Supervised Learning [50.88504784466931]
Multi-task dense prediction involves semantic segmentation, depth estimation, and surface normal estimation.
Existing solutions typically rely on learning global image representations for global cross-task image matching.
Our proposal involves modeling region-wise representations using Gaussian Distributions.
arXiv Detail & Related papers (2024-03-15T12:41:30Z) - Region Generation and Assessment Network for Occluded Person
Re-Identification [43.49129366128688]
Person Re-identification (ReID) plays a more and more crucial role in recent years with a wide range of applications.
Most methods tackle such challenges by utilizing external tools to locate body parts or exploiting matching strategies.
We propose a Region Generation and Assessment Network (RGANet) to effectively and efficiently detect the human body regions.
arXiv Detail & Related papers (2023-09-07T08:41:47Z) - Region-Based Semantic Factorization in GANs [67.90498535507106]
We present a highly efficient algorithm to factorize the latent semantics learned by Generative Adversarial Networks (GANs) concerning an arbitrary image region.
Through an appropriately defined generalized Rayleigh quotient, we solve such a problem without any annotations or training.
Experimental results on various state-of-the-art GAN models demonstrate the effectiveness of our approach.
arXiv Detail & Related papers (2022-02-19T17:46:02Z) - Point-Level Region Contrast for Object Detection Pre-Training [147.47349344401806]
We present point-level region contrast, a self-supervised pre-training approach for the task of object detection.
Our approach performs contrastive learning by directly sampling individual point pairs from different regions.
Compared to an aggregated representation per region, our approach is more robust to the change in input region quality.
arXiv Detail & Related papers (2022-02-09T18:56:41Z) - An Entropy-guided Reinforced Partial Convolutional Network for Zero-Shot
Learning [77.72330187258498]
We propose a novel Entropy-guided Reinforced Partial Convolutional Network (ERPCNet)
ERPCNet extracts and aggregates localities based on semantic relevance and visual correlations without human-annotated regions.
It not only discovers global-cooperative localities dynamically but also converges faster for policy gradient optimization.
arXiv Detail & Related papers (2021-11-03T11:13:13Z) - Regional Attention Network (RAN) for Head Pose and Fine-grained Gesture
Recognition [9.131161856493486]
We propose a novel end-to-end textbfRegional Attention Network (RAN), which is a fully Convolutional Neural Network (CNN)
Our regions consist of one or more consecutive cells and are adapted from the strategies used in computing HOG (Histogram of Oriented Gradient) descriptor.
The proposed approach outperforms the state-of-the-art by a considerable margin in different metrics.
arXiv Detail & Related papers (2021-01-17T10:14:28Z) - Local Context Attention for Salient Object Segmentation [5.542044768017415]
We propose a novel Local Context Attention Network (LCANet) to generate locally reinforcement feature maps in a uniform representational architecture.
The proposed network introduces an Attentional Correlation Filter (ACF) module to generate explicit local attention by calculating the correlation feature map between coarse prediction and global context.
Comprehensive experiments are conducted on several salient object segmentation datasets, demonstrating the superior performance of the proposed LCANet against the state-of-the-art methods.
arXiv Detail & Related papers (2020-09-24T09:20:06Z) - Each Part Matters: Local Patterns Facilitate Cross-view Geo-localization [54.00111565818903]
Cross-view geo-localization is to spot images of the same geographic target from different platforms.
Existing methods usually concentrate on mining the fine-grained feature of the geographic target in the image center.
We introduce a simple and effective deep neural network, called Local Pattern Network (LPN), to take advantage of contextual information.
arXiv Detail & Related papers (2020-08-26T16:06:11Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.