Region attention and graph embedding network for occlusion objective
class-based micro-expression recognition
- URL: http://arxiv.org/abs/2107.05904v1
- Date: Tue, 13 Jul 2021 08:04:03 GMT
- Title: Region attention and graph embedding network for occlusion objective
class-based micro-expression recognition
- Authors: Qirong Mao, Ling Zhou, Wenming Zheng, Xiuyan Shao, Xiaohua Huang
- Abstract summary: Micro-expression recognition (textbfMER) has attracted lots of researchers' attention in a decade.
This paper deeply investigates an interesting but unexplored challenging issue in MER, ie, occlusion MER.
A underlineRegion-inspired underlineRelation underlineReasoning underlineNetwork (textbfRRRN) is proposed to model relations between various facial regions.
- Score: 26.5638344747854
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Micro-expression recognition (\textbf{MER}) has attracted lots of
researchers' attention in a decade. However, occlusion will occur for MER in
real-world scenarios. This paper deeply investigates an interesting but
unexplored challenging issue in MER, \ie, occlusion MER. First, to research MER
under real-world occlusion, synthetic occluded micro-expression databases are
created by using various mask for the community. Second, to suppress the
influence of occlusion, a \underline{R}egion-inspired \underline{R}elation
\underline{R}easoning \underline{N}etwork (\textbf{RRRN}) is proposed to model
relations between various facial regions. RRRN consists of a backbone network,
the Region-Inspired (\textbf{RI}) module and Relation Reasoning (\textbf{RR})
module. More specifically, the backbone network aims at extracting feature
representations from different facial regions, RI module computing an adaptive
weight from the region itself based on attention mechanism with respect to the
unobstructedness and importance for suppressing the influence of occlusion, and
RR module exploiting the progressive interactions among these regions by
performing graph convolutions. Experiments are conducted on handout-database
evaluation and composite database evaluation tasks of MEGC 2018 protocol.
Experimental results show that RRRN can significantly explore the importance of
facial regions and capture the cooperative complementary relationship of facial
regions for MER. The results also demonstrate RRRN outperforms the
state-of-the-art approaches, especially on occlusion, and RRRN acts more robust
to occlusion.
Related papers
- Global Occlusion-Aware Transformer for Robust Stereo Matching [11.655465312241699]
This paper introduces a novel attention-based stereo-matching network called Global Occlusion-Aware Transformer (GOAT)
GOAT exploits long-range dependency and occlusion-awareness global context for disparity estimation.
The proposed GOAT demonstrates outstanding performance among all benchmarks, particularly in the occluded regions.
arXiv Detail & Related papers (2023-12-22T12:34:58Z) - Rotated Multi-Scale Interaction Network for Referring Remote Sensing Image Segmentation [63.15257949821558]
Referring Remote Sensing Image (RRSIS) is a new challenge that combines computer vision and natural language processing.
Traditional Referring Image (RIS) approaches have been impeded by the complex spatial scales and orientations found in aerial imagery.
We introduce the Rotated Multi-Scale Interaction Network (RMSIN), an innovative approach designed for the unique demands of RRSIS.
arXiv Detail & Related papers (2023-12-19T08:14:14Z) - Improving Vision Anomaly Detection with the Guidance of Language
Modality [64.53005837237754]
This paper tackles the challenges for vision modality from a multimodal point of view.
We propose Cross-modal Guidance (CMG) to tackle the redundant information issue and sparse space issue.
To learn a more compact latent space for the vision anomaly detector, CMLE learns a correlation structure matrix from the language modality.
arXiv Detail & Related papers (2023-10-04T13:44:56Z) - Region Generation and Assessment Network for Occluded Person
Re-Identification [43.49129366128688]
Person Re-identification (ReID) plays a more and more crucial role in recent years with a wide range of applications.
Most methods tackle such challenges by utilizing external tools to locate body parts or exploiting matching strategies.
We propose a Region Generation and Assessment Network (RGANet) to effectively and efficiently detect the human body regions.
arXiv Detail & Related papers (2023-09-07T08:41:47Z) - Learning Federated Visual Prompt in Null Space for MRI Reconstruction [83.71117888610547]
We propose a new algorithm, FedPR, to learn federated visual prompts in the null space of global prompt for MRI reconstruction.
FedPR significantly outperforms state-of-the-art FL algorithms with 6% of communication costs when given the limited amount of local training data.
arXiv Detail & Related papers (2023-03-28T17:46:16Z) - Cross-Modal Causal Intervention for Medical Report Generation [109.83549148448469]
Medical report generation (MRG) is essential for computer-aided diagnosis and medication guidance.
Due to the spurious correlations within image-text data induced by visual and linguistic biases, it is challenging to generate accurate reports reliably describing lesion areas.
We propose a novel Visual-Linguistic Causal Intervention (VLCI) framework for MRG, which consists of a visual deconfounding module (VDM) and a linguistic deconfounding module (LDM)
arXiv Detail & Related papers (2023-03-16T07:23:55Z) - PS-ARM: An End-to-End Attention-aware Relation Mixer Network for Person
Search [56.02761592710612]
We propose a novel attention-aware relation mixer (ARM) for module person search.
Our ARM module is native and does not rely on fine-grained supervision or topological assumptions.
Our PS-ARM achieves state-of-the-art performance on both datasets.
arXiv Detail & Related papers (2022-10-07T10:04:12Z) - Reasoning Structural Relation for Occlusion-Robust Facial Landmark
Localization [5.171058506312429]
This paper proposes a structural relation network (SRN) for occlusion-robust landmark localization.
Unlike most existing methods that simply exploit the shape constraint, the proposed SRN aims to capture the structural relations among different facial components.
The proposed method achieves outstanding performance on occluded and masked faces.
arXiv Detail & Related papers (2021-12-19T08:27:37Z) - Unsupervised Instance Segmentation in Microscopy Images via Panoptic
Domain Adaptation and Task Re-weighting [86.33696045574692]
We propose a Cycle Consistency Panoptic Domain Adaptive Mask R-CNN (CyC-PDAM) architecture for unsupervised nuclei segmentation in histopathology images.
We first propose a nuclei inpainting mechanism to remove the auxiliary generated objects in the synthesized images.
Secondly, a semantic branch with a domain discriminator is designed to achieve panoptic-level domain adaptation.
arXiv Detail & Related papers (2020-05-05T11:08:26Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.