NIR-to-VIS Face Recognition via Embedding Relations and Coordinates of
the Pairwise Features
- URL: http://arxiv.org/abs/2208.02417v1
- Date: Thu, 4 Aug 2022 02:53:44 GMT
- Title: NIR-to-VIS Face Recognition via Embedding Relations and Coordinates of
the Pairwise Features
- Authors: MyeongAh Cho, Tae-young Chun, g Taeoh Kim, Sangyoun Lee
- Abstract summary: We propose a 'Relation Module' which can simply add-on to any face recognition models.
The local features extracted from face image contain information of each component of the face.
With the proposed module, we achieve 14.81% rank-1 accuracy and 15.47% verification rate of 0.1% FAR improvements.
- Score: 5.044100238869375
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: NIR-to-VIS face recognition is identifying faces of two different domains by
extracting domain-invariant features. However, this is a challenging problem
due to the two different domain characteristics, and the lack of NIR face
dataset. In order to reduce domain discrepancy while using the existing face
recognition models, we propose a 'Relation Module' which can simply add-on to
any face recognition models. The local features extracted from face image
contain information of each component of the face. Based on two different
domain characteristics, to use the relationships between local features is more
domain-invariant than to use it as it is. In addition to these relationships,
positional information such as distance from lips to chin or eye to eye, also
provides domain-invariant information. In our Relation Module, Relation Layer
implicitly captures relationships, and Coordinates Layer models the positional
information. Also, our proposed Triplet loss with conditional margin reduces
intra-class variation in training, and resulting in additional performance
improvements. Different from the general face recognition models, our add-on
module does not need to pre-train with the large scale dataset. The proposed
module fine-tuned only with CASIA NIR-VIS 2.0 database. With the proposed
module, we achieve 14.81% rank-1 accuracy and 15.47% verification rate of 0.1%
FAR improvements compare to two baseline models.
Related papers
- Learning multi-domain feature relation for visible and Long-wave
Infrared image patch matching [39.88037892637296]
We present the largest visible and Long-wave Infrared (LWIR) image patch matching dataset, termed VL-CMIM.
In addition, a multi-domain feature relation learning network (MD-FRN) is proposed.
arXiv Detail & Related papers (2023-08-09T11:23:32Z) - Adaptive Face Recognition Using Adversarial Information Network [57.29464116557734]
Face recognition models often degenerate when training data are different from testing data.
We propose a novel adversarial information network (AIN) to address it.
arXiv Detail & Related papers (2023-05-23T02:14:11Z) - Part-guided Relational Transformers for Fine-grained Visual Recognition [59.20531172172135]
We propose a framework to learn the discriminative part features and explore correlations with a feature transformation module.
Our proposed approach does not rely on additional part branches and reaches state-the-of-art performance on 3-of-the-level object recognition.
arXiv Detail & Related papers (2022-12-28T03:45:56Z) - Decoupled Multi-task Learning with Cyclical Self-Regulation for Face
Parsing [71.19528222206088]
We propose a novel Decoupled Multi-task Learning with Cyclical Self-Regulation for face parsing.
Specifically, DML-CSR designs a multi-task model which comprises face parsing, binary edge, and category edge detection.
Our method achieves the new state-of-the-art performance on the Helen, CelebA-HQ, and LapaMask datasets.
arXiv Detail & Related papers (2022-03-28T02:12:30Z) - BDA-SketRet: Bi-Level Domain Adaptation for Zero-Shot SBIR [52.78253400327191]
BDA-SketRet is a novel framework performing a bi-level domain adaptation for aligning the spatial and semantic features of the visual data pairs.
Experimental results on the extended Sketchy, TU-Berlin, and QuickDraw exhibit sharp improvements over the literature.
arXiv Detail & Related papers (2022-01-17T18:45:55Z) - A NIR-to-VIS face recognition via part adaptive and relation attention
module [4.822208985805956]
In the face recognition application scenario, we need to process facial images captured in various conditions, such as at night by near-infrared (NIR) surveillance cameras.
The illumination difference between NIR and visible-light (VIS) causes a domain gap between facial images, and the variations in pose and emotion also make facial matching more difficult.
We propose a part relation attention module that crops facial parts obtained through a semantic mask and performs relational modeling using each of these representative features.
arXiv Detail & Related papers (2021-02-01T08:13:39Z) - Semantic Change Detection with Asymmetric Siamese Networks [71.28665116793138]
Given two aerial images, semantic change detection aims to locate the land-cover variations and identify their change types with pixel-wise boundaries.
This problem is vital in many earth vision related tasks, such as precise urban planning and natural resource management.
We present an asymmetric siamese network (ASN) to locate and identify semantic changes through feature pairs obtained from modules of widely different structures.
arXiv Detail & Related papers (2020-10-12T13:26:30Z) - DBLFace: Domain-Based Labels for NIR-VIS Heterogeneous Face Recognition [5.076419064097733]
Domain-Based Label Face (DBLFace) is a learning approach based on the assumption that a subject is not represented by a single label but by a set of labels.
In particular, a set of two labels per subject, one for the NIR images and one for the VIS images, are used for training a NIR-VIS face recognition model.
DBLFace significantly improves the rank-1 identification rate by 6.7% on the EDGE20 dataset and achieves state-of-the-art performance on the CASIA NIR-VIS 2.0 dataset.
arXiv Detail & Related papers (2020-10-08T05:22:47Z) - Multi-Margin based Decorrelation Learning for Heterogeneous Face
Recognition [90.26023388850771]
This paper presents a deep neural network approach to extract decorrelation representations in a hyperspherical space for cross-domain face images.
The proposed framework can be divided into two components: heterogeneous representation network and decorrelation representation learning.
Experimental results on two challenging heterogeneous face databases show that our approach achieves superior performance on both verification and recognition tasks.
arXiv Detail & Related papers (2020-05-25T07:01:12Z) - Associating Multi-Scale Receptive Fields for Fine-grained Recognition [5.079292308180334]
We propose a novel cross-layer non-local (CNL) module to associate multi-scale receptive fields by two operations.
CNL computes correlations between features of a query layer and all response layers.
Our model builds spatial dependencies among multi-level layers and learns more discriminative features.
arXiv Detail & Related papers (2020-05-19T01:16:31Z) - Relational Deep Feature Learning for Heterogeneous Face Recognition [17.494718795454055]
We propose a graph-structured module called Graph Module (NIR) that extracts global relational information in addition to general facial features.
The proposed method outperforms other state-of-the-art methods on five Heterogeneous Face Recognition (HFR) databases.
arXiv Detail & Related papers (2020-03-02T07:35:23Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.