Face Presentation Attack Detection using Taskonomy Feature
- URL: http://arxiv.org/abs/2111.11046v1
- Date: Mon, 22 Nov 2021 08:35:26 GMT
- Title: Face Presentation Attack Detection using Taskonomy Feature
- Authors: Wentian Zhang, Haozhe Liu, Raghavendra Ramachandra, Feng Liu, Linlin Shen, Christoph Busch
- Abstract summary: Presentation Attack Detection (PAD) methods are critical to ensuring the security of Face Recognition Systems (FRSs).
Existing PAD methods are highly dependent on the limited training set and cannot generalize well to unknown PAs.
We propose to apply taskonomy (task taxonomy) from other face-related tasks to solve face PAD.
- Score: 26.343512092423985
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The robustness and generalization ability of Presentation Attack Detection
(PAD) methods are critical to ensuring the security of Face Recognition Systems
(FRSs). However, in real-world scenarios, Presentation Attacks (PAs) are diverse
and hard to collect. Existing PAD methods are highly dependent on the
limited training set and cannot generalize well to unknown PAs. Unlike the PAD
task, other face-related tasks trained on huge amounts of real faces (e.g. face
recognition and attribute editing) can be effectively adopted into different
application scenarios. Inspired by this, we propose to apply taskonomy (task
taxonomy) from other face-related tasks to solve face PAD, so as to improve the
generalization ability in detecting PAs. The proposed method first introduces
task-specific features from other face-related tasks; then, we design a
Cross-Modal Adapter using a Graph Attention Network (GAT) to re-map such
features to adapt to the PAD task. Finally, face PAD is achieved by using the
hierarchical features from a CNN-based PA detector and the re-mapped features.
The experimental results show that the proposed method can achieve significant
improvements on complicated and hybrid datasets when compared with
state-of-the-art methods. In particular, when trained using OULU-NPU,
CASIA-FASD, and Idiap Replay-Attack, we obtain an HTER (Half Total Error Rate) of
5.48% on MSU-MFSD, outperforming the baseline by 7.39%. Code will be made
publicly available.
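For readers unfamiliar with the HTER metric quoted in the abstract, the following is a minimal sketch of how it is commonly computed: the mean of the false acceptance rate (attacks accepted as bona fide) and the false rejection rate (bona fide faces rejected) at a fixed decision threshold. The function name `hter`, the label convention (1 = bona fide, 0 = attack), and the example scores are illustrative assumptions and are not taken from the paper. In cross-dataset protocols the threshold is usually fixed on a development set rather than on the test data; here it is simply passed in.

```python
def hter(scores, labels, threshold):
    """Half Total Error Rate at a fixed threshold.

    Assumed convention (not from the paper): a higher score means
    "more likely bona fide"; labels use 1 = bona fide, 0 = attack.
    """
    attacks = [s for s, y in zip(scores, labels) if y == 0]
    bona_fide = [s for s, y in zip(scores, labels) if y == 1]
    if not attacks or not bona_fide:
        raise ValueError("need at least one sample of each class")

    # False acceptance rate: attacks scored at or above the threshold.
    far = sum(s >= threshold for s in attacks) / len(attacks)
    # False rejection rate: bona fide samples scored below the threshold.
    frr = sum(s < threshold for s in bona_fide) / len(bona_fide)
    return (far + frr) / 2


# Made-up scores for illustration only.
example_scores = [0.9, 0.8, 0.4, 0.7, 0.2, 0.1]
example_labels = [1, 1, 1, 0, 0, 0]
example_hter = hter(example_scores, example_labels, threshold=0.5)
```

In this toy example one of three attacks is accepted and one of three bona fide samples is rejected, so both error rates are 1/3 and the HTER is also 1/3.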
Related papers
- Semantics-Oriented Multitask Learning for DeepFake Detection: A Joint Embedding Approach [77.65459419417533]
We propose an automatic dataset expansion technique to support semantics-oriented DeepFake detection tasks.
We also resort to joint embedding of face images and their corresponding labels for prediction.
Our method improves the generalizability of DeepFake detection and renders some degree of model interpretation by providing human-understandable explanations.
arXiv Detail & Related papers (2024-08-29T07:11:50Z)
- Generalizable Facial Expression Recognition [41.639746139849564]
SOTA facial expression recognition (FER) methods fail on test sets that have a domain gap from the training set.
Recent domain adaptation FER methods need to acquire labeled or unlabeled samples of target domains to fine-tune the FER model.
This paper aims to improve the zero-shot generalization ability of FER methods on different unseen test sets using only one train set.
arXiv Detail & Related papers (2024-08-20T07:48:45Z)
- Faceptor: A Generalist Model for Face Perception [52.8066001012464]
Faceptor adopts a well-designed single-encoder dual-decoder architecture.
Introducing Layer-Attention into Faceptor enables the model to adaptively select features from optimal layers to perform the desired tasks.
Our training framework can also be applied to auxiliary supervised learning, significantly improving performance in data-sparse tasks such as age estimation and expression recognition.
arXiv Detail & Related papers (2024-03-14T15:42:31Z)
- SwinFace: A Multi-task Transformer for Face Recognition, Expression Recognition, Age Estimation and Attribute Estimation [60.94239810407917]
This paper presents a multi-purpose algorithm for simultaneous face recognition, facial expression recognition, age estimation, and face attribute estimation based on a single Swin Transformer.
To address the conflicts among multiple tasks, a Multi-Level Channel Attention (MLCA) module is integrated into each task-specific analysis.
Experiments show that the proposed model has a better understanding of the face and achieves excellent performance for all tasks.
arXiv Detail & Related papers (2023-08-22T15:38:39Z)
- Watch Out for the Confusing Faces: Detecting Face Swapping with the Probability Distribution of Face Identification Models [37.49012763328351]
We propose a novel face swapping detection approach based on face identification probability distributions.
IdP_FSD is specially designed for detecting swapped faces whose identities belong to a finite set.
IdP_FSD exploits a common property of face swapping: the identity of a swapped face combines those of the two faces involved in the swap.
arXiv Detail & Related papers (2023-03-23T09:33:10Z)
- Cluster-level pseudo-labelling for source-free cross-domain facial expression recognition [94.56304526014875]
We propose the first Source-Free Unsupervised Domain Adaptation (SFUDA) method for Facial Expression Recognition (FER).
Our method exploits self-supervised pretraining to learn good feature representations from the target data.
We validate the effectiveness of our method in four adaptation setups, proving that it consistently outperforms existing SFUDA methods when applied to FER.
arXiv Detail & Related papers (2022-10-11T08:24:50Z)
- Taming Self-Supervised Learning for Presentation Attack Detection: De-Folding and De-Mixing [42.733666815035534]
Biometric systems are vulnerable to Presentation Attacks performed using various Presentation Attack Instruments (PAIs).
We propose a self-supervised learning-based method, denoted as DF-DM.
DF-DM is based on a global-local view coupled with De-Folding and De-Mixing to derive the task-specific representation for PAD.
arXiv Detail & Related papers (2021-09-09T08:38:17Z)
- Towards Transferable Adversarial Attack against Deep Face Recognition [58.07786010689529]
Deep convolutional neural networks (DCNNs) have been found to be vulnerable to adversarial examples.
Transferable adversarial examples can severely hinder the robustness of DCNNs.
We propose DFANet, a dropout-based method used in convolutional layers, which can increase the diversity of surrogate models.
We generate a new set of adversarial face pairs that can successfully attack four commercial APIs without any queries.
arXiv Detail & Related papers (2020-04-13T06:44:33Z)
- Cross-domain Face Presentation Attack Detection via Multi-domain Disentangled Representation Learning [109.42987031347582]
Face presentation attack detection (PAD) is an urgent problem for face recognition systems.
We propose an efficient disentangled representation learning for cross-domain face PAD.
Our approach consists of disentangled representation learning (DR-Net) and multi-domain learning (MD-Net).
arXiv Detail & Related papers (2020-04-04T15:45:14Z)
This list is automatically generated from the titles and abstracts of the papers on this site.