SFace: Privacy-friendly and Accurate Face Recognition using Synthetic
Data
- URL: http://arxiv.org/abs/2206.10520v1
- Date: Tue, 21 Jun 2022 16:42:04 GMT
- Title: SFace: Privacy-friendly and Accurate Face Recognition using Synthetic
Data
- Authors: Fadi Boutros, Marco Huber, Patrick Siebke, Tim Rieber, Naser Damer
- Abstract summary: We propose and investigate the feasibility of using a privacy-friendly synthetically generated face dataset to train face recognition models.
To address the privacy aspect of using such data to train a face recognition model, we provide extensive evaluation experiments on the identity relation between the synthetic dataset and the original authentic dataset used to train the generative model.
We also propose to train face recognition on our privacy-friendly dataset, SFace, using three different learning strategies, multi-class classification, label-free knowledge transfer, and combined learning of multi-class classification and knowledge transfer.
- Score: 9.249824128880707
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Recent deep face recognition models proposed in the literature utilized
large-scale public datasets such as MS-Celeb-1M and VGGFace2 for training very
deep neural networks, achieving state-of-the-art performance on mainstream
benchmarks. Recently, many of these datasets, e.g., MS-Celeb-1M and VGGFace2,
are retracted due to credible privacy and ethical concerns. This motivates this
work to propose and investigate the feasibility of using a privacy-friendly
synthetically generated face dataset to train face recognition models. Towards
this end, we utilize a class-conditional generative adversarial network to
generate class-labeled synthetic face images, namely SFace. To address the
privacy aspect of using such data to train a face recognition model, we provide
extensive evaluation experiments on the identity relation between the synthetic
dataset and the original authentic dataset used to train the generative model.
Our reported evaluation proved that associating an identity of the authentic
dataset to one with the same class label in the synthetic dataset is hardly
possible. We also propose to train face recognition on our privacy-friendly
dataset, SFace, using three different learning strategies, multi-class
classification, label-free knowledge transfer, and combined learning of
multi-class classification and knowledge transfer. The reported evaluation
results on five authentic face benchmarks demonstrated that the
privacy-friendly synthetic dataset has high potential to be used for training
face recognition models, achieving, for example, a verification accuracy of
91.87\% on LFW using multi-class classification and 99.13\% using the combined
learning strategy.
Related papers
- ID$^3$: Identity-Preserving-yet-Diversified Diffusion Models for Synthetic Face Recognition [60.15830516741776]
Synthetic face recognition (SFR) aims to generate datasets that mimic the distribution of real face data.
We introduce a diffusion-fueled SFR model termed $textID3$.
$textID3$ employs an ID-preserving loss to generate diverse yet identity-consistent facial appearances.
arXiv Detail & Related papers (2024-09-26T06:46:40Z) - G2Face: High-Fidelity Reversible Face Anonymization via Generative and Geometric Priors [71.69161292330504]
Reversible face anonymization seeks to replace sensitive identity information in facial images with synthesized alternatives.
This paper introduces Gtextsuperscript2Face, which leverages both generative and geometric priors to enhance identity manipulation.
Our method outperforms existing state-of-the-art techniques in face anonymization and recovery, while preserving high data utility.
arXiv Detail & Related papers (2024-08-18T12:36:47Z) - SDFR: Synthetic Data for Face Recognition Competition [51.9134406629509]
Large-scale face recognition datasets are collected by crawling the Internet and without individuals' consent, raising legal, ethical, and privacy concerns.
Recently several works proposed generating synthetic face recognition datasets to mitigate concerns in web-crawled face recognition datasets.
This paper presents the summary of the Synthetic Data for Face Recognition (SDFR) Competition held in conjunction with the 18th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2024)
The SDFR competition was split into two tasks, allowing participants to train face recognition systems using new synthetic datasets and/or existing ones.
arXiv Detail & Related papers (2024-04-06T10:30:31Z) - If It's Not Enough, Make It So: Reducing Authentic Data Demand in Face Recognition through Synthetic Faces [16.977459035497162]
Large face datasets are primarily sourced from web-based images, lacking explicit user consent.
In this paper, we examine whether and how synthetic face data can be used to train effective face recognition models.
arXiv Detail & Related papers (2024-04-04T15:45:25Z) - SynthDistill: Face Recognition with Knowledge Distillation from
Synthetic Data [8.026313049094146]
State-of-the-art face recognition networks are often computationally expensive and cannot be used for mobile applications.
We propose a new framework to train lightweight face recognition models by distilling the knowledge of a pretrained teacher face recognition model using synthetic data.
We use synthetic face images without identity labels, mitigating the problems in the intra-class variation generation of synthetic datasets.
arXiv Detail & Related papers (2023-08-28T19:15:27Z) - Identity-driven Three-Player Generative Adversarial Network for
Synthetic-based Face Recognition [14.73254194339562]
We present a three-player generative adversarial network (GAN) framework, namely IDnet, that enables the integration of identity information into the generation process.
We empirically proved that our IDnet synthetic images are of higher identity discrimination in comparison to the conventional two-player GAN.
We demonstrated the applicability of our IDnet data in training face recognition models by evaluating these models on a wide set of face recognition benchmarks.
arXiv Detail & Related papers (2023-04-30T00:04:27Z) - Attribute-preserving Face Dataset Anonymization via Latent Code
Optimization [64.4569739006591]
We present a task-agnostic anonymization procedure that directly optimize the images' latent representation in the latent space of a pre-trained GAN.
We demonstrate through a series of experiments that our method is capable of anonymizing the identity of the images whilst -- crucially -- better-preserving the facial attributes.
arXiv Detail & Related papers (2023-03-20T17:34:05Z) - Unsupervised Face Recognition using Unlabeled Synthetic Data [16.494722503803196]
We propose an unsupervised face recognition model based on unlabeled synthetic data (U SynthFace)
Our proposed U SynthFace learns to maximize the similarity between two augmented images of the same synthetic instance.
We prove the effectiveness of our U SynthFace in achieving relatively high recognition accuracies using unlabeled synthetic data.
arXiv Detail & Related papers (2022-11-14T14:05:19Z) - How to Boost Face Recognition with StyleGAN? [13.067766076889995]
State-of-the-art face recognition systems require vast amounts of labeled training data.
Self-supervised revolution in the industry motivates research on the adaptation of related techniques to facial recognition.
We show that a simple approach based on fine-tuning pSp encoder for StyleGAN allows us to improve upon the state-of-the-art facial recognition.
arXiv Detail & Related papers (2022-10-18T18:41:56Z) - SynFace: Face Recognition with Synthetic Data [83.15838126703719]
We devise the SynFace with identity mixup (IM) and domain mixup (DM) to mitigate the performance gap.
We also perform a systematically empirical analysis on synthetic face images to provide some insights on how to effectively utilize synthetic data for face recognition.
arXiv Detail & Related papers (2021-08-18T03:41:54Z) - Investigating the Impact of Inclusion in Face Recognition Training Data
on Individual Face Identification [93.5538147928669]
We audit ArcFace, a state-of-the-art, open source face recognition system, in a large-scale face identification experiment with more than one million distractor images.
We find a Rank-1 face identification accuracy of 79.71% for individuals present in the model's training data and an accuracy of 75.73% for those not present.
arXiv Detail & Related papers (2020-01-09T15:50:28Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.