EDFace-Celeb-1M: Benchmarking Face Hallucination with a Million-scale
Dataset
- URL: http://arxiv.org/abs/2110.05031v1
- Date: Mon, 11 Oct 2021 06:53:24 GMT
- Title: EDFace-Celeb-1M: Benchmarking Face Hallucination with a Million-scale
Dataset
- Authors: Kaihao Zhang, Dongxu Li, Wenhan Luo, Jingyu Liu, Jiankang Deng, Wei
Liu, Stefanos Zafeiriou
- Abstract summary: Recent deep face hallucination methods show stunning performance in super-resolving severely degraded facial images.
It is thus unclear how these algorithms perform on public face hallucination datasets.
This paper builds a public Ethnically Diverse Face dataset, EDFace-Celeb-1M, and design a benchmark task for face hallucination.
- Score: 92.537021496096
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Recent deep face hallucination methods show stunning performance in
super-resolving severely degraded facial images, even surpassing human ability.
However, these algorithms are mainly evaluated on non-public synthetic
datasets. It is thus unclear how these algorithms perform on public face
hallucination datasets. Meanwhile, most of the existing datasets do not well
consider the distribution of races, which makes face hallucination methods
trained on these datasets biased toward some specific races. To address the
above two problems, in this paper, we build a public Ethnically Diverse Face
dataset, EDFace-Celeb-1M, and design a benchmark task for face hallucination.
Our dataset includes 1.7 million photos that cover different countries, with
balanced race composition. To the best of our knowledge, it is the largest and
publicly available face hallucination dataset in the wild. Associated with this
dataset, this paper also contributes various evaluation protocols and provides
comprehensive analysis to benchmark the existing state-of-the-art methods. The
benchmark evaluations demonstrate the performance and limitations of
state-of-the-art algorithms.
Related papers
- DiffusionFace: Towards a Comprehensive Dataset for Diffusion-Based Face Forgery Analysis [71.40724659748787]
DiffusionFace is the first diffusion-based face forgery dataset.
It covers various forgery categories, including unconditional and Text Guide facial image generation, Img2Img, Inpaint, and Diffusion-based facial exchange algorithms.
It provides essential metadata and a real-world internet-sourced forgery facial image dataset for evaluation.
arXiv Detail & Related papers (2024-03-27T11:32:44Z) - Multi-Task Faces (MTF) Data Set: A Legally and Ethically Compliant
Collection of Face Images for Various Classification Tasks [3.1133049660590615]
Recent privacy regulations have restricted the ways in which human images may be collected and used for research.
Several previously published data sets containing human faces have been removed from the internet due to inadequate data collection methods.
We present the Multi-Task Faces (MTF) image data set, a meticulously curated collection of face images designed for various classification tasks.
arXiv Detail & Related papers (2023-11-20T16:19:46Z) - Benchmarking Algorithmic Bias in Face Recognition: An Experimental
Approach Using Synthetic Faces and Human Evaluation [24.35436087740559]
We propose an experimental method for measuring bias in face recognition systems.
Our method is based on generating synthetic faces using a neural face generator.
We validate our method quantitatively by evaluating race and gender biases of three research-grade face recognition models.
arXiv Detail & Related papers (2023-08-10T08:57:31Z) - Multiface: A Dataset for Neural Face Rendering [108.44505415073579]
In this work, we present Multiface, a new multi-view, high-resolution human face dataset.
We introduce Mugsy, a large scale multi-camera apparatus to capture high-resolution synchronized videos of a facial performance.
The goal of Multiface is to close the gap in accessibility to high quality data in the academic community and to enable research in VR telepresence.
arXiv Detail & Related papers (2022-07-22T17:55:39Z) - Blind Face Restoration: Benchmark Datasets and a Baseline Model [63.053331687284064]
Blind Face Restoration (BFR) aims to construct a high-quality (HQ) face image from its corresponding low-quality (LQ) input.
We first synthesize two blind face restoration benchmark datasets called EDFace-Celeb-1M (BFR128) and EDFace-Celeb-150K (BFR512)
State-of-the-art methods are benchmarked on them under five settings including blur, noise, low resolution, JPEG compression artifacts, and the combination of them (full degradation)
arXiv Detail & Related papers (2022-06-08T06:34:24Z) - Meta Balanced Network for Fair Face Recognition [51.813457201437195]
We systematically and scientifically study bias from both data and algorithm aspects.
We propose a novel meta-learning algorithm, called Meta Balanced Network (MBN), which learns adaptive margins in large margin loss.
Extensive experiments show that MBN successfully mitigates bias and learns more balanced performance for people with different skin tones in face recognition.
arXiv Detail & Related papers (2022-05-13T10:25:44Z) - Escaping Data Scarcity for High-Resolution Heterogeneous Face
Hallucination [68.78903256687697]
In Heterogeneous Face Recognition (HFR), the objective is to match faces across two different domains such as visible and thermal.
Recent methods attempting to fill the gap via synthesis have achieved promising results, but their performance is still limited by the scarcity of paired training data.
In this paper, we propose a new face hallucination paradigm for HFR, which not only enables data-efficient synthesis but also allows to scale up model training without breaking any privacy policy.
arXiv Detail & Related papers (2022-03-30T20:44:33Z) - FaceOcc: A Diverse, High-quality Face Occlusion Dataset for Human Face
Extraction [3.8502825594372703]
Occlusions often occur in face images in the wild, troubling face-related tasks such as landmark detection, 3D reconstruction, and face recognition.
This paper proposes a novel face segmentation dataset with manually labeled face occlusions from the CelebA-HQ and the internet.
We trained a straightforward face segmentation model but obtained SOTA performance, convincingly demonstrating the effectiveness of the proposed dataset.
arXiv Detail & Related papers (2022-01-20T19:44:18Z) - On Recognizing Occluded Faces in the Wild [10.420394952839242]
We present the Real World Occluded Faces dataset.
This dataset contains faces with both upper face.
occluded, due to sunglasses, and lower face.
occluded, due to masks.
It is observed that the performance drop is far less when the models are tested on synthetically generated occluded faces.
arXiv Detail & Related papers (2021-09-08T14:20:10Z) - Dataset Cleaning -- A Cross Validation Methodology for Large Facial
Datasets using Face Recognition [0.40611352512781856]
In recent years, large "in the wild" face datasets have been released in an attempt to facilitate progress in tasks such as face detection, face recognition, and other tasks.
Due to the automatic way of gathering these datasets and due to their large size, many identities folder contain mislabeled samples which deteriorates the quality of the datasets.
In this work, it is presented a semi-automatic method for cleaning the noisy large face datasets with the use of face recognition.
arXiv Detail & Related papers (2020-03-24T13:01:13Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.