Blind Face Restoration: Benchmark Datasets and a Baseline Model
- URL: http://arxiv.org/abs/2206.03697v1
- Date: Wed, 8 Jun 2022 06:34:24 GMT
- Title: Blind Face Restoration: Benchmark Datasets and a Baseline Model
- Authors: Puyang Zhang, Kaihao Zhang, Wenhan Luo, Changsheng Li, Guoren Wang
- Abstract summary: Blind Face Restoration (BFR) aims to construct a high-quality (HQ) face image from its corresponding low-quality (LQ) input.
We first synthesize two blind face restoration benchmark datasets called EDFace-Celeb-1M (BFR128) and EDFace-Celeb-150K (BFR512)
State-of-the-art methods are benchmarked on them under five settings including blur, noise, low resolution, JPEG compression artifacts, and the combination of them (full degradation)
- Score: 63.053331687284064
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Blind Face Restoration (BFR) aims to construct a high-quality (HQ) face image
from its corresponding low-quality (LQ) input. Recently, many BFR methods have
been proposed and they have achieved remarkable success. However, these methods
are trained or evaluated on privately synthesized datasets, which makes it
infeasible for the subsequent approaches to fairly compare with them. To
address this problem, we first synthesize two blind face restoration benchmark
datasets called EDFace-Celeb-1M (BFR128) and EDFace-Celeb-150K (BFR512).
State-of-the-art methods are benchmarked on them under five settings including
blur, noise, low resolution, JPEG compression artifacts, and the combination of
them (full degradation). To make the comparison more comprehensive, five
widely-used quantitative metrics and two task-driven metrics including Average
Face Landmark Distance (AFLD) and Average Face ID Cosine Similarity (AFICS) are
applied. Furthermore, we develop an effective baseline model called Swin
Transformer U-Net (STUNet). The STUNet with U-net architecture applies an
attention mechanism and a shifted windowing scheme to capture long-range pixel
interactions and focus more on significant features while still being trained
efficiently. Experimental results show that the proposed baseline method
performs favourably against the SOTA methods on various BFR tasks.
Related papers
- Bridging the Gaps: Utilizing Unlabeled Face Recognition Datasets to Boost Semi-Supervised Facial Expression Recognition [5.750927184237346]
We focus on utilizing large unlabeled Face Recognition (FR) datasets to boost semi-supervised FER.
Specifically, we first perform face reconstruction pre-training on large-scale facial images without annotations.
To further alleviate the scarcity of labeled and diverse images, we propose a Mixup-based data augmentation strategy.
arXiv Detail & Related papers (2024-10-23T07:26:19Z) - Analysis and Benchmarking of Extending Blind Face Image Restoration to Videos [99.42805906884499]
We first introduce a Real-world Low-Quality Face Video benchmark (RFV-LQ) to evaluate leading image-based face restoration algorithms.
We then conduct a thorough systematical analysis of the benefits and challenges associated with extending blind face image restoration algorithms to degraded face videos.
Our analysis identifies several key issues, primarily categorized into two aspects: significant jitters in facial components and noise-shape flickering between frames.
arXiv Detail & Related papers (2024-10-15T17:53:25Z) - A Simple and Generalist Approach for Panoptic Segmentation [57.94892855772925]
Generalist vision models aim for one and the same architecture for a variety of vision tasks.
While such shared architecture may seem attractive, generalist models tend to be outperformed by their bespoken counterparts.
We address this problem by introducing two key contributions, without compromising the desirable properties of generalist models.
arXiv Detail & Related papers (2024-08-29T13:02:12Z) - A visualization method for data domain changes in CNN networks and the optimization method for selecting thresholds in classification tasks [1.1118946307353794]
Face Anti-Spoofing (FAS) has played a crucial role in preserving the security of face recognition technology.
With the rise of counterfeit face generation techniques, the challenge posed by digitally edited faces to face anti-spoofing is escalating.
We propose a visualization method that intuitively reflects the training outcomes of models by visualizing the prediction results on datasets.
arXiv Detail & Related papers (2024-04-19T03:12:17Z) - Latent Diffusion Models for Attribute-Preserving Image Anonymization [4.080920304681247]
This paper presents the first approach to image anonymization based on Latent Diffusion Models (LDMs)
We propose two LDMs for this purpose: CAFLaGE-Base exploits a combination of pre-trained ControlNets, and a new controlling mechanism designed to increase the distance between the real and anonymized images.
arXiv Detail & Related papers (2024-03-21T19:09:21Z) - UniTSFace: Unified Threshold Integrated Sample-to-Sample Loss for Face
Recognition [35.66000285310775]
We propose a unified threshold integrated sample-to-sample based loss (USS loss)
USS loss features an explicit unified threshold for distinguishing positive from negative pairs.
We also derive the sample-to-sample based softmax and BCE losses, and discuss their relationship.
arXiv Detail & Related papers (2023-11-04T23:00:40Z) - Improving Pixel-based MIM by Reducing Wasted Modeling Capability [77.99468514275185]
We propose a new method that explicitly utilizes low-level features from shallow layers to aid pixel reconstruction.
To the best of our knowledge, we are the first to systematically investigate multi-level feature fusion for isotropic architectures.
Our method yields significant performance gains, such as 1.2% on fine-tuning, 2.8% on linear probing, and 2.6% on semantic segmentation.
arXiv Detail & Related papers (2023-08-01T03:44:56Z) - Multi-Prior Learning via Neural Architecture Search for Blind Face
Restoration [61.27907052910136]
Blind Face Restoration (BFR) aims to recover high-quality face images from low-quality ones.
Current methods still suffer from two major difficulties: 1) how to derive a powerful network architecture without extensive hand tuning; 2) how to capture complementary information from multiple facial priors in one network to improve restoration performance.
We propose a Face Restoration Searching Network (FRSNet) to adaptively search the suitable feature extraction architecture within our specified search space.
arXiv Detail & Related papers (2022-06-28T12:29:53Z) - Unsupervised Learning Facial Parameter Regressor for Action Unit
Intensity Estimation via Differentiable Renderer [51.926868759681014]
We present a framework to predict the facial parameters based on a bone-driven face model (BDFM) under different views.
The proposed framework consists of a feature extractor, a generator, and a facial parameter regressor.
arXiv Detail & Related papers (2020-08-20T09:49:13Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.