Dive into the Resolution Augmentations and Metrics in Low Resolution
Face Recognition: A Plain yet Effective New Baseline
- URL: http://arxiv.org/abs/2302.05621v1
- Date: Sat, 11 Feb 2023 07:31:47 GMT
- Title: Dive into the Resolution Augmentations and Metrics in Low Resolution
Face Recognition: A Plain yet Effective New Baseline
- Authors: Xu Ling, Yichen Lu, Wenqi Xu, Weihong Deng, Yingjie Zhang, Xingchen
Cui, Hongzhi Shi, Dongchao Wen
- Abstract summary: We deal with the huge domain gap between High Resolution (HR) and Low Resolution (LR) domains.
We propose a more effective Multi-Resolution Augmentation and a novel metric loss based on a LogExp distance function.
Our method could learn more general knowledge in a wide resolution range of images and balanced results can be achieved by our framework.
- Score: 33.82038623492457
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Although deep learning has significantly improved Face Recognition (FR),
dramatic performance deterioration may occur when processing Low Resolution
(LR) faces. To alleviate this, approaches based on unified feature space are
proposed with the sacrifice under High Resolution (HR) circumstances. To deal
with the huge domain gap between HR and LR domains and achieve the best on both
domains, we first took a closer look at the impacts of several resolution
augmentations and then analyzed the difficulty of LR samples from the
perspective of the model gradient produced by different resolution samples.
Besides, we also find that the introduction of some resolutions could help the
learning of lower resolutions. Based on these, we divide the LR samples into
three difficulties according to the resolution and propose a more effective
Multi-Resolution Augmentation. Then, due to the rapidly increasing domain gap
as the resolution decreases, we carefully design a novel and effective metric
loss based on a LogExp distance function that provides decent gradients to
prevent oscillation near the convergence point or tolerance to small distance
errors; it could also dynamically adjust the penalty for errors in different
dimensions, allowing for more optimization of dimensions with large errors.
Combining these two insights, our model could learn more general knowledge in a
wide resolution range of images and balanced results can be achieved by our
extremely simple framework. Moreover, the augmentations and metrics are the
cornerstones of LRFR, so our method could be considered a new baseline for the
LRFR task. Experiments on the LRFR datasets: SCface, XQLFW, and large-scale
LRFR dataset: TinyFace demonstrate the effectiveness of our methods, while the
degradation on HRFR datasets is significantly reduced.
Related papers
- CCFace: Classification Consistency for Low-Resolution Face Recognition [12.129404936688752]
Deep face recognition methods have shown a significant decline in performance when applied to real-world low-resolution benchmarks like TinyFace or SCFace.
We propose a novel classification consistency knowledge distillation approach that transfers the learned classifier from a high-resolution model to a low-resolution network.
Our proposed method outperforms state-of-the-art approaches on low-resolution benchmarks, with a three percent improvement on TinyFace while maintaining performance on high-resolution benchmarks.
arXiv Detail & Related papers (2023-08-18T01:24:52Z) - Learning from Multi-Perception Features for Real-Word Image
Super-resolution [87.71135803794519]
We propose a novel SR method called MPF-Net that leverages multiple perceptual features of input images.
Our method incorporates a Multi-Perception Feature Extraction (MPFE) module to extract diverse perceptual information.
We also introduce a contrastive regularization term (CR) that improves the model's learning capability.
arXiv Detail & Related papers (2023-05-26T07:35:49Z) - Learning Resolution-Adaptive Representations for Cross-Resolution Person
Re-Identification [49.57112924976762]
Cross-resolution person re-identification problem aims to match low-resolution (LR) query identity images against high resolution (HR) gallery images.
It is a challenging and practical problem since the query images often suffer from resolution degradation due to the different capturing conditions from real-world cameras.
This paper explores an alternative SR-free paradigm to directly compare HR and LR images via a dynamic metric, which is adaptive to the resolution of a query image.
arXiv Detail & Related papers (2022-07-09T03:49:51Z) - Hierarchical Similarity Learning for Aliasing Suppression Image
Super-Resolution [64.15915577164894]
A hierarchical image super-resolution network (HSRNet) is proposed to suppress the influence of aliasing.
HSRNet achieves better quantitative and visual performance than other works, and remits the aliasing more effectively.
arXiv Detail & Related papers (2022-06-07T14:55:32Z) - Best-Buddy GANs for Highly Detailed Image Super-Resolution [71.13466303340192]
We consider the single image super-resolution (SISR) problem, where a high-resolution (HR) image is generated based on a low-resolution (LR) input.
Most methods along this line rely on a predefined single-LR-single-HR mapping, which is not flexible enough for the SISR task.
We propose best-buddy GANs (Beby-GAN) for rich-detail SISR. Relaxing the immutable one-to-one constraint, we allow the estimated patches to dynamically seek the best supervision.
arXiv Detail & Related papers (2021-03-29T02:58:27Z) - Hierarchical Deep CNN Feature Set-Based Representation Learning for
Robust Cross-Resolution Face Recognition [59.29808528182607]
Cross-resolution face recognition (CRFR) is important in intelligent surveillance and biometric forensics.
Existing shallow learning-based and deep learning-based methods focus on mapping the HR-LR face pairs into a joint feature space.
In this study, we desire to fully exploit the multi-level deep convolutional neural network (CNN) feature set for robust CRFR.
arXiv Detail & Related papers (2021-03-25T14:03:42Z) - Deep Generative Adversarial Residual Convolutional Networks for
Real-World Super-Resolution [31.934084942626257]
We propose a deep Super-Resolution Residual Convolutional Generative Adversarial Network (SRResCGAN)
It follows the real-world degradation settings by adversarial training the model with pixel-wise supervision in the HR domain from its generated LR counterpart.
The proposed network exploits the residual learning by minimizing the energy-based objective function with powerful image regularization and convex optimization techniques.
arXiv Detail & Related papers (2020-05-03T00:12:38Z) - PULSE: Self-Supervised Photo Upsampling via Latent Space Exploration of
Generative Models [77.32079593577821]
PULSE (Photo Upsampling via Latent Space Exploration) generates high-resolution, realistic images at resolutions previously unseen in the literature.
Our method outperforms state-of-the-art methods in perceptual quality at higher resolutions and scale factors than previously possible.
arXiv Detail & Related papers (2020-03-08T16:44:31Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.