Learning Unified Representations for Multi-Resolution Face Recognition
- URL: http://arxiv.org/abs/2310.09563v1
- Date: Sat, 14 Oct 2023 11:26:43 GMT
- Title: Learning Unified Representations for Multi-Resolution Face Recognition
- Authors: Hulingxiao He, Wu Yuan, Yidian Huang, Shilong Zhao, Wen Yuan, Hanqing
Li
- Abstract summary: Branch-to-Trunk network (BTNet) is a representation learning method for multi-resolution face recognition.
Our experiments demonstrate strong performance on face recognition benchmarks, both for multi-resolution identity matching and feature aggregation, with much less amount and parameter storage.
- Score: 0.7378853859331619
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In this work, we propose Branch-to-Trunk network (BTNet), a representation
learning method for multi-resolution face recognition. It consists of a trunk
network (TNet), namely a unified encoder, and multiple branch networks (BNets),
namely resolution adapters. As per the input, a resolution-specific BNet is
used and the output are implanted as feature maps in the feature pyramid of
TNet, at a layer with the same resolution. The discriminability of tiny faces
is significantly improved, as the interpolation error introduced by rescaling,
especially up-sampling, is mitigated on the inputs. With branch distillation
and backward-compatible training, BTNet transfers discriminative
high-resolution information to multiple branches while guaranteeing
representation compatibility. Our experiments demonstrate strong performance on
face recognition benchmarks, both for multi-resolution identity matching and
feature aggregation, with much less computation amount and parameter storage.
We establish new state-of-the-art on the challenging QMUL-SurvFace 1: N face
identification task. Our code is available at
https://github.com/StevenSmith2000/BTNet.
Related papers
- W-Net: A Facial Feature-Guided Face Super-Resolution Network [8.037821981254389]
Face Super-Resolution aims to recover high-resolution (HR) face images from low-resolution (LR) ones.
Existing approaches are not ideal due to their low reconstruction efficiency and insufficient utilization of prior information.
This paper proposes a novel network architecture called W-Net to address this challenge.
arXiv Detail & Related papers (2024-06-02T09:05:40Z) - Learning Enriched Features for Fast Image Restoration and Enhancement [166.17296369600774]
This paper presents a holistic goal of maintaining spatially-precise high-resolution representations through the entire network.
We learn an enriched set of features that combines contextual information from multiple scales, while simultaneously preserving the high-resolution spatial details.
Our approach achieves state-of-the-art results for a variety of image processing tasks, including defocus deblurring, image denoising, super-resolution, and image enhancement.
arXiv Detail & Related papers (2022-04-19T17:59:45Z) - CTCNet: A CNN-Transformer Cooperation Network for Face Image
Super-Resolution [64.06360660979138]
We propose an efficient CNN-Transformer Cooperation Network (CTCNet) for face super-resolution tasks.
We first devise a novel Local-Global Feature Cooperation Module (LGCM), which is composed of a Facial Structure Attention Unit (FSAU) and a Transformer block.
We then design an efficient Feature Refinement Module (FRM) to enhance the encoded features.
arXiv Detail & Related papers (2022-04-19T06:38:29Z) - Deep High-Resolution Representation Learning for Cross-Resolution Person
Re-identification [22.104449922937338]
Person re-identification (re-ID) tackles the problem of matching person images with the same identity from different cameras.
We propose a Deep High-Resolution Pseudo-Siamese Framework (PS-HRNet) to solve the problem.
Our proposed PS-HRNet improves 3.4%, 6.2%, 2.5%,1.1% and 4.2% at Rank-1 on MLR-Market-1501, MLR-CUHK03, MLR-VIPeR, MLR-DukeMTMC-reID, and CAVIAR datasets.
arXiv Detail & Related papers (2021-05-25T07:45:38Z) - Hierarchical Deep CNN Feature Set-Based Representation Learning for
Robust Cross-Resolution Face Recognition [59.29808528182607]
Cross-resolution face recognition (CRFR) is important in intelligent surveillance and biometric forensics.
Existing shallow learning-based and deep learning-based methods focus on mapping the HR-LR face pairs into a joint feature space.
In this study, we desire to fully exploit the multi-level deep convolutional neural network (CNN) feature set for robust CRFR.
arXiv Detail & Related papers (2021-03-25T14:03:42Z) - Learning Deep Interleaved Networks with Asymmetric Co-Attention for
Image Restoration [65.11022516031463]
We present a deep interleaved network (DIN) that learns how information at different states should be combined for high-quality (HQ) images reconstruction.
In this paper, we propose asymmetric co-attention (AsyCA) which is attached at each interleaved node to model the feature dependencies.
Our presented DIN can be trained end-to-end and applied to various image restoration tasks.
arXiv Detail & Related papers (2020-10-29T15:32:00Z) - Resolution Switchable Networks for Runtime Efficient Image Recognition [46.09537029831355]
We propose a general method to train a single convolutional neural network which is capable of switching image resolutions at inference.
Networks trained with the proposed method are named Resolution Switchable Networks (RS-Nets)
arXiv Detail & Related papers (2020-07-19T02:12:59Z) - HRDNet: High-resolution Detection Network for Small Objects [10.802856121451404]
Small object detection is challenging because small objects do not contain detailed information and may even disappear in the deep network.
We propose the High-Resolution Detection Network (HRDNet) to keep the benefits of high-resolution images without bringing up new problems.
We propose Multi-Depth Image Pyramid Network (MD-IPN) and Multi-Scale Feature Pyramid Network (MS-FPN) in HRDNet.
arXiv Detail & Related papers (2020-06-13T10:25:35Z) - ResNeSt: Split-Attention Networks [86.25490825631763]
We present a modularized architecture, which applies the channel-wise attention on different network branches to leverage their success in capturing cross-feature interactions and learning diverse representations.
Our model, named ResNeSt, outperforms EfficientNet in accuracy and latency trade-off on image classification.
arXiv Detail & Related papers (2020-04-19T20:40:31Z) - Learning Enriched Features for Real Image Restoration and Enhancement [166.17296369600774]
convolutional neural networks (CNNs) have achieved dramatic improvements over conventional approaches for image restoration task.
We present a novel architecture with the collective goals of maintaining spatially-precise high-resolution representations through the entire network.
Our approach learns an enriched set of features that combines contextual information from multiple scales, while simultaneously preserving the high-resolution spatial details.
arXiv Detail & Related papers (2020-03-15T11:04:30Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.