Cross-Resolution Adversarial Dual Network for Person Re-Identification
and Beyond
- URL: http://arxiv.org/abs/2002.09274v2
- Date: Thu, 22 Oct 2020 18:01:01 GMT
- Title: Cross-Resolution Adversarial Dual Network for Person Re-Identification
and Beyond
- Authors: Yu-Jhe Li, Yun-Chun Chen, Yen-Yu Lin, Yu-Chiang Frank Wang
- Abstract summary: Person re-identification (re-ID) aims at matching images of the same person across camera views.
Due to varying distances between cameras and persons of interest, resolution mismatch can be expected.
We propose a novel generative adversarial network to address cross-resolution person re-ID.
- Score: 59.149653740463435
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Person re-identification (re-ID) aims at matching images of the same person
across camera views. Due to varying distances between cameras and persons of
interest, resolution mismatch can be expected, which would degrade re-ID
performance in real-world scenarios. To overcome this problem, we propose a
novel generative adversarial network to address cross-resolution person re-ID,
allowing query images with varying resolutions. By advancing adversarial
learning techniques, our proposed model learns resolution-invariant image
representations while being able to recover the missing details in
low-resolution input images. The resulting features can be jointly applied for
improving re-ID performance due to preserving resolution invariance and
recovering re-ID oriented discriminative details. Extensive experimental
results on five standard person re-ID benchmarks confirm the effectiveness of
our method and the superiority over the state-of-the-art approaches, especially
when the input resolutions are not seen during training. Furthermore, the
experimental results on two vehicle re-ID benchmarks also confirm the
generalization of our model on cross-resolution visual tasks. The extensions of
semi-supervised settings further support the use of our proposed approach to
real-world scenarios and applications.
Related papers
- ID-Aligner: Enhancing Identity-Preserving Text-to-Image Generation with Reward Feedback Learning [57.91881829308395]
Identity-preserving text-to-image generation (ID-T2I) has received significant attention due to its wide range of application scenarios like AI portrait and advertising.
We present textbfID-Aligner, a general feedback learning framework to enhance ID-T2I performance.
arXiv Detail & Related papers (2024-04-23T18:41:56Z) - Cross-Modality Perturbation Synergy Attack for Person Re-identification [66.48494594909123]
The main challenge in cross-modality ReID lies in effectively dealing with visual differences between different modalities.
Existing attack methods have primarily focused on the characteristics of the visible image modality.
This study proposes a universal perturbation attack specifically designed for cross-modality ReID.
arXiv Detail & Related papers (2024-01-18T15:56:23Z) - Modality Unifying Network for Visible-Infrared Person Re-Identification [24.186989535051623]
Visible-infrared person re-identification (VI-ReID) is a challenging task due to large cross-modality discrepancies and intra-class variations.
Existing methods mainly focus on learning modality-shared representations by embedding different modalities into the same feature space.
We propose a novel Modality Unifying Network (MUN) to explore a robust auxiliary modality for VI-ReID.
arXiv Detail & Related papers (2023-09-12T14:22:22Z) - ResFormer: Scaling ViTs with Multi-Resolution Training [100.01406895070693]
We introduce ResFormer, a framework for improved performance on a wide spectrum of, mostly unseen, testing resolutions.
In particular, ResFormer operates on replicated images of different resolutions and enforces a scale consistency loss to engage interactive information across different scales.
We demonstrate, moreover, ResFormer is flexible and can be easily extended to semantic segmentation, object detection and video action recognition.
arXiv Detail & Related papers (2022-12-01T18:57:20Z) - Learning Resolution-Adaptive Representations for Cross-Resolution Person
Re-Identification [49.57112924976762]
Cross-resolution person re-identification problem aims to match low-resolution (LR) query identity images against high resolution (HR) gallery images.
It is a challenging and practical problem since the query images often suffer from resolution degradation due to the different capturing conditions from real-world cameras.
This paper explores an alternative SR-free paradigm to directly compare HR and LR images via a dynamic metric, which is adaptive to the resolution of a query image.
arXiv Detail & Related papers (2022-07-09T03:49:51Z) - Resolution based Feature Distillation for Cross Resolution Person
Re-Identification [17.86505685442293]
Person re-identification (re-id) aims to retrieve images of same identities across different camera views.
Resolution mismatch occurs due to varying distances between person of interest and cameras.
We propose a Resolution based Feature Distillation (RFD) approach to overcome the problem of multiple resolutions.
arXiv Detail & Related papers (2021-09-16T11:07:59Z) - Low Resolution Information Also Matters: Learning Multi-Resolution
Representations for Person Re-Identification [37.01666917620271]
Cross-resolution person re-ID aims to match person images captured from non-overlapped cameras.
emphtextbfMulti-Resolution textbfRepresentations textbfJoint textbfLearning (textbfMRJL)
Our method consists of a Resolution Reconstruction Network (RRN) and a Dual Feature Fusion Network (DFFN)
arXiv Detail & Related papers (2021-05-26T16:54:56Z) - Deep High-Resolution Representation Learning for Cross-Resolution Person
Re-identification [22.104449922937338]
Person re-identification (re-ID) tackles the problem of matching person images with the same identity from different cameras.
We propose a Deep High-Resolution Pseudo-Siamese Framework (PS-HRNet) to solve the problem.
Our proposed PS-HRNet improves 3.4%, 6.2%, 2.5%,1.1% and 4.2% at Rank-1 on MLR-Market-1501, MLR-CUHK03, MLR-VIPeR, MLR-DukeMTMC-reID, and CAVIAR datasets.
arXiv Detail & Related papers (2021-05-25T07:45:38Z) - Hierarchical Deep CNN Feature Set-Based Representation Learning for
Robust Cross-Resolution Face Recognition [59.29808528182607]
Cross-resolution face recognition (CRFR) is important in intelligent surveillance and biometric forensics.
Existing shallow learning-based and deep learning-based methods focus on mapping the HR-LR face pairs into a joint feature space.
In this study, we desire to fully exploit the multi-level deep convolutional neural network (CNN) feature set for robust CRFR.
arXiv Detail & Related papers (2021-03-25T14:03:42Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.