Unleashing the Potential of Unsupervised Pre-Training with
Intra-Identity Regularization for Person Re-Identification
- URL: http://arxiv.org/abs/2112.00317v1
- Date: Wed, 1 Dec 2021 07:16:37 GMT
- Title: Unleashing the Potential of Unsupervised Pre-Training with
Intra-Identity Regularization for Person Re-Identification
- Authors: Zizheng Yang, Xin Jin, Kecheng Zheng, Feng Zhao
- Abstract summary: We design an Unsupervised Pre-training framework for ReID based on the contrastive learning (CL) pipeline, dubbed UP-ReID.
We introduce an intra-identity (I$2$-)regularization in the UP-ReID, which is instantiated as two constraints coming from global image aspect and local patch aspect.
Our UP-ReID pre-trained model can significantly benefit the downstream ReID fine-tuning and achieve state-of-the-art performance.
- Score: 10.045028405219641
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Existing person re-identification (ReID) methods typically directly load the
pre-trained ImageNet weights for initialization. However, as a fine-grained
classification task, ReID is more challenging and exists a large domain gap
between ImageNet classification. Inspired by the great success of
self-supervised representation learning with contrastive objectives, in this
paper, we design an Unsupervised Pre-training framework for ReID based on the
contrastive learning (CL) pipeline, dubbed UP-ReID. During the pre-training, we
attempt to address two critical issues for learning fine-grained ReID features:
(1) the augmentations in CL pipeline may distort the discriminative clues in
person images. (2) the fine-grained local features of person images are not
fully-explored. Therefore, we introduce an intra-identity
(I$^2$-)regularization in the UP-ReID, which is instantiated as two constraints
coming from global image aspect and local patch aspect: a global consistency is
enforced between augmented and original person images to increase robustness to
augmentation, while an intrinsic contrastive constraint among local patches of
each image is employed to fully explore the local discriminative clues.
Extensive experiments on multiple popular Re-ID datasets, including PersonX,
Market1501, CUHK03, and MSMT17, demonstrate that our UP-ReID pre-trained model
can significantly benefit the downstream ReID fine-tuning and achieve
state-of-the-art performance. Codes and models will be released to
https://github.com/Frost-Yang-99/UP-ReID.
Related papers
- Exploring Stronger Transformer Representation Learning for Occluded Person Re-Identification [2.552131151698595]
We proposed a novel self-supervision and supervision combining transformer-based person re-identification framework, namely SSSC-TransReID.
We designed a self-supervised contrastive learning branch, which can enhance the feature representation for person re-identification without negative samples or additional pre-training.
Our proposed model obtains superior Re-ID performance consistently and outperforms the state-of-the-art ReID methods by large margins on the mean average accuracy (mAP) and Rank-1 accuracy.
arXiv Detail & Related papers (2024-10-21T03:17:25Z) - Synthesizing Efficient Data with Diffusion Models for Person Re-Identification Pre-Training [51.87027943520492]
We present a novel paradigm Diffusion-ReID to efficiently augment and generate diverse images based on known identities.
Benefiting from our proposed paradigm, we first create a new large-scale person Re-ID dataset Diff-Person, which consists of over 777K images from 5,183 identities.
arXiv Detail & Related papers (2024-06-10T06:26:03Z) - DiffBIR: Towards Blind Image Restoration with Generative Diffusion Prior [70.46245698746874]
We present DiffBIR, a general restoration pipeline that could handle different blind image restoration tasks.
DiffBIR decouples blind image restoration problem into two stages: 1) degradation removal: removing image-independent content; 2) information regeneration: generating the lost image content.
In the first stage, we use restoration modules to remove degradations and obtain high-fidelity restored results.
For the second stage, we propose IRControlNet that leverages the generative ability of latent diffusion models to generate realistic details.
arXiv Detail & Related papers (2023-08-29T07:11:52Z) - Learning Invariance from Generated Variance for Unsupervised Person
Re-identification [15.096776375794356]
We propose to replace traditional data augmentation with a generative adversarial network (GAN)
A 3D mesh guided person image generator is proposed to disentangle a person image into id-related and id-unrelated features.
By jointly training the generative and the contrastive modules, our method achieves new state-of-the-art unsupervised person ReID performance on mainstream large-scale benchmarks.
arXiv Detail & Related papers (2023-01-02T15:40:14Z) - VTBR: Semantic-based Pretraining for Person Re-Identification [14.0819152482295]
We propose a pure semantic-based pretraining approach named VTBR.
We train convolutional networks from scratch on the captions of FineGPR-C dataset, and transfer them to downstream Re-ID tasks.
arXiv Detail & Related papers (2021-10-11T08:19:45Z) - Unsupervised Pretraining for Object Detection by Patch Reidentification [72.75287435882798]
Unsupervised representation learning achieves promising performances in pre-training representations for object detectors.
This work proposes a simple yet effective representation learning method for object detection, named patch re-identification (Re-ID)
Our method significantly outperforms its counterparts on COCO in all settings, such as different training iterations and data percentages.
arXiv Detail & Related papers (2021-03-08T15:13:59Z) - Unsupervised Pre-training for Person Re-identification [90.98552221699508]
We present a large scale unlabeled person re-identification (Re-ID) dataset "LUPerson"
We make the first attempt of performing unsupervised pre-training for improving the generalization ability of the learned person Re-ID feature representation.
arXiv Detail & Related papers (2020-12-07T14:48:26Z) - High-Order Information Matters: Learning Relation and Topology for
Occluded Person Re-Identification [84.43394420267794]
We propose a novel framework by learning high-order relation and topology information for discriminative features and robust alignment.
Our framework significantly outperforms state-of-the-art by6.5%mAP scores on Occluded-Duke dataset.
arXiv Detail & Related papers (2020-03-18T12:18:35Z) - Cross-Resolution Adversarial Dual Network for Person Re-Identification
and Beyond [59.149653740463435]
Person re-identification (re-ID) aims at matching images of the same person across camera views.
Due to varying distances between cameras and persons of interest, resolution mismatch can be expected.
We propose a novel generative adversarial network to address cross-resolution person re-ID.
arXiv Detail & Related papers (2020-02-19T07:21:38Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.