Unsupervised Person Re-identification via Simultaneous Clustering and
Consistency Learning
- URL: http://arxiv.org/abs/2104.00202v1
- Date: Thu, 1 Apr 2021 02:10:42 GMT
- Title: Unsupervised Person Re-identification via Simultaneous Clustering and
Consistency Learning
- Authors: Junhui Yin, Jiayan Qiu, Siqing Zhang, Jiyang Xie, Zhanyu Ma, and Jun
Guo
- Abstract summary: We design a pretext task for unsupervised re-ID by learning visual consistency from still images and temporal consistency during training process.
We optimize the model by grouping the two encoded views into same cluster, thus enhancing the visual consistency between views.
- Score: 22.008371113710137
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Unsupervised person re-identification (re-ID) has become an important topic
due to its potential to resolve the scalability problem of supervised re-ID
models. However, existing methods simply utilize pseudo labels from clustering
for supervision and thus have not yet fully explored the semantic information
in data itself, which limits representation capabilities of learned models. To
address this problem, we design a pretext task for unsupervised re-ID by
learning visual consistency from still images and temporal consistency during
training process, such that the clustering network can separate the images into
semantic clusters automatically. Specifically, the pretext task learns
semantically meaningful representations by maximizing the agreement between two
encoded views of the same image via a consistency loss in latent space.
Meanwhile, we optimize the model by grouping the two encoded views into same
cluster, thus enhancing the visual consistency between views. Experiments on
Market-1501, DukeMTMC-reID and MSMT17 datasets demonstrate that our proposed
approach outperforms the state-of-the-art methods by large margins.
Related papers
- Pose-Transformation and Radial Distance Clustering for Unsupervised Person Re-identification [5.522856885199346]
Person re-identification (re-ID) aims to tackle the problem of matching identities across non-overlapping cameras.
Supervised approaches require identity information that may be difficult to obtain and are inherently biased towards the dataset they are trained on.
We propose an unsupervised approach to the person re-ID setup. Having zero knowledge of true labels, our proposed method enhances the discriminating ability of the learned features.
arXiv Detail & Related papers (2024-11-06T20:55:30Z) - Discriminative Anchor Learning for Efficient Multi-view Clustering [59.11406089896875]
We propose discriminative anchor learning for multi-view clustering (DALMC)
We learn discriminative view-specific feature representations according to the original dataset.
We build anchors from different views based on these representations, which increase the quality of the shared anchor graph.
arXiv Detail & Related papers (2024-09-25T13:11:17Z) - Self Supervised Correlation-based Permutations for Multi-View Clustering [7.972599673048582]
We propose an end-to-end deep learning-based MVC framework for general data.
Our approach involves learning meaningful fused data representations with a novel permutation-based canonical correlation objective.
We demonstrate the effectiveness of our model using ten MVC benchmark datasets.
arXiv Detail & Related papers (2024-02-26T08:08:30Z) - CGUA: Context-Guided and Unpaired-Assisted Weakly Supervised Person
Search [54.106662998673514]
We introduce a Context-Guided and Unpaired-Assisted (CGUA) weakly supervised person search framework.
Specifically, we propose a novel Context-Guided Cluster (CGC) algorithm to leverage context information in the clustering process.
Our method achieves comparable or better performance to the state-of-the-art supervised methods by leveraging more diverse unlabeled data.
arXiv Detail & Related papers (2022-03-27T13:57:30Z) - UniVIP: A Unified Framework for Self-Supervised Visual Pre-training [50.87603616476038]
We propose a novel self-supervised framework to learn versatile visual representations on either single-centric-object or non-iconic dataset.
Massive experiments show that UniVIP pre-trained on non-iconic COCO achieves state-of-the-art transfer performance.
Our method can also exploit single-centric-object dataset such as ImageNet and outperforms BYOL by 2.5% with the same pre-training epochs in linear probing.
arXiv Detail & Related papers (2022-03-14T10:04:04Z) - Mind Your Clever Neighbours: Unsupervised Person Re-identification via
Adaptive Clustering Relationship Modeling [19.532602887109668]
Unsupervised person re-identification (Re-ID) attracts increasing attention due to its potential to resolve the scalability problem of supervised Re-ID models.
Most existing unsupervised methods adopt an iterative clustering mechanism, where the network was trained based on pseudo labels generated by unsupervised clustering.
To generate high-quality pseudo-labels and mitigate the impact of clustering errors, we propose a novel clustering relationship modeling framework for unsupervised person Re-ID.
arXiv Detail & Related papers (2021-12-03T10:55:07Z) - Camera-aware Proxies for Unsupervised Person Re-Identification [60.26031011794513]
This paper tackles the purely unsupervised person re-identification (Re-ID) problem that requires no annotations.
We propose to split each single cluster into multiple proxies and each proxy represents the instances coming from the same camera.
Based on the camera-aware proxies, we design both intra- and inter-camera contrastive learning components for our Re-ID model.
arXiv Detail & Related papers (2020-12-19T12:37:04Z) - CoADNet: Collaborative Aggregation-and-Distribution Networks for
Co-Salient Object Detection [91.91911418421086]
Co-Salient Object Detection (CoSOD) aims at discovering salient objects that repeatedly appear in a given query group containing two or more relevant images.
One challenging issue is how to effectively capture co-saliency cues by modeling and exploiting inter-image relationships.
We present an end-to-end collaborative aggregation-and-distribution network (CoADNet) to capture both salient and repetitive visual patterns from multiple images.
arXiv Detail & Related papers (2020-11-10T04:28:11Z) - Temporal Continuity Based Unsupervised Learning for Person
Re-Identification [15.195514083289801]
We propose an unsupervised center-based clustering approach capable of progressively learning and exploiting the underlying re-id discriminative information.
We call our framework Temporal Continuity based Unsupervised Learning (TCUL)
Specifically, TCUL simultaneously does center based clustering of unlabeled (target) dataset and fine-tunes a convolutional neural network (CNN) pre-trained on irrelevant labeled (source) dataset.
It exploits temporally continuous nature of images within-camera jointly with spatial similarity of feature maps across-cameras to generate reliable pseudo-labels for training a re-identification model.
arXiv Detail & Related papers (2020-09-01T05:29:30Z) - Unsupervised Person Re-identification via Softened Similarity Learning [122.70472387837542]
Person re-identification (re-ID) is an important topic in computer vision.
This paper studies the unsupervised setting of re-ID, which does not require any labeled information.
Experiments on two image-based and video-based datasets demonstrate state-of-the-art performance.
arXiv Detail & Related papers (2020-04-07T17:16:41Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.