Related papers: Cross-Camera Feature Prediction for Intra-Camera Supervised Person Re-identification across Distant Scenes

URL: http://arxiv.org/abs/2107.13904v1
Date: Thu, 29 Jul 2021 11:27:50 GMT
Title: Cross-Camera Feature Prediction for Intra-Camera Supervised Person Re-identification across Distant Scenes
Authors: Wenhang Ge, Chunyan Pan, Ancong Wu, Hongwei Zheng, Wei-Shi Zheng
Abstract summary: Person re-identification (Re-ID) aims to match person images across non-overlapping camera views. ICS-DS Re-ID uses cross-camera unpaired data with intra-camera identity labels for training. A cross-camera feature prediction method mines cross-camera self-supervision information. Joint learning of global-level and local-level features forms a global-local cross-camera feature prediction scheme.
Score: 70.30052164401178
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Person re-identification (Re-ID) aims to match person images across non-overlapping camera views. The majority of Re-ID methods focus on small-scale surveillance systems in which each pedestrian is captured in different camera views of adjacent scenes. However, in large-scale surveillance systems that cover larger areas, it is required to track a pedestrian of interest across distant scenes (e.g., a criminal suspect escapes from one city to another). Since most pedestrians appear in limited local areas, it is difficult to collect training data with cross-camera pairs of the same person. In this work, we study intra-camera supervised person re-identification across distant scenes (ICS-DS Re-ID), which uses cross-camera unpaired data with intra-camera identity labels for training. It is challenging because cross-camera paired data plays a crucial role in learning camera-invariant features in most existing Re-ID methods. To learn camera-invariant representations from cross-camera unpaired training data, we propose a cross-camera feature prediction method that mines cross-camera self-supervision information from camera-specific feature distributions by transforming fake cross-camera positive feature pairs and minimizing the distances of the fake pairs. Furthermore, we automatically localize and extract local-level features with a transformer. Joint learning of global-level and local-level features forms a global-local cross-camera feature prediction scheme for mining fine-grained cross-camera self-supervision information. Finally, cross-camera self-supervision and intra-camera supervision are aggregated in a framework. The experiments are conducted in the ICS-DS setting on the Market-SCT, Duke-SCT and MSMT17-SCT datasets. The evaluation results demonstrate the superiority of our method, which gains significant improvements of 15.4 Rank-1 and 22.3 mAP on Market-SCT compared with the second-best method.
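
The idea of building fake cross-camera positive pairs from camera-specific feature distributions and then minimizing their distance can be illustrated with a small sketch. The abstract does not give the actual transformation, so the code below substitutes a simple per-camera mean/std re-standardization as a stand-in; it is not the authors' method. All function names (`camera_stats`, `predict_cross_camera`, `fake_pair_loss`) and the synthetic features are hypothetical.

```python
import numpy as np

def camera_stats(features, eps=1e-6):
    """Per-dimension mean and std of one camera's features, shape [n, d]."""
    return features.mean(axis=0), features.std(axis=0) + eps

def predict_cross_camera(features, src_stats, dst_stats):
    """Re-standardize source-camera features to the target camera's statistics.

    Assumption: a mean/std shift stands in for the paper's feature prediction.
    """
    src_mean, src_std = src_stats
    dst_mean, dst_std = dst_stats
    return (features - src_mean) / src_std * dst_std + dst_mean

def fake_pair_loss(features, predicted):
    """Mean squared Euclidean distance between each feature and its fake positive."""
    return float(np.mean(np.sum((features - predicted) ** 2, axis=1)))

# Synthetic, unpaired features from two cameras with different distributions.
rng = np.random.default_rng(0)
feats_a = rng.normal(loc=0.0, scale=1.0, size=(64, 8))
feats_b = rng.normal(loc=2.0, scale=0.5, size=(64, 8))

stats_a, stats_b = camera_stats(feats_a), camera_stats(feats_b)

# Each camera-A feature gets a fake positive that looks like a camera-B feature.
fake_b = predict_cross_camera(feats_a, stats_a, stats_b)

# In training, this distance would be minimized with respect to the feature
# extractor, pushing it toward camera-invariant features.
loss = fake_pair_loss(feats_a, fake_b)
print(f"fake-pair loss: {loss:.3f}")
```

In this sketch the loss is large when the two cameras have very different feature statistics and approaches zero when they coincide, which is the sense in which minimizing it encourages camera invariance without any real cross-camera pairs.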

Related papers

CLIP-based Camera-Agnostic Feature Learning for Intra-camera Person Re-Identification [11.882424627567998]
We propose a novel framework called CLIP-based Camera-Agnostic Feature Learning (CCAFL) for ICS ReID. Two custom modules are designed to guide the model to actively learn camera-agnostic pedestrian features. In experiments on popular ReID datasets, we arrive at 58.9% in terms of mAP accuracy, surpassing state-of-the-art methods by 7.6%.
arXiv Detail & Related papers (2024-09-29T05:43:01Z)
Learning Intra and Inter-Camera Invariance for Isolated Camera Supervised Person Re-identification [6.477096324232456]
Cross-camera images are prone to being recognized as different IDs simply because of camera style. This paper studies person re-ID under such an isolated camera supervised (ISCS) setting.
arXiv Detail & Related papers (2023-11-02T11:32:40Z)
Domain-adaptive Person Re-identification without Cross-camera Paired Samples [12.041823465553875]
Cross-camera pedestrian samples collected from long-distance scenes often have no positive samples. It is extremely challenging to use cross-camera negative samples to achieve cross-region pedestrian identity matching. A novel domain-adaptive person re-ID method that focuses on cross-camera consistent discriminative feature learning is proposed.
arXiv Detail & Related papers (2023-07-13T02:42:28Z)
Enhancing Multi-Camera People Tracking with Anchor-Guided Clustering and Spatio-Temporal Consistency ID Re-Assignment [22.531044994763487]
We propose a novel multi-camera multiple people tracking method that uses anchor-guided clustering for cross-camera ID re-assignment. Our approach aims to improve tracking accuracy by identifying key features that are unique to every individual. The method has demonstrated robustness and effectiveness in handling both synthetic and real-world data.
arXiv Detail & Related papers (2023-04-19T07:38:15Z)
Cross-Camera Trajectories Help Person Retrieval in a Camera Network [124.65912458467643]
Existing methods often rely on purely visual matching or consider temporal constraints but ignore the spatial information of the camera network. We propose a pedestrian retrieval framework based on cross-camera generation, which integrates both temporal and spatial information. To verify the effectiveness of our method, we construct the first cross-camera pedestrian trajectory dataset.
arXiv Detail & Related papers (2022-04-27T13:10:48Z)
SurroundDepth: Entangling Surrounding Views for Self-Supervised Multi-Camera Depth Estimation [101.55622133406446]
We propose a SurroundDepth method to incorporate the information from multiple surrounding views to predict depth maps across cameras. Specifically, we employ a joint network to process all the surrounding views and propose a cross-view transformer to effectively fuse the information from multiple views. In experiments, our method achieves the state-of-the-art performance on the challenging multi-camera depth estimation datasets.
arXiv Detail & Related papers (2022-04-07T17:58:47Z)
Joint Noise-Tolerant Learning and Meta Camera Shift Adaptation for Unsupervised Person Re-Identification [60.36551512902312]
Unsupervised person re-identification (re-ID) aims to learn discriminative models with unlabeled data. One popular method is to obtain pseudo-labels by clustering and use them to optimize the model. In this paper, we propose a unified framework to solve both problems.
arXiv Detail & Related papers (2021-03-08T09:13:06Z)
Towards Precise Intra-camera Supervised Person Re-identification [54.86892428155225]
Intra-camera supervision (ICS) for person re-identification (Re-ID) assumes that identity labels are independently annotated within each camera view. The lack of inter-camera labels makes the ICS Re-ID problem much more challenging than its fully supervised counterpart. Our approach even performs comparably to state-of-the-art fully supervised methods on two of the datasets.
arXiv Detail & Related papers (2020-02-12T11:56:30Z)
Rethinking the Distribution Gap of Person Re-identification with Camera-based Batch Normalization [90.9485099181197]
This paper rethinks the working mechanism of conventional ReID approaches. We force the image data of all cameras to fall onto the same subspace, so that the distribution gap between any camera pair is largely shrunk. Experiments on a wide range of ReID tasks demonstrate the effectiveness of our approach.
arXiv Detail & Related papers (2020-01-23T17:22:34Z)

This list is automatically generated from the titles and abstracts of the papers on this site.