Robust Person Re-Identification through Contextual Mutual Boosting
- URL: http://arxiv.org/abs/2009.07491v1
- Date: Wed, 16 Sep 2020 06:33:35 GMT
- Title: Robust Person Re-Identification through Contextual Mutual Boosting
- Authors: Zhikang Wang, Lihuo He, Xinbo Gao, Jane Shen
- Abstract summary: We propose the Contextual Mutual Boosting Network (CMBN) to localize pedestrians.
It localizes pedestrians and recalibrates features by effectively exploiting contextual information and statistical inference.
Experiments on the benchmarks demonstrate the superiority of the architecture compared the state-of-the-art.
- Score: 77.1976737965566
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Person Re-Identification (Re-ID) has witnessed great advance, driven by the
development of deep learning. However, modern person Re-ID is still challenged
by background clutter, occlusion and large posture variation which are common
in practice. Previous methods tackle these challenges by localizing pedestrians
through external cues (e.g., pose estimation, human parsing) or attention
mechanism, suffering from high computation cost and increased model complexity.
In this paper, we propose the Contextual Mutual Boosting Network (CMBN). It
localizes pedestrians and recalibrates features by effectively exploiting
contextual information and statistical inference. Firstly, we construct two
branches with a shared convolutional frontend to learn the foreground and
background features respectively. By enabling interaction between these two
branches, they boost the accuracy of the spatial localization mutually.
Secondly, starting from a statistical perspective, we propose the Mask
Generator that exploits the activation distribution of the transformation
matrix for generating the static channel mask to the representations. The mask
recalibrates the features to amplify the valuable characteristics and diminish
the noise. Finally, we propose the Contextual-Detachment Strategy to optimize
the two branches jointly and independently, which further enhances the
localization precision. Experiments on the benchmarks demonstrate the
superiority of the architecture compared the state-of-the-art.
Related papers
- Robust Ensemble Person Re-Identification via Orthogonal Fusion with Occlusion Handling [4.431087385310259]
Occlusion remains one of the major challenges in person reidentification (ReID)
We propose a deep ensemble model that harnesses both CNN and Transformer architectures to generate robust feature representations.
arXiv Detail & Related papers (2024-03-29T18:38:59Z) - UGMAE: A Unified Framework for Graph Masked Autoencoders [67.75493040186859]
We propose UGMAE, a unified framework for graph masked autoencoders.
We first develop an adaptive feature mask generator to account for the unique significance of nodes.
We then design a ranking-based structure reconstruction objective joint with feature reconstruction to capture holistic graph information.
arXiv Detail & Related papers (2024-02-12T19:39:26Z) - Mutual Distillation Learning For Person Re-Identification [27.350415735863184]
We propose a novel approach, Mutual Distillation Learning For Person Re-identification (termed as MDPR)
Our approach encompasses two branches: a hard content branch to extract local features via a uniform horizontal partitioning strategy and a Soft Content Branch to dynamically distinguish between foreground and background.
Our method achieves an impressive $88.7%/94.4%$ in mAP/Rank-1 on the DukeC-reID dataset, surpassing the current state-of-the-art results.
arXiv Detail & Related papers (2024-01-12T07:49:02Z) - PersonMAE: Person Re-Identification Pre-Training with Masked
AutoEncoders [132.60355401780407]
Pre-training is playing an increasingly important role in learning generic feature representation for Person Re-identification (ReID)
We propose PersonMAE, which involves two core designs into masked autoencoders to better serve the task of Person Re-ID.
PersonMAE with ViT-B backbone achieves 79.8% and 69.5% mAP on the MSMT17 and OccDuke datasets.
arXiv Detail & Related papers (2023-11-08T07:02:27Z) - Object Segmentation by Mining Cross-Modal Semantics [68.88086621181628]
We propose a novel approach by mining the Cross-Modal Semantics to guide the fusion and decoding of multimodal features.
Specifically, we propose a novel network, termed XMSNet, consisting of (1) all-round attentive fusion (AF), (2) coarse-to-fine decoder (CFD), and (3) cross-layer self-supervision.
arXiv Detail & Related papers (2023-05-17T14:30:11Z) - Dynamic Prototype Mask for Occluded Person Re-Identification [88.7782299372656]
Existing methods mainly address this issue by employing body clues provided by an extra network to distinguish the visible part.
We propose a novel Dynamic Prototype Mask (DPM) based on two self-evident prior knowledge.
Under this condition, the occluded representation could be well aligned in a selected subspace spontaneously.
arXiv Detail & Related papers (2022-07-19T03:31:13Z) - Gait Recognition with Mask-based Regularization [31.901166591272464]
We propose a novel mask-based regularization method named ReverseMask.
By injecting on the feature map, the proposed regularization method helps convolutional architecture learn the discriminative representations.
The plug-and-play Inception-like ReverseMask block is simple and effective to generalize networks.
arXiv Detail & Related papers (2022-03-08T12:13:29Z) - Short Range Correlation Transformer for Occluded Person
Re-Identification [4.339510167603376]
We propose a partial feature transformer-based person re-identification framework named PFT.
The proposed PFT utilizes three modules to enhance the efficiency of vision transformer.
Experimental results over occluded and holistic re-identification datasets demonstrate that the proposed PFT network achieves superior performance consistently.
arXiv Detail & Related papers (2022-01-04T11:12:39Z) - A Self-Supervised Gait Encoding Approach with Locality-Awareness for 3D
Skeleton Based Person Re-Identification [65.18004601366066]
Person re-identification (Re-ID) via gait features within 3D skeleton sequences is a newly-emerging topic with several advantages.
This paper proposes a self-supervised gait encoding approach that can leverage unlabeled skeleton data to learn gait representations for person Re-ID.
arXiv Detail & Related papers (2020-09-05T16:06:04Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.