Enhancing Long-Term Person Re-Identification Using Global, Local Body
Part, and Head Streams
- URL: http://arxiv.org/abs/2403.02892v1
- Date: Tue, 5 Mar 2024 11:57:10 GMT
- Title: Enhancing Long-Term Person Re-Identification Using Global, Local Body
Part, and Head Streams
- Authors: Duy Tran Thanh and Yeejin Lee and Byeongkeun Kang
- Abstract summary: We propose a novel framework that effectively learns and utilizes both global and local information.
The proposed framework is trained by backpropagating the weighted summation of the identity classification loss.
- Score: 8.317899947627202
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: This work addresses the task of long-term person re-identification.
Typically, person re-identification assumes that people do not change their
clothes, which limits its applications to short-term scenarios. To overcome
this limitation, we investigate long-term person re-identification, which
considers both clothes-changing and clothes-consistent scenarios. In this
paper, we propose a novel framework that effectively learns and utilizes both
global and local information. The proposed framework consists of three streams:
global, local body part, and head streams. The global and head streams encode
identity-relevant information from an entire image and a cropped image of the
head region, respectively. Both streams encode the most distinct, less
distinct, and average features using the combinations of adversarial erasing,
max pooling, and average pooling. The local body part stream extracts
identity-related information for each body part, allowing it to be compared
with the same body part from another image. Since body part annotations are not
available in re-identification datasets, pseudo-labels are generated using
clustering. These labels are then utilized to train a body part segmentation
head in the local body part stream. The proposed framework is trained by
backpropagating the weighted summation of the identity classification loss, the
pair-based loss, and the pseudo body part segmentation loss. To demonstrate the
effectiveness of the proposed method, we conducted experiments on three
publicly available datasets (Celeb-reID, PRCC, and VC-Clothes). The
experimental results demonstrate that the proposed method outperforms the
previous state-of-the-art method.
Related papers
- Part-aware Unified Representation of Language and Skeleton for Zero-shot Action Recognition [57.97930719585095]
We introduce Part-aware Unified Representation between Language and Skeleton (PURLS) to explore visual-semantic alignment at both local and global scales.
Our approach is evaluated on various skeleton/language backbones and three large-scale datasets.
The results showcase the universality and superior performance of PURLS, surpassing prior skeleton-based solutions and standard baselines from other domains.
arXiv Detail & Related papers (2024-06-19T08:22:32Z) - Exploring Fine-Grained Representation and Recomposition for Cloth-Changing Person Re-Identification [78.52704557647438]
We propose a novel FIne-grained Representation and Recomposition (FIRe$2$) framework to tackle both limitations without any auxiliary annotation or data.
Experiments demonstrate that FIRe$2$ can achieve state-of-the-art performance on five widely-used cloth-changing person Re-ID benchmarks.
arXiv Detail & Related papers (2023-08-21T12:59:48Z) - Body Part-Based Representation Learning for Occluded Person
Re-Identification [102.27216744301356]
Occluded person re-identification (ReID) is a person retrieval task which aims at matching occluded person images with holistic ones.
Part-based methods have been shown beneficial as they offer fine-grained information and are well suited to represent partially visible human bodies.
We propose BPBreID, a body part-based ReID model for solving the above issues.
arXiv Detail & Related papers (2022-11-07T16:48:41Z) - Adaptive Local-Component-aware Graph Convolutional Network for One-shot
Skeleton-based Action Recognition [54.23513799338309]
We present an Adaptive Local-Component-aware Graph Convolutional Network for skeleton-based action recognition.
Our method provides a stronger representation than the global embedding and helps our model reach state-of-the-art.
arXiv Detail & Related papers (2022-09-21T02:33:07Z) - Self-Supervised Video Object Segmentation via Cutout Prediction and
Tagging [117.73967303377381]
We propose a novel self-supervised Video Object (VOS) approach that strives to achieve better object-background discriminability.
Our approach is based on a discriminative learning loss formulation that takes into account both object and background information.
Our proposed approach, CT-VOS, achieves state-of-the-art results on two challenging benchmarks: DAVIS-2017 and Youtube-VOS.
arXiv Detail & Related papers (2022-04-22T17:53:27Z) - Motion-Aware Transformer For Occluded Person Re-identification [1.9899263094148867]
We propose a self-supervised deep learning method to improve the location performance for human parts through occluded person Re-ID.
Unlike previous works, we find that motion information derived from the photos of various human postures can help identify major human body components.
arXiv Detail & Related papers (2022-02-09T02:53:10Z) - Graph-based Person Signature for Person Re-Identifications [17.181807593574764]
We propose a new method to effectively aggregate detailed person descriptions (attributes labels) and visual features (body parts and global features) into a graph.
The graph is integrated into a multi-branch multi-task framework for person re-identification.
Our approach achieves competitive results among the state of the art and outperforms other attribute-based or mask-guided methods.
arXiv Detail & Related papers (2021-04-14T10:54:36Z) - Regional Attention Network (RAN) for Head Pose and Fine-grained Gesture
Recognition [9.131161856493486]
We propose a novel end-to-end textbfRegional Attention Network (RAN), which is a fully Convolutional Neural Network (CNN)
Our regions consist of one or more consecutive cells and are adapted from the strategies used in computing HOG (Histogram of Oriented Gradient) descriptor.
The proposed approach outperforms the state-of-the-art by a considerable margin in different metrics.
arXiv Detail & Related papers (2021-01-17T10:14:28Z) - DomainMix: Learning Generalizable Person Re-Identification Without Human
Annotations [89.78473564527688]
This paper shows how to use labeled synthetic dataset and unlabeled real-world dataset to train a universal model.
In this way, human annotations are no longer required, and it is scalable to large and diverse real-world datasets.
Experimental results show that the proposed annotation-free method is more or less comparable to the counterpart trained with full human annotations.
arXiv Detail & Related papers (2020-11-24T08:15:53Z) - Person Re-identification: Implicitly Defining the Receptive Fields of
Deep Learning Classification Frameworks [5.123298347655088]
This paper describes a solution for implicitly driving the inference of the networks' receptive fields.
We use a segmentation module to distinguish between the foreground (important)/background (irrelevant) parts of each learning instance.
This strategy typically drives the networks to early convergence and appropriate solutions, where the identity and descriptions are not correlated.
arXiv Detail & Related papers (2020-01-30T11:45:44Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.