Using Auxiliary Information for Person Re-Identification -- A Tutorial Overview
- URL: http://arxiv.org/abs/2211.08565v1
- Date: Tue, 15 Nov 2022 23:12:36 GMT
- Title: Using Auxiliary Information for Person Re-Identification -- A Tutorial Overview
- Authors: Tharindu Fernando, Clinton Fookes, Sridha Sridharan, Dana Michalski
- Abstract summary: This paper reviews uni-modal and multimodal person re-id methods and, to the best of the authors' knowledge, is the first work to explore fusing multiple sources of auxiliary information into a more discriminative person descriptor.
- Score: 32.67404002095918
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Person re-identification (re-id) is a pivotal task within an intelligent
surveillance pipeline and there exist numerous re-id frameworks that achieve
satisfactory performance in challenging benchmarks. However, these systems
struggle to generate acceptable results when there are significant differences
between the camera views, illumination conditions, or occlusions. This result
can be attributed to the deficiency that exists within many recently proposed
re-id pipelines where they are predominately driven by appearance-based
features and little attention is paid to other auxiliary information that could
aid the re-id. In this paper, we systematically review the current
State-Of-The-Art (SOTA) methods in both uni-modal and multimodal person re-id.
Extending beyond a conceptual framework, we illustrate how the existing SOTA
methods can be extended to support this additional auxiliary information and
quantitatively evaluate the utility of such auxiliary feature information,
ranging from logos printed on the objects carried by the subject or printed on
the clothes worn by the subject, through to his or her behavioural
trajectories. To the best of our knowledge, this is the first work that
explores the fusion of multiple information to generate a more discriminant
person descriptor and the principal aim of this paper is to provide a thorough
theoretical analysis regarding the implementation of such a framework. In
addition, using model interpretation techniques, we validate the contributions
from different combinations of the auxiliary information versus the original
features that the SOTA person re-id models extract. We outline the limitations
of the proposed approaches and propose future research directions that could be
pursued to advance the area of multi-modal person re-id.
Related papers
- Where Do We Stand with Implicit Neural Representations? A Technical and Performance Survey [16.89460694470542]
Implicit Neural Representations (INRs) have emerged as a paradigm in knowledge representation.
INRs leverage multilayer perceptrons (MLPs) to model data as continuous implicit functions.
This survey introduces a clear taxonomy that categorises them into four key areas: activation functions, position encoding, combined strategies, and network structure.
arXiv Detail & Related papers (2024-11-06T06:14:24Z)
- A Survey on All-in-One Image Restoration: Taxonomy, Evaluation and Future Trends [67.43992456058541]
Image restoration (IR) refers to the process of improving the visual quality of images while removing degradations such as noise, blur, and weather effects.
Traditional IR methods typically target specific types of degradation, which limits their effectiveness in real-world scenarios with complex distortions.
The all-in-one image restoration (AiOIR) paradigm has emerged, offering a unified framework that adeptly addresses multiple degradation types.
arXiv Detail & Related papers (2024-10-19T11:11:09Z)
- A Comprehensive Survey on Underwater Image Enhancement Based on Deep Learning [51.7818820745221]
Underwater image enhancement (UIE) presents a significant challenge within computer vision research.
Despite the development of numerous UIE algorithms, a thorough and systematic review is still absent.
arXiv Detail & Related papers (2024-05-30T04:46:40Z)
- Learning Cross-modality Information Bottleneck Representation for Heterogeneous Person Re-Identification [61.49219876388174]
Visible-Infrared person re-identification (VI-ReID) is an important and challenging task in intelligent video surveillance.
Existing methods mainly focus on learning a shared feature space to reduce the modality discrepancy between visible and infrared modalities.
We present a novel mutual information and modality consensus network, namely CMInfoNet, to extract modality-invariant identity features.
arXiv Detail & Related papers (2023-08-29T06:55:42Z)
- Visual Information Extraction in the Wild: Practical Dataset and End-to-end Solution [48.693941280097974]
We propose a large-scale dataset consisting of camera images for visual information extraction (VIE).
We also propose a novel framework for end-to-end VIE that jointly learns the OCR and information extraction stages.
We evaluate the existing end-to-end methods for VIE on the proposed dataset and observe that the performance of these methods has a distinguishable drop from SROIE to our proposed dataset due to the larger variance of layout and entities.
arXiv Detail & Related papers (2023-05-12T14:11:47Z)
- Robust Saliency-Aware Distillation for Few-shot Fine-grained Visual Recognition [57.08108545219043]
Recognizing novel sub-categories with scarce samples is an essential and challenging research topic in computer vision.
Existing literature addresses this challenge by employing local-based representation approaches.
This article proposes a novel model, Robust Saliency-aware Distillation (RSaD), for few-shot fine-grained visual recognition.
arXiv Detail & Related papers (2023-05-12T00:13:17Z)
- Person Re-identification: A Retrospective on Domain Specific Open Challenges and Future Trends [2.4907242954727926]
Person re-identification (Re-ID) is one of the primary components of an automated visual surveillance system.
It aims to automatically identify/search persons in a multi-camera network having non-overlapping field-of-views.
arXiv Detail & Related papers (2022-02-26T11:55:57Z)
- Explainable Recommender Systems via Resolving Learning Representations [57.24565012731325]
Explanations could help improve user experience and discover system defects.
We propose a novel explainable recommendation model through improving the transparency of the representation learning process.
arXiv Detail & Related papers (2020-08-21T05:30:48Z)
- Survey on Reliable Deep Learning-Based Person Re-Identification Models: Are We There Yet? [19.23187114221822]
Person re-identification (PReID) is one of the most critical problems in intelligent video-surveillance (IVS).
More recently, researchers have used deep neural networks (DNNs), given their compelling performance on similar vision problems and fast execution at test time.
We present descriptions of each model along with their evaluation on a set of benchmark datasets.
arXiv Detail & Related papers (2020-04-30T16:09:16Z)
This list is automatically generated from the titles and abstracts of the papers on this site.