Using Auxiliary Information for Person Re-Identification -- A Tutorial
Overview
- URL: http://arxiv.org/abs/2211.08565v1
- Date: Tue, 15 Nov 2022 23:12:36 GMT
- Title: Using Auxiliary Information for Person Re-Identification -- A Tutorial
Overview
- Authors: Tharindu Fernando, Clinton Fookes, Sridha Sridharan, Dana Michalski
- Abstract summary: This paper explores the fusion of multiple information to generate a more discriminant person descriptor.
It is the first work that explores the fusion of multiple information to generate a more discriminant person descriptor.
- Score: 32.67404002095918
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Person re-identification (re-id) is a pivotal task within an intelligent
surveillance pipeline and there exist numerous re-id frameworks that achieve
satisfactory performance in challenging benchmarks. However, these systems
struggle to generate acceptable results when there are significant differences
between the camera views, illumination conditions, or occlusions. This result
can be attributed to the deficiency that exists within many recently proposed
re-id pipelines where they are predominately driven by appearance-based
features and little attention is paid to other auxiliary information that could
aid the re-id. In this paper, we systematically review the current
State-Of-The-Art (SOTA) methods in both uni-modal and multimodal person re-id.
Extending beyond a conceptual framework, we illustrate how the existing SOTA
methods can be extended to support these additional auxiliary information and
quantitatively evaluate the utility of such auxiliary feature information,
ranging from logos printed on the objects carried by the subject or printed on
the clothes worn by the subject, through to his or her behavioural
trajectories. To the best of our knowledge, this is the first work that
explores the fusion of multiple information to generate a more discriminant
person descriptor and the principal aim of this paper is to provide a thorough
theoretical analysis regarding the implementation of such a framework. In
addition, using model interpretation techniques, we validate the contributions
from different combinations of the auxiliary information versus the original
features that the SOTA person re-id models extract. We outline the limitations
of the proposed approaches and propose future research directions that could be
pursued to advance the area of multi-modal person re-id.
Related papers
- A Comprehensive Survey on Underwater Image Enhancement Based on Deep Learning [51.7818820745221]
Underwater image enhancement (UIE) presents a significant challenge within computer vision research.
Despite the development of numerous UIE algorithms, a thorough and systematic review is still absent.
arXiv Detail & Related papers (2024-05-30T04:46:40Z) - Multi-Prompts Learning with Cross-Modal Alignment for Attribute-based
Person Re-Identification [18.01407937934588]
We present a new framework called Multi-Prompts ReID (MP-ReID) based on prompt learning and language models.
MP-ReID learns to hallucinate diverse, informative, and promptable sentences for describing the query images.
Explicit prompts are obtained by ensembling generation models, such as ChatGPT and VQA models.
arXiv Detail & Related papers (2023-12-28T03:00:19Z) - Learning Cross-modality Information Bottleneck Representation for
Heterogeneous Person Re-Identification [61.49219876388174]
Visible-Infrared person re-identification (VI-ReID) is an important and challenging task in intelligent video surveillance.
Existing methods mainly focus on learning a shared feature space to reduce the modality discrepancy between visible and infrared modalities.
We present a novel mutual information and modality consensus network, namely CMInfoNet, to extract modality-invariant identity features.
arXiv Detail & Related papers (2023-08-29T06:55:42Z) - Visual Information Extraction in the Wild: Practical Dataset and
End-to-end Solution [48.693941280097974]
We propose a large-scale dataset consisting of camera images for visual information extraction (VIE)
We propose a novel framework for end-to-end VIE that combines the stages of OCR and information extraction in an end-to-end learning fashion.
We evaluate the existing end-to-end methods for VIE on the proposed dataset and observe that the performance of these methods has a distinguishable drop from SROIE to our proposed dataset due to the larger variance of layout and entities.
arXiv Detail & Related papers (2023-05-12T14:11:47Z) - Robust Saliency-Aware Distillation for Few-shot Fine-grained Visual
Recognition [57.08108545219043]
Recognizing novel sub-categories with scarce samples is an essential and challenging research topic in computer vision.
Existing literature addresses this challenge by employing local-based representation approaches.
This article proposes a novel model, Robust Saliency-aware Distillation (RSaD), for few-shot fine-grained visual recognition.
arXiv Detail & Related papers (2023-05-12T00:13:17Z) - Person Re-identification: A Retrospective on Domain Specific Open
Challenges and Future Trends [2.4907242954727926]
Person re-identification (Re-ID) is one of the primary components of an automated visual surveillance system.
It aims to automatically identify/search persons in a multi-camera network having non-overlapping field-of-views.
arXiv Detail & Related papers (2022-02-26T11:55:57Z) - Weakly-Supervised Spatio-Temporal Anomaly Detection in Surveillance
Video [128.41392860714635]
We introduce Weakly-Supervised Snoma-Temporally Detection (WSSTAD) in surveillance video.
WSSTAD aims to localize a-temporal tube (i.e. sequence of bounding boxes at consecutive times) that encloses abnormal event.
We propose a dual-branch network which takes as input proposals with multi-granularities in both spatial-temporal domains.
arXiv Detail & Related papers (2021-08-09T06:11:14Z) - Explainable Recommender Systems via Resolving Learning Representations [57.24565012731325]
Explanations could help improve user experience and discover system defects.
We propose a novel explainable recommendation model through improving the transparency of the representation learning process.
arXiv Detail & Related papers (2020-08-21T05:30:48Z) - Survey on Reliable Deep Learning-Based Person Re-Identification Models:
Are We There Yet? [19.23187114221822]
Person re-identification (PReID) is one of the most critical problems in intelligent video-surveillance (IVS)
Deep neural networks (DNNs) given their compelling performance on similar vision problems and fast execution at test time.
We present descriptions of each model along with their evaluation on a set of benchmark datasets.
arXiv Detail & Related papers (2020-04-30T16:09:16Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.