Deep Learning based Person Re-identification
- URL: http://arxiv.org/abs/2005.03293v1
- Date: Thu, 7 May 2020 07:30:28 GMT
- Title: Deep Learning based Person Re-identification
- Authors: Nirbhay Kumar Tagore, Ayushman Singh, Sumanth Manche, Pratik
Chattopadhyay
- Abstract summary: We propose an efficient hierarchical re-identification approach in which color histogram based comparison is first employed to find the closest matches in the gallery set.
A silhouette part-based feature extraction scheme is adopted in each level of hierarchy to preserve the relative locations of the different body structures.
Results reveal that it outperforms most state-of-the-art approaches in terms of overall accuracy.
- Score: 2.9631016562930546
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Automated person re-identification in a multi-camera surveillance setup is
very important for effective tracking and monitoring crowd movement. In the
recent years, few deep learning based re-identification approaches have been
developed which are quite accurate but time-intensive, and hence not very
suitable for practical purposes. In this paper, we propose an efficient
hierarchical re-identification approach in which color histogram based
comparison is first employed to find the closest matches in the gallery set,
and next deep feature based comparison is carried out using Siamese network.
Reduction in search space after the first level of matching helps in achieving
a fast response time as well as improving the accuracy of prediction by the
Siamese network by eliminating vastly dissimilar elements. A silhouette
part-based feature extraction scheme is adopted in each level of hierarchy to
preserve the relative locations of the different body structures and make the
appearance descriptors more discriminating in nature. The proposed approach has
been evaluated on five public data sets and also a new data set captured by our
team in our laboratory. Results reveal that it outperforms most
state-of-the-art approaches in terms of overall accuracy.
Related papers
- Deep Homography Estimation for Visual Place Recognition [49.235432979736395]
We propose a transformer-based deep homography estimation (DHE) network.
It takes the dense feature map extracted by a backbone network as input and fits homography for fast and learnable geometric verification.
Experiments on benchmark datasets show that our method can outperform several state-of-the-art methods.
arXiv Detail & Related papers (2024-02-25T13:22:17Z) - Self-Supervised Representation Learning from Temporal Ordering of
Automated Driving Sequences [49.91741677556553]
We propose TempO, a temporal ordering pretext task for pre-training region-level feature representations for perception tasks.
We embed each frame by an unordered set of proposal feature vectors, a representation that is natural for object detection or tracking systems.
Extensive evaluations on the BDD100K, nuImages, and MOT17 datasets show that our TempO pre-training approach outperforms single-frame self-supervised learning methods.
arXiv Detail & Related papers (2023-02-17T18:18:27Z) - Deepfake Detection via Joint Unsupervised Reconstruction and Supervised
Classification [25.84902508816679]
We introduce a novel approach for deepfake detection, which considers the reconstruction and classification tasks simultaneously.
This method shares the information learned by one task with the other, which focuses on a different aspect other existing works rarely consider.
Our method achieves state-of-the-art performance on three commonly-used datasets.
arXiv Detail & Related papers (2022-11-24T05:44:26Z) - End-to-end Person Search Sequentially Trained on Aggregated Dataset [1.9766522384767227]
We propose a new end-to-end model that jointly computes detection and feature extraction steps.
We show that aggregating more pedestrian detection datasets without costly identity annotations makes the shared feature maps more generic.
arXiv Detail & Related papers (2022-01-24T11:22:15Z) - SIMPLE: SIngle-network with Mimicking and Point Learning for Bottom-up
Human Pose Estimation [81.03485688525133]
We propose a novel multi-person pose estimation framework, SIngle-network with Mimicking and Point Learning for Bottom-up Human Pose Estimation (SIMPLE)
Specifically, in the training process, we enable SIMPLE to mimic the pose knowledge from the high-performance top-down pipeline.
Besides, SIMPLE formulates human detection and pose estimation as a unified point learning framework to complement each other in single-network.
arXiv Detail & Related papers (2021-04-06T13:12:51Z) - Distribution Alignment: A Unified Framework for Long-tail Visual
Recognition [52.36728157779307]
We propose a unified distribution alignment strategy for long-tail visual recognition.
We then introduce a generalized re-weight method in the two-stage learning to balance the class prior.
Our approach achieves the state-of-the-art results across all four recognition tasks with a simple and unified framework.
arXiv Detail & Related papers (2021-03-30T14:09:53Z) - Fast and Incremental Loop Closure Detection with Deep Features and
Proximity Graphs [13.328790865796224]
This article proposes an appearance-based loop closure detection pipeline named FILD++"
The system is fed by consecutive images and, via passing them twice through a single convolutional neural network, global and local deep features are extracted.
An image-to-image pairing follows, which exploits local features to evaluate the spatial information.
arXiv Detail & Related papers (2020-09-29T02:42:47Z) - Robust Person Re-Identification through Contextual Mutual Boosting [77.1976737965566]
We propose the Contextual Mutual Boosting Network (CMBN) to localize pedestrians.
It localizes pedestrians and recalibrates features by effectively exploiting contextual information and statistical inference.
Experiments on the benchmarks demonstrate the superiority of the architecture compared the state-of-the-art.
arXiv Detail & Related papers (2020-09-16T06:33:35Z) - A Self-Supervised Gait Encoding Approach with Locality-Awareness for 3D
Skeleton Based Person Re-Identification [65.18004601366066]
Person re-identification (Re-ID) via gait features within 3D skeleton sequences is a newly-emerging topic with several advantages.
This paper proposes a self-supervised gait encoding approach that can leverage unlabeled skeleton data to learn gait representations for person Re-ID.
arXiv Detail & Related papers (2020-09-05T16:06:04Z) - A Convolutional Baseline for Person Re-Identification Using Vision and
Language Descriptions [24.794592610444514]
In real-world surveillance scenarios, frequently no visual information will be available about the queried person.
A two stream deep convolutional neural network framework supervised by cross entropy loss is presented.
The learnt visual representations are more robust and perform 22% better during retrieval as compared to a single modality system.
arXiv Detail & Related papers (2020-02-20T10:12:02Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.