A Novel end-to-end Framework for Occluded Pixel Reconstruction with
Spatio-temporal Features for Improved Person Re-identification
- URL: http://arxiv.org/abs/2304.07721v1
- Date: Sun, 16 Apr 2023 08:14:29 GMT
- Title: A Novel end-to-end Framework for Occluded Pixel Reconstruction with
Spatio-temporal Features for Improved Person Re-identification
- Authors: Prathistith Raj Medi, Ghanta Sai Krishna, Praneeth Nemani,
Satyanarayana Vollala, Santosh Kumar
- Abstract summary: Person re-identification is vital for monitoring and tracking crowd movement to enhance public security.
In this work, we propose a plausible solution by developing effective occlusion detection and reconstruction framework for RGB images/videos consisting of Deep Neural Networks.
Specifically, a CNN-based occlusion detection model classifies individual input frames, followed by a Conv-LSTM and Autoencoder to reconstruct the occluded pixels corresponding to the occluded frames for sequential (video) and non-sequential (image) data.
- Score: 0.842885453087587
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Person re-identification is vital for monitoring and tracking crowd movement
to enhance public security. However, re-identification in the presence of
occlusion substantially reduces the performance of existing systems and is a
challenging area. In this work, we propose a plausible solution to this problem
by developing effective occlusion detection and reconstruction framework for
RGB images/videos consisting of Deep Neural Networks. Specifically, a CNN-based
occlusion detection model classifies individual input frames, followed by a
Conv-LSTM and Autoencoder to reconstruct the occluded pixels corresponding to
the occluded frames for sequential (video) and non-sequential (image) data,
respectively. The quality of the reconstructed RGB frames is further refined
and fine-tuned using a Conditional Generative Adversarial Network (cGAN). Our
method is evaluated on four well-known public data sets of the domain, and the
qualitative reconstruction results are indeed appealing. Quantitative
evaluation in terms of re-identification accuracy of the Siamese network showed
an exceptional Rank-1 accuracy after occluded pixel reconstruction on various
datasets. A comparative analysis with state-of-the-art approaches also
demonstrates the robustness of our work for use in real-life surveillance
systems.
Related papers
- Multi-feature Reconstruction Network using Crossed-mask Restoration for Unsupervised Industrial Anomaly Detection [4.742650815342744]
Unsupervised anomaly detection is of great significance for quality inspection in industrial manufacturing.
We propose a multi-feature reconstruction network, MFRNet, using crossed-mask restoration in this paper.
Our method is highly competitive with or significantly outperforms other state-of-the-arts on four public available datasets and one self-made dataset.
arXiv Detail & Related papers (2024-04-20T05:13:56Z) - DiAD: A Diffusion-based Framework for Multi-class Anomaly Detection [55.48770333927732]
We propose a Difusion-based Anomaly Detection (DiAD) framework for multi-class anomaly detection.
It consists of a pixel-space autoencoder, a latent-space Semantic-Guided (SG) network with a connection to the stable diffusion's denoising network, and a feature-space pre-trained feature extractor.
Experiments on MVTec-AD and VisA datasets demonstrate the effectiveness of our approach.
arXiv Detail & Related papers (2023-12-11T18:38:28Z) - Distance Weighted Trans Network for Image Completion [52.318730994423106]
We propose a new architecture that relies on Distance-based Weighted Transformer (DWT) to better understand the relationships between an image's components.
CNNs are used to augment the local texture information of coarse priors.
DWT blocks are used to recover certain coarse textures and coherent visual structures.
arXiv Detail & Related papers (2023-10-11T12:46:11Z) - Uncertainty-Aware Null Space Networks for Data-Consistent Image
Reconstruction [0.0]
State-of-the-art reconstruction methods have been developed based on recent advances in deep learning.
For such approaches to be used in safety-critical domains such as medical imaging, the network reconstruction should not only provide the user with a reconstructed image, but also with some level of confidence in the reconstruction.
This work is the first approach to solving inverse problems that additionally models data-dependent uncertainty by estimating an input-dependent scale map.
arXiv Detail & Related papers (2023-04-14T06:58:44Z) - Exploring Resolution and Degradation Clues as Self-supervised Signal for
Low Quality Object Detection [77.3530907443279]
We propose a novel self-supervised framework to detect objects in degraded low resolution images.
Our methods has achieved superior performance compared with existing methods when facing variant degradation situations.
arXiv Detail & Related papers (2022-08-05T09:36:13Z) - NeurAR: Neural Uncertainty for Autonomous 3D Reconstruction [64.36535692191343]
Implicit neural representations have shown compelling results in offline 3D reconstruction and also recently demonstrated the potential for online SLAM systems.
This paper addresses two key challenges: 1) seeking a criterion to measure the quality of the candidate viewpoints for the view planning based on the new representations, and 2) learning the criterion from data that can generalize to different scenes instead of hand-crafting one.
Our method demonstrates significant improvements on various metrics for the rendered image quality and the geometry quality of the reconstructed 3D models when compared with variants using TSDF or reconstruction without view planning.
arXiv Detail & Related papers (2022-07-22T10:05:36Z) - Gait Cycle Reconstruction and Human Identification from Occluded
Sequences [2.198430261120653]
We propose an effective neural network-based model to reconstruct the occluded frames in an input sequence before carrying out gait recognition.
We employ LSTM networks to predict an embedding for each occluded frame both from the forward and the backward directions.
While the LSTMs are trained to minimize the mean-squared loss, the fusion network is trained to optimize the pixel-wise cross-entropy loss between the ground-truth and the reconstructed samples.
arXiv Detail & Related papers (2022-06-20T16:04:31Z) - BGaitR-Net: Occluded Gait Sequence reconstructionwith temporally
constrained model for gait recognition [1.151614782416873]
We develop novel deep learning-based algorithms to identify occluded frames in an input sequence.
We then reconstruct these frames by exploiting next-temporal information present in the gait sequence.
Our LSTM-based model reconstructs occlusion and generates frames that are temporally consistent with the periodic pattern of a gait cycle.
arXiv Detail & Related papers (2021-10-18T18:28:18Z) - Spatially-Adaptive Image Restoration using Distortion-Guided Networks [51.89245800461537]
We present a learning-based solution for restoring images suffering from spatially-varying degradations.
We propose SPAIR, a network design that harnesses distortion-localization information and dynamically adjusts to difficult regions in the image.
arXiv Detail & Related papers (2021-08-19T11:02:25Z) - Limited View Tomographic Reconstruction Using a Deep Recurrent Framework
with Residual Dense Spatial-Channel Attention Network and Sinogram
Consistency [25.16002539710169]
We propose a novel recurrent reconstruction framework that stacks the same block multiple times.
We develop a sinogram consistency layer interleaved in our recurrent framework to ensure that the sampled sinogram is consistent with the sinogram of the intermediate outputs of the recurrent blocks.
Our algorithm achieves a consistent and significant improvement over the existing state-of-the-art neural methods on both limited angle reconstruction and sparse view reconstruction.
arXiv Detail & Related papers (2020-09-03T16:39:48Z) - Iterative Network for Image Super-Resolution [69.07361550998318]
Single image super-resolution (SISR) has been greatly revitalized by the recent development of convolutional neural networks (CNN)
This paper provides a new insight on conventional SISR algorithm, and proposes a substantially different approach relying on the iterative optimization.
A novel iterative super-resolution network (ISRN) is proposed on top of the iterative optimization.
arXiv Detail & Related papers (2020-05-20T11:11:47Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.