BGaitR-Net: Occluded Gait Sequence reconstructionwith temporally
constrained model for gait recognition
- URL: http://arxiv.org/abs/2110.09564v1
- Date: Mon, 18 Oct 2021 18:28:18 GMT
- Title: BGaitR-Net: Occluded Gait Sequence reconstructionwith temporally
constrained model for gait recognition
- Authors: Somnath Sendhil Kumara, Pratik Chattopadhyaya, Lipo Wang
- Abstract summary: We develop novel deep learning-based algorithms to identify occluded frames in an input sequence.
We then reconstruct these frames by exploiting next-temporal information present in the gait sequence.
Our LSTM-based model reconstructs occlusion and generates frames that are temporally consistent with the periodic pattern of a gait cycle.
- Score: 1.151614782416873
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Recent advancements in computational resources and Deep Learning
methodologies has significantly benefited development of intelligent
vision-based surveillance applications. Gait recognition in the presence of
occlusion is one of the challenging research topics in this area, and the
solutions proposed by researchers to date lack in robustness and also dependent
of several unrealistic constraints, which limits their practical applicability.
We improve the state-of-the-art by developing novel deep learning-based
algorithms to identify the occluded frames in an input sequence and next
reconstruct these occluded frames by exploiting the spatio-temporal information
present in the gait sequence. The multi-stage pipeline adopted in this work
consists of key pose mapping, occlusion detection and reconstruction, and
finally gait recognition. While the key pose mapping and occlusion detection
phases are done %using Constrained KMeans Clustering and via a graph sorting
algorithm, reconstruction of occluded frames is done by fusing the key
pose-specific information derived in the previous step along with the
spatio-temporal information contained in a gait sequence using a Bi-Directional
Long Short Time Memory. This occlusion reconstruction model has been trained
using synthetically occluded CASIA-B and OU-ISIR data, and the trained model is
termed as Bidirectional Gait Reconstruction Network BGait-R-Net. Our LSTM-based
model reconstructs occlusion and generates frames that are temporally
consistent with the periodic pattern of a gait cycle, while simultaneously
preserving the body structure.
Related papers
- State-Space Modeling in Long Sequence Processing: A Survey on Recurrence in the Transformer Era [59.279784235147254]
This survey provides an in-depth summary of the latest approaches that are based on recurrent models for sequential data processing.
The emerging picture suggests that there is room for thinking of novel routes, constituted by learning algorithms which depart from the standard Backpropagation Through Time.
arXiv Detail & Related papers (2024-06-13T12:51:22Z) - Video Anomaly Detection via Spatio-Temporal Pseudo-Anomaly Generation : A Unified Approach [49.995833831087175]
This work proposes a novel method for generating generic Video-temporal PAs by inpainting a masked out region of an image.
In addition, we present a simple unified framework to detect real-world anomalies under the OCC setting.
Our method performs on par with other existing state-of-the-art PAs generation and reconstruction based methods under the OCC setting.
arXiv Detail & Related papers (2023-11-27T13:14:06Z) - Revisiting the Temporal Modeling in Spatio-Temporal Predictive Learning
under A Unified View [73.73667848619343]
We introduce USTEP (Unified S-TEmporal Predictive learning), an innovative framework that reconciles the recurrent-based and recurrent-free methods by integrating both micro-temporal and macro-temporal scales.
arXiv Detail & Related papers (2023-10-09T16:17:42Z) - A Novel end-to-end Framework for Occluded Pixel Reconstruction with
Spatio-temporal Features for Improved Person Re-identification [0.842885453087587]
Person re-identification is vital for monitoring and tracking crowd movement to enhance public security.
In this work, we propose a plausible solution by developing effective occlusion detection and reconstruction framework for RGB images/videos consisting of Deep Neural Networks.
Specifically, a CNN-based occlusion detection model classifies individual input frames, followed by a Conv-LSTM and Autoencoder to reconstruct the occluded pixels corresponding to the occluded frames for sequential (video) and non-sequential (image) data.
arXiv Detail & Related papers (2023-04-16T08:14:29Z) - Gated Recurrent Neural Networks with Weighted Time-Delay Feedback [59.125047512495456]
We introduce a novel gated recurrent unit (GRU) with a weighted time-delay feedback mechanism.
We show that $tau$-GRU can converge faster and generalize better than state-of-the-art recurrent units and gated recurrent architectures.
arXiv Detail & Related papers (2022-12-01T02:26:34Z) - Gait Cycle Reconstruction and Human Identification from Occluded
Sequences [2.198430261120653]
We propose an effective neural network-based model to reconstruct the occluded frames in an input sequence before carrying out gait recognition.
We employ LSTM networks to predict an embedding for each occluded frame both from the forward and the backward directions.
While the LSTMs are trained to minimize the mean-squared loss, the fusion network is trained to optimize the pixel-wise cross-entropy loss between the ground-truth and the reconstructed samples.
arXiv Detail & Related papers (2022-06-20T16:04:31Z) - Recurrence-in-Recurrence Networks for Video Deblurring [58.49075799159015]
State-of-the-art video deblurring methods often adopt recurrent neural networks to model the temporal dependency between the frames.
In this paper, we propose recurrence-in-recurrence network architecture to cope with the limitations of short-ranged memory.
arXiv Detail & Related papers (2022-03-12T11:58:13Z) - An Attention-based ConvLSTM Autoencoder with Dynamic Thresholding for
Unsupervised Anomaly Detection in Multivariate Time Series [2.9685635948299995]
We propose an unsupervised Attention-based Convolutional Long Short-Term Memory (ConvLSTM) Autoencoder with Dynamic Thresholding (ACLAE-DT) framework for anomaly detection and diagnosis.
The framework starts by pre-processing and enriching the data, before constructing feature images to characterize the system statuses.
The constructed feature images are fed into an attention-based ConvLSTM autoencoder, which aims to encode the constructed feature images and capture the temporal behavior.
The reconstruction errors are then computed and subjected to a statistical-based, dynamic thresholding mechanism to detect and diagnose the anomalies
arXiv Detail & Related papers (2022-01-23T04:01:43Z) - Supporting Optimal Phase Space Reconstructions Using Neural Network
Architecture for Time Series Modeling [68.8204255655161]
We propose an artificial neural network with a mechanism to implicitly learn the phase spaces properties.
Our approach is either as competitive as or better than most state-of-the-art strategies.
arXiv Detail & Related papers (2020-06-19T21:04:47Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.