Making Reconstruction-based Method Great Again for Video Anomaly
Detection
- URL: http://arxiv.org/abs/2301.12048v1
- Date: Sat, 28 Jan 2023 01:57:57 GMT
- Title: Making Reconstruction-based Method Great Again for Video Anomaly
Detection
- Authors: Yizhou Wang, Can Qin, Yue Bai, Yi Xu, Xu Ma, Yun Fu
- Abstract summary: Anomaly detection in videos is a significant yet challenging problem.
Existing reconstruction-based methods rely on old-fashioned convolutional autoencoders.
We propose a new autoencoder model for enhanced consecutive frame reconstruction.
- Score: 64.19326819088563
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Anomaly detection in videos is a significant yet challenging problem.
Previous approaches based on deep neural networks employ either
reconstruction-based or prediction-based approaches. Nevertheless, existing
reconstruction-based methods 1) rely on old-fashioned convolutional
autoencoders and are poor at modeling temporal dependency; 2) are prone to
overfit the training samples, leading to indistinguishable reconstruction
errors of normal and abnormal frames during the inference phase. To address
such issues, firstly, we get inspiration from transformer and propose ${\textbf
S}$patio-${\textbf T}$emporal ${\textbf A}$uto-${\textbf T}$rans-${\textbf
E}$ncoder, dubbed as $\textbf{STATE}$, as a new autoencoder model for enhanced
consecutive frame reconstruction. Our STATE is equipped with a specifically
designed learnable convolutional attention module for efficient temporal
learning and reasoning. Secondly, we put forward a novel reconstruction-based
input perturbation technique during testing to further differentiate anomalous
frames. With the same perturbation magnitude, the testing reconstruction error
of the normal frames lowers more than that of the abnormal frames, which
contributes to mitigating the overfitting problem of reconstruction. Owing to
the high relevance of the frame abnormality and the objects in the frame, we
conduct object-level reconstruction using both the raw frame and the
corresponding optical flow patches. Finally, the anomaly score is designed
based on the combination of the raw and motion reconstruction errors using
perturbed inputs. Extensive experiments on benchmark video anomaly detection
datasets demonstrate that our approach outperforms previous
reconstruction-based methods by a notable margin, and achieves state-of-the-art
anomaly detection performance consistently. The code is available at
https://github.com/wyzjack/MRMGA4VAD.
Related papers
- Video Anomaly Detection via Spatio-Temporal Pseudo-Anomaly Generation : A Unified Approach [49.995833831087175]
This work proposes a novel method for generating generic Video-temporal PAs by inpainting a masked out region of an image.
In addition, we present a simple unified framework to detect real-world anomalies under the OCC setting.
Our method performs on par with other existing state-of-the-art PAs generation and reconstruction based methods under the OCC setting.
arXiv Detail & Related papers (2023-11-27T13:14:06Z) - Self-Supervised Masked Convolutional Transformer Block for Anomaly
Detection [122.4894940892536]
We present a novel self-supervised masked convolutional transformer block (SSMCTB) that comprises the reconstruction-based functionality at a core architectural level.
In this work, we extend our previous self-supervised predictive convolutional attentive block (SSPCAB) with a 3D masked convolutional layer, a transformer for channel-wise attention, as well as a novel self-supervised objective based on Huber loss.
arXiv Detail & Related papers (2022-09-25T04:56:10Z) - Multi-Contextual Predictions with Vision Transformer for Video Anomaly
Detection [22.098399083491937]
understanding of thetemporal context of a video plays a vital role in anomaly detection.
We design a transformer model with three different contextual prediction streams: masked, whole and partial.
By learning to predict the missing frames of consecutive normal frames, our model can effectively learn various normality patterns in the video.
arXiv Detail & Related papers (2022-06-17T05:54:31Z) - Unsupervised Anomaly Detection in Medical Images with a Memory-augmented
Multi-level Cross-attentional Masked Autoencoder [33.5760501931736]
Unsupervised anomaly detection (UAD) aims to find anomalous images by optimising a detector using a training set that contains only normal images.
UAD approaches can be based on reconstruction methods, self-supervised approaches, and Imagenet pre-trained models.
We introduce a new reconstruction-based UAD approach that addresses the low reconstruction error issue for anomalous images.
arXiv Detail & Related papers (2022-03-22T13:32:42Z) - Self-Supervised Predictive Convolutional Attentive Block for Anomaly
Detection [97.93062818228015]
We propose to integrate the reconstruction-based functionality into a novel self-supervised predictive architectural building block.
Our block is equipped with a loss that minimizes the reconstruction error with respect to the masked area in the receptive field.
We demonstrate the generality of our block by integrating it into several state-of-the-art frameworks for anomaly detection on image and video.
arXiv Detail & Related papers (2021-11-17T13:30:31Z) - Robust lEarned Shrinkage-Thresholding (REST): Robust unrolling for
sparse recover [87.28082715343896]
We consider deep neural networks for solving inverse problems that are robust to forward model mis-specifications.
We design a new robust deep neural network architecture by applying algorithm unfolding techniques to a robust version of the underlying recovery problem.
The proposed REST network is shown to outperform state-of-the-art model-based and data-driven algorithms in both compressive sensing and radar imaging problems.
arXiv Detail & Related papers (2021-10-20T06:15:45Z) - A Hybrid Video Anomaly Detection Framework via Memory-Augmented Flow
Reconstruction and Flow-Guided Frame Prediction [34.565986275769745]
$textHF2$-VAD is a Hybrid framework that integrates Flow reconstruction and Frame prediction.
We design the network of ML-MemAE-SC to memorize normal patterns for optical flow reconstruction.
We then employ a Conditional Variational Autoencoder to predict the next frame given several previous frames.
arXiv Detail & Related papers (2021-08-16T01:37:29Z) - Divide-and-Assemble: Learning Block-wise Memory for Unsupervised Anomaly
Detection [40.778313918994996]
Reconstruction-based methods play an important role in unsupervised anomaly detection in images.
In this work, we interpret the reconstruction of an image as a divide-and-assemble procedure.
We achieve state-of-the-art performance on the challenging MVTec AD dataset.
arXiv Detail & Related papers (2021-07-28T01:14:32Z) - Anomaly Detection with Prototype-Guided Discriminative Latent Embeddings [29.93983580779689]
We present a novel approach for anomaly detection, which utilizes discriminative prototypes of normal data to reconstruct video frames.
In this way, the model will favor the reconstruction of normal events and distort the reconstruction of abnormal events.
We evaluate the effectiveness of our method on three benchmark datasets and experimental results demonstrate the proposed method outperforms the state-of-the-art.
arXiv Detail & Related papers (2021-04-30T12:16:52Z) - Robust Unsupervised Video Anomaly Detection by Multi-Path Frame
Prediction [61.17654438176999]
We propose a novel and robust unsupervised video anomaly detection method by frame prediction with proper design.
Our proposed method obtains the frame-level AUROC score of 88.3% on the CUHK Avenue dataset.
arXiv Detail & Related papers (2020-11-05T11:34:12Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.