A Video Anomaly Detection Framework based on Appearance-Motion Semantics
Representation Consistency
- URL: http://arxiv.org/abs/2204.04151v1
- Date: Fri, 8 Apr 2022 15:59:57 GMT
- Title: A Video Anomaly Detection Framework based on Appearance-Motion Semantics
Representation Consistency
- Authors: Xiangyu Huang, Caidan Zhao, Yilin Wang, Zhiqiang Wu
- Abstract summary: We propose a framework that uses normal data's appearance and motion semantic representation consistency to handle anomaly detection.
We design a two-stream encoder to encode the appearance and motion information representations of normal samples.
Lower consistency of appearance and motion features of anomalous samples can be used to generate predicted frames with larger reconstruction error.
- Score: 18.06814233420315
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Video anomaly detection refers to the identification of events that deviate
from the expected behavior. Due to the lack of anomalous samples in training,
video anomaly detection becomes a very challenging task. Existing methods
almost follow a reconstruction or future frame prediction mode. However, these
methods ignore the consistency between appearance and motion information of
samples, which limits their anomaly detection performance. Anomalies only occur
in the moving foreground of surveillance videos, so the semantics expressed by
video frame sequences and optical flow without background information in
anomaly detection should be highly consistent and significant for anomaly
detection. Based on this idea, we propose Appearance-Motion Semantics
Representation Consistency (AMSRC), a framework that uses normal data's
appearance and motion semantic representation consistency to handle anomaly
detection. Firstly, we design a two-stream encoder to encode the appearance and
motion information representations of normal samples and introduce constraints
to further enhance the consistency of the feature semantics between appearance
and motion information of normal samples so that abnormal samples with low
consistency appearance and motion feature representation can be identified.
Moreover, the lower consistency of appearance and motion features of anomalous
samples can be used to generate predicted frames with larger reconstruction
error, which makes anomalies easier to spot. Experimental results demonstrate
the effectiveness of the proposed method.
Related papers
- Appearance Blur-driven AutoEncoder and Motion-guided Memory Module for Video Anomaly Detection [14.315287192621662]
Video anomaly detection (VAD) often learns the distribution of normal samples and detects the anomaly through measuring significant deviations.
Most VADs cannot cope with cross-dataset validation for new target domains.
We propose a novel VAD method with a motion-guided memory module to achieve cross-dataset validation with zero-shot.
arXiv Detail & Related papers (2024-09-26T07:48:20Z) - Open-Vocabulary Video Anomaly Detection [57.552523669351636]
Video anomaly detection (VAD) with weak supervision has achieved remarkable performance in utilizing video-level labels to discriminate whether a video frame is normal or abnormal.
Recent studies attempt to tackle a more realistic setting, open-set VAD, which aims to detect unseen anomalies given seen anomalies and normal videos.
This paper takes a step further and explores open-vocabulary video anomaly detection (OVVAD), in which we aim to leverage pre-trained large models to detect and categorize seen and unseen anomalies.
arXiv Detail & Related papers (2023-11-13T02:54:17Z) - Updated version: A Video Anomaly Detection Framework based on
Appearance-Motion Semantics Representation Consistency [2.395616571632115]
We propose a framework of Appearance-Motion Semantics Consistency Representation.
The two-stream structure is designed to encode the appearance and motion information representation of normal samples.
A novel consistency loss is proposed to enhance the consistency of feature semantics so that anomalies with low consistency can be identified.
arXiv Detail & Related papers (2023-03-09T08:28:34Z) - Lossy Compression for Robust Unsupervised Time-Series Anomaly Detection [4.873362301533825]
We propose a Lossy Causal Temporal Convolutional Neural Network Autoencoder for anomaly detection.
Our framework uses a rate-distortion loss and an entropy bottleneck to learn a compressed latent representation for the task.
arXiv Detail & Related papers (2022-12-05T14:29:16Z) - Spatio-temporal predictive tasks for abnormal event detection in videos [60.02503434201552]
We propose new constrained pretext tasks to learn object level normality patterns.
Our approach consists in learning a mapping between down-scaled visual queries and their corresponding normal appearance and motion characteristics.
Experiments on several benchmark datasets demonstrate the effectiveness of our approach to localize and track anomalies.
arXiv Detail & Related papers (2022-10-27T19:45:12Z) - Object-centric and memory-guided normality reconstruction for video
anomaly detection [56.64792194894702]
This paper addresses anomaly detection problem for videosurveillance.
Due to the inherent rarity and heterogeneity of abnormal events, the problem is viewed as a normality modeling strategy.
Our model learns object-centric normal patterns without seeing anomalous samples during training.
arXiv Detail & Related papers (2022-03-07T19:28:39Z) - SLA$^2$P: Self-supervised Anomaly Detection with Adversarial
Perturbation [77.71161225100927]
Anomaly detection is a fundamental yet challenging problem in machine learning.
We propose a novel and powerful framework, dubbed as SLA$2$P, for unsupervised anomaly detection.
arXiv Detail & Related papers (2021-11-25T03:53:43Z) - Explainable Deep Few-shot Anomaly Detection with Deviation Networks [123.46611927225963]
We introduce a novel weakly-supervised anomaly detection framework to train detection models.
The proposed approach learns discriminative normality by leveraging the labeled anomalies and a prior probability.
Our model is substantially more sample-efficient and robust, and performs significantly better than state-of-the-art competing methods in both closed-set and open-set settings.
arXiv Detail & Related papers (2021-08-01T14:33:17Z) - Video Anomaly Detection By The Duality Of Normality-Granted Optical Flow [1.8065361710947974]
We propose to discriminate anomalies from normal ones by the duality of normality-granted optical flow.
We extend the appearance-motion correspondence scheme from frame reconstruction to prediction.
arXiv Detail & Related papers (2021-05-10T12:25:00Z) - Unsupervised Video Anomaly Detection via Normalizing Flows with Implicit
Latent Features [8.407188666535506]
Most existing methods use an autoencoder to learn to reconstruct normal videos.
We propose an implicit two-path AE (ITAE), a structure in which two encoders implicitly model appearance and motion features.
For the complex distribution of normal scenes, we suggest normal density estimation of ITAE features.
NF models intensify ITAE performance by learning normality through implicitly learned features.
arXiv Detail & Related papers (2020-10-15T05:02:02Z) - A Background-Agnostic Framework with Adversarial Training for Abnormal
Event Detection in Video [120.18562044084678]
Abnormal event detection in video is a complex computer vision problem that has attracted significant attention in recent years.
We propose a background-agnostic framework that learns from training videos containing only normal events.
arXiv Detail & Related papers (2020-08-27T18:39:24Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.