Configurable Spatial-Temporal Hierarchical Analysis for Flexible Video
Anomaly Detection
- URL: http://arxiv.org/abs/2305.07328v1
- Date: Fri, 12 May 2023 09:03:38 GMT
- Title: Configurable Spatial-Temporal Hierarchical Analysis for Flexible Video
Anomaly Detection
- Authors: Kai Cheng, Xinhua Zeng, Yang Liu, Tian Wang, Chengxin Pang, Jing Teng,
Zhaoyang Xia, and Jing Liu
- Abstract summary: Video anomaly detection (VAD) is a vital task with great practical applications in industrial surveillance, security system, and traffic control.
We design a spatial-temporal hierarchical architecture (STHA) as a architecture to flexibly detect different degrees of anomaly.
We conduct experiments on three benchmarks and perform extensive analysis, and the results demonstrate that our method performs comparablely to the state-of-the-art methods.
- Score: 10.956907116728267
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Video anomaly detection (VAD) is a vital task with great practical
applications in industrial surveillance, security system, and traffic control.
Unlike previous unsupervised VAD methods that adopt a fixed structure to learn
normality without considering different detection demands, we design a
spatial-temporal hierarchical architecture (STHA) as a configurable
architecture to flexibly detect different degrees of anomaly. The comprehensive
structure of the STHA is delineated into a tripartite hierarchy, encompassing
the following tiers: the stream level, the stack level, and the block level.
Specifically, we design several auto-encoder-based blocks that possess varying
capacities for extracting normal patterns. Then, we stack blocks according to
the complexity degrees with both intra-stack and inter-stack residual links to
learn hierarchical normality gradually. Considering the multisource knowledge
of videos, we also model the spatial normality of video frames and temporal
normality of RGB difference by designing two parallel streams consisting of
stacks. Thus, STHA can provide various representation learning abilities by
expanding or contracting hierarchically to detect anomalies of different
degrees. Since the anomaly set is complicated and unbounded, our STHA can
adjust its detection ability to adapt to the human detection demands and the
complexity degree of anomaly that happened in the history of a scene. We
conduct experiments on three benchmarks and perform extensive analysis, and the
results demonstrate that our method performs comparablely to the
state-of-the-art methods. In addition, we design a toy dataset to prove that
our model can better balance the learning ability to adapt to different
detection demands.
Related papers
- ARC: A Generalist Graph Anomaly Detector with In-Context Learning [62.202323209244]
ARC is a generalist GAD approach that enables a one-for-all'' GAD model to detect anomalies across various graph datasets on-the-fly.
equipped with in-context learning, ARC can directly extract dataset-specific patterns from the target dataset.
Extensive experiments on multiple benchmark datasets from various domains demonstrate the superior anomaly detection performance, efficiency, and generalizability of ARC.
arXiv Detail & Related papers (2024-05-27T02:42:33Z) - USD: Unsupervised Soft Contrastive Learning for Fault Detection in Multivariate Time Series [6.055410677780381]
We introduce a combination of data augmentation and soft contrastive learning, specifically designed to capture the multifaceted nature of state behaviors more accurately.
This dual strategy significantly boosts the model's ability to distinguish between normal and abnormal states, leading to a marked improvement in fault detection performance across multiple datasets and settings.
arXiv Detail & Related papers (2024-05-25T14:48:04Z) - Generating and Reweighting Dense Contrastive Patterns for Unsupervised
Anomaly Detection [59.34318192698142]
We introduce a prior-less anomaly generation paradigm and develop an innovative unsupervised anomaly detection framework named GRAD.
PatchDiff effectively expose various types of anomaly patterns.
experiments on both MVTec AD and MVTec LOCO datasets also support the aforementioned observation.
arXiv Detail & Related papers (2023-12-26T07:08:06Z) - Unraveling the "Anomaly" in Time Series Anomaly Detection: A
Self-supervised Tri-domain Solution [89.16750999704969]
Anomaly labels hinder traditional supervised models in time series anomaly detection.
Various SOTA deep learning techniques, such as self-supervised learning, have been introduced to tackle this issue.
We propose a novel self-supervised learning based Tri-domain Anomaly Detector (TriAD)
arXiv Detail & Related papers (2023-11-19T05:37:18Z) - Open-Vocabulary Video Anomaly Detection [57.552523669351636]
Video anomaly detection (VAD) with weak supervision has achieved remarkable performance in utilizing video-level labels to discriminate whether a video frame is normal or abnormal.
Recent studies attempt to tackle a more realistic setting, open-set VAD, which aims to detect unseen anomalies given seen anomalies and normal videos.
This paper takes a step further and explores open-vocabulary video anomaly detection (OVVAD), in which we aim to leverage pre-trained large models to detect and categorize seen and unseen anomalies.
arXiv Detail & Related papers (2023-11-13T02:54:17Z) - Delving into CLIP latent space for Video Anomaly Recognition [24.37974279994544]
We introduce the novel method AnomalyCLIP, the first to combine Large Language and Vision (LLV) models, such as CLIP.
Our approach specifically involves manipulating the latent CLIP feature space to identify the normal event subspace.
When anomalous frames are projected onto these directions, they exhibit a large feature magnitude if they belong to a particular class.
arXiv Detail & Related papers (2023-10-04T14:01:55Z) - Hierarchical Spatio-Temporal Representation Learning for Gait
Recognition [6.877671230651998]
Gait recognition is a biometric technique that identifies individuals by their unique walking styles.
We propose a hierarchical-temporal representation learning framework for extracting gait features from coarse to fine.
Our method outperforms the state-of-the-art while maintaining a reasonable balance between model accuracy and complexity.
arXiv Detail & Related papers (2023-07-19T09:30:00Z) - Self-Supervised Masked Convolutional Transformer Block for Anomaly
Detection [122.4894940892536]
We present a novel self-supervised masked convolutional transformer block (SSMCTB) that comprises the reconstruction-based functionality at a core architectural level.
In this work, we extend our previous self-supervised predictive convolutional attentive block (SSPCAB) with a 3D masked convolutional layer, a transformer for channel-wise attention, as well as a novel self-supervised objective based on Huber loss.
arXiv Detail & Related papers (2022-09-25T04:56:10Z) - CSCAD: Correlation Structure-based Collective Anomaly Detection in
Complex System [11.739889613196619]
We propose a correlation structure-based collective anomaly detection model for high-dimensional anomaly detection problem in large systems.
Our framework utilize graph convolutional network combining a variational autoencoder to jointly exploit the feature space correlation and reconstruction deficiency of samples.
An anomaly discriminating network can then be trained using low anomalous degree samples as positive samples, and high anomalous degree samples as negative samples.
arXiv Detail & Related papers (2021-05-30T09:28:25Z) - Anomaly Detection in Video via Self-Supervised and Multi-Task Learning [113.81927544121625]
Anomaly detection in video is a challenging computer vision problem.
In this paper, we approach anomalous event detection in video through self-supervised and multi-task learning at the object level.
arXiv Detail & Related papers (2020-11-15T10:21:28Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.