A Coarse-to-Fine Pseudo-Labeling (C2FPL) Framework for Unsupervised
Video Anomaly Detection
- URL: http://arxiv.org/abs/2310.17650v1
- Date: Thu, 26 Oct 2023 17:59:19 GMT
- Title: A Coarse-to-Fine Pseudo-Labeling (C2FPL) Framework for Unsupervised
Video Anomaly Detection
- Authors: Anas Al-lahham, Nurbek Tastan, Zaigham Zaheer, Karthik Nandakumar
- Abstract summary: Detection of anomalous events in videos is an important problem in applications such as surveillance.
We propose a simple-but-effective two-stage pseudo-label generation framework that produces segment-level (normal/anomaly) pseudo-labels.
The proposed coarse-to-fine pseudo-label generator employs carefully-designed hierarchical divisive clustering and statistical hypothesis testing.
- Score: 4.494911384096143
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Detection of anomalous events in videos is an important problem in
applications such as surveillance. Video anomaly detection (VAD) is
well-studied in the one-class classification (OCC) and weakly supervised (WS)
settings. However, fully unsupervised (US) video anomaly detection methods,
which learn a complete system without any annotation or human supervision, have
not been explored in depth. This is because the lack of any ground truth
annotations significantly increases the magnitude of the VAD challenge. To
address this challenge, we propose a simple-but-effective two-stage
pseudo-label generation framework that produces segment-level (normal/anomaly)
pseudo-labels, which can be further used to train a segment-level anomaly
detector in a supervised manner. The proposed coarse-to-fine pseudo-label
(C2FPL) generator employs carefully-designed hierarchical divisive clustering
and statistical hypothesis testing to identify anomalous video segments from a
set of completely unlabeled videos. The trained anomaly detector can be
directly applied on segments of an unseen test video to obtain segment-level,
and subsequently, frame-level anomaly predictions. Extensive studies on two
large-scale public-domain datasets, UCF-Crime and XD-Violence, demonstrate that
the proposed unsupervised approach achieves superior performance compared to
all existing OCC and US methods , while yielding comparable performance to the
state-of-the-art WS methods.
Related papers
- Generating and Reweighting Dense Contrastive Patterns for Unsupervised
Anomaly Detection [59.34318192698142]
We introduce a prior-less anomaly generation paradigm and develop an innovative unsupervised anomaly detection framework named GRAD.
PatchDiff effectively expose various types of anomaly patterns.
experiments on both MVTec AD and MVTec LOCO datasets also support the aforementioned observation.
arXiv Detail & Related papers (2023-12-26T07:08:06Z) - Open-Vocabulary Video Anomaly Detection [57.552523669351636]
Video anomaly detection (VAD) with weak supervision has achieved remarkable performance in utilizing video-level labels to discriminate whether a video frame is normal or abnormal.
Recent studies attempt to tackle a more realistic setting, open-set VAD, which aims to detect unseen anomalies given seen anomalies and normal videos.
This paper takes a step further and explores open-vocabulary video anomaly detection (OVVAD), in which we aim to leverage pre-trained large models to detect and categorize seen and unseen anomalies.
arXiv Detail & Related papers (2023-11-13T02:54:17Z) - Weakly-Supervised Video Anomaly Detection with Snippet Anomalous
Attention [22.985681654402153]
We propose an Anomalous Attention mechanism for weakly-supervised anomaly detection.
Our approach takes into account snippet-level encoded features without the supervision of pseudo labels.
arXiv Detail & Related papers (2023-09-28T10:03:58Z) - Clustering Aided Weakly Supervised Training to Detect Anomalous Events
in Surveillance Videos [20.368114998124295]
We propose a weakly supervised anomaly detection system which has multiple contributions including a random batch selection mechanism.
A clustering loss block is proposed to mitigate the label noise and to improve the representation learning for the anomalous and normal regions.
Extensive analysis of the proposed approach is provided using three popular anomaly detection datasets.
arXiv Detail & Related papers (2022-03-25T15:18:19Z) - UBnormal: New Benchmark for Supervised Open-Set Video Anomaly Detection [103.06327681038304]
We propose a supervised open-set benchmark composed of multiple virtual scenes for video anomaly detection.
Unlike existing data sets, we introduce abnormal events annotated at the pixel level at training time.
We show that UBnormal can enhance the performance of a state-of-the-art anomaly detection framework.
arXiv Detail & Related papers (2021-11-16T17:28:46Z) - Cleaning Label Noise with Clusters for Minimally Supervised Anomaly
Detection [26.062659852373653]
We formulate a weakly supervised anomaly detection method that is trained using only video-level labels.
The proposed method yields 78.27% and 84.16% frame-level AUC on UCF-crime and ShanghaiTech datasets respectively.
arXiv Detail & Related papers (2021-04-30T06:03:24Z) - MIST: Multiple Instance Self-Training Framework for Video Anomaly
Detection [76.80153360498797]
We develop a multiple instance self-training framework (MIST) to efficiently refine task-specific discriminative representations.
MIST is composed of 1) a multiple instance pseudo label generator, which adapts a sparse continuous sampling strategy to produce more reliable clip-level pseudo labels, and 2) a self-guided attention boosted feature encoder.
Our method performs comparably to or even better than existing supervised and weakly supervised methods, specifically obtaining a frame-level AUC 94.83% on ShanghaiTech.
arXiv Detail & Related papers (2021-04-04T15:47:14Z) - Robust Unsupervised Video Anomaly Detection by Multi-Path Frame
Prediction [61.17654438176999]
We propose a novel and robust unsupervised video anomaly detection method by frame prediction with proper design.
Our proposed method obtains the frame-level AUROC score of 88.3% on the CUHK Avenue dataset.
arXiv Detail & Related papers (2020-11-05T11:34:12Z) - A Self-Reasoning Framework for Anomaly Detection Using Video-Level
Labels [17.615297975503648]
Alous event detection in surveillance videos is a challenging and practical research problem among image and video processing community.
We propose a weakly supervised anomaly detection framework based on deep neural networks which is trained in a self-reasoning fashion using only video-level labels.
The proposed framework has been evaluated on publicly available real-world anomaly detection datasets including UCF-crime, ShanghaiTech and Ped2.
arXiv Detail & Related papers (2020-08-27T02:14:15Z) - Self-trained Deep Ordinal Regression for End-to-End Video Anomaly
Detection [114.9714355807607]
We show that applying self-trained deep ordinal regression to video anomaly detection overcomes two key limitations of existing methods.
We devise an end-to-end trainable video anomaly detection approach that enables joint representation learning and anomaly scoring without manually labeled normal/abnormal data.
arXiv Detail & Related papers (2020-03-15T08:44:55Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.