Explainable Anomaly Detection in Images and Videos: A Survey
- URL: http://arxiv.org/abs/2302.06670v3
- Date: Tue, 9 Apr 2024 21:27:20 GMT
- Title: Explainable Anomaly Detection in Images and Videos: A Survey
- Authors: Yizhou Wang, Dongliang Guo, Sheng Li, Octavia Camps, Yun Fu,
- Abstract summary: Anomaly detection and localization of visual data, including images and videos, are of great significance in machine learning academia and applied real-world scenarios.
Despite the rapid development of visual anomaly detection techniques in recent years, the interpretations of these black-box models and reasonable explanations of why anomalies can be distinguished out are scarce.
This paper provides the first survey concentrated on explainable visual anomaly detection methods.
- Score: 49.07140708026425
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Anomaly detection and localization of visual data, including images and videos, are of great significance in both machine learning academia and applied real-world scenarios. Despite the rapid development of visual anomaly detection techniques in recent years, the interpretations of these black-box models and reasonable explanations of why anomalies can be distinguished out are scarce. This paper provides the first survey concentrated on explainable visual anomaly detection methods. We first introduce the basic background of image-level and video-level anomaly detection. Then, as the main content of this survey, a comprehensive and exhaustive literature review of explainable anomaly detection methods for both images and videos is presented. Next, we analyze why some explainable anomaly detection methods can be applied to both images and videos and why others can be only applied to one modality. Additionally, we provide summaries of current 2D visual anomaly detection datasets and evaluation metrics. Finally, we discuss several promising future directions and open problems to explore the explainability of 2D visual anomaly detection. The related resource collection is given at https://github.com/wyzjack/Awesome-XAD.
Related papers
- VANE-Bench: Video Anomaly Evaluation Benchmark for Conversational LMMs [64.60035916955837]
VANE-Bench is a benchmark designed to assess the proficiency of Video-LMMs in detecting anomalies and inconsistencies in videos.
Our dataset comprises an array of videos synthetically generated using existing state-of-the-art text-to-video generation models.
We evaluate nine existing Video-LMMs, both open and closed sources, on this benchmarking task and find that most of the models encounter difficulties in effectively identifying the subtle anomalies.
arXiv Detail & Related papers (2024-06-14T17:59:01Z) - Adapting Visual-Language Models for Generalizable Anomaly Detection in Medical Images [68.42215385041114]
This paper introduces a novel lightweight multi-level adaptation and comparison framework to repurpose the CLIP model for medical anomaly detection.
Our approach integrates multiple residual adapters into the pre-trained visual encoder, enabling a stepwise enhancement of visual features across different levels.
Our experiments on medical anomaly detection benchmarks demonstrate that our method significantly surpasses current state-of-the-art models.
arXiv Detail & Related papers (2024-03-19T09:28:19Z) - Open-Vocabulary Video Anomaly Detection [57.552523669351636]
Video anomaly detection (VAD) with weak supervision has achieved remarkable performance in utilizing video-level labels to discriminate whether a video frame is normal or abnormal.
Recent studies attempt to tackle a more realistic setting, open-set VAD, which aims to detect unseen anomalies given seen anomalies and normal videos.
This paper takes a step further and explores open-vocabulary video anomaly detection (OVVAD), in which we aim to leverage pre-trained large models to detect and categorize seen and unseen anomalies.
arXiv Detail & Related papers (2023-11-13T02:54:17Z) - PAD: A Dataset and Benchmark for Pose-agnostic Anomaly Detection [28.973078719467516]
We develop Multi-pose Anomaly Detection dataset and Pose-agnostic Anomaly Detection benchmark.
Specifically, we build MAD using 20 complex-shaped LEGO toys with various poses, and high-quality and diverse 3D anomalies in both simulated and real environments.
We also propose a novel method OmniposeAD, trained using MAD, specifically designed for pose-agnostic anomaly detection.
arXiv Detail & Related papers (2023-10-11T17:59:56Z) - Understanding the Challenges and Opportunities of Pose-based Anomaly
Detection [2.924868086534434]
Pose-based anomaly detection is a video-analysis technique for detecting anomalous events or behaviors by examining human pose extracted from the video frames.
In this work, we analyze and quantify the characteristics of two well-known video anomaly datasets to better understand the difficulties of pose-based anomaly detection.
We believe these experiments are beneficial for a better comprehension of pose-based anomaly detection and the datasets currently available.
arXiv Detail & Related papers (2023-03-09T18:09:45Z) - Catching Both Gray and Black Swans: Open-set Supervised Anomaly
Detection [90.32910087103744]
A few labeled anomaly examples are often available in many real-world applications.
These anomaly examples provide valuable knowledge about the application-specific abnormality.
Those anomalies seen during training often do not illustrate every possible class of anomaly.
This paper tackles open-set supervised anomaly detection.
arXiv Detail & Related papers (2022-03-28T05:21:37Z) - A Survey of Visual Sensory Anomaly Detection [53.23336329817023]
Visual sensory anomaly detection (AD) is an essential problem in computer vision.
We provide a comprehensive review of visual sensory AD and category into three levels according to the form of anomalies.
arXiv Detail & Related papers (2022-02-14T19:50:03Z) - Approaches Toward Physical and General Video Anomaly Detection [0.0]
Anomaly detection in videos may enable automatic detection of malfunctions in many manufacturing, maintenance, and real-life settings.
We introduce the Physical Anomalous Trajectory or Motion dataset, which contains six different video classes.
We suggest an even harder benchmark where anomalous activities should be spotted on highly variable scenes.
arXiv Detail & Related papers (2021-12-14T18:57:44Z) - A Critical Study on the Recent Deep Learning Based Semi-Supervised Video
Anomaly Detection Methods [3.198144010381572]
This paper introduces the researchers of the field to a new perspective and reviews the recent deep-learning based semi-supervised video anomaly detection approaches.
Our goal is to help researchers develop more effective video anomaly detection methods.
arXiv Detail & Related papers (2021-11-02T14:00:33Z) - Self-Supervised Representation Learning for Visual Anomaly Detection [9.642625267699488]
We consider the problem of anomaly detection in images videos, and present a new visual anomaly detection technique for videos.
We propose a simple self-supervision approach for learning temporal coherence across video frames without the use of any optical flow information.
This intuitive approach shows superior performance of visual anomaly detection compared to numerous methods for images and videos on UCF101 and ILSVRC2015 video datasets.
arXiv Detail & Related papers (2020-06-17T04:37:29Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.