A Survey of Visual Sensory Anomaly Detection
- URL: http://arxiv.org/abs/2202.07006v1
- Date: Mon, 14 Feb 2022 19:50:03 GMT
- Title: A Survey of Visual Sensory Anomaly Detection
- Authors: Xi Jiang, Guoyang Xie, Jinbao Wang, Yong Liu, Chengjie Wang, Feng
Zheng, Yaochu Jin
- Abstract summary: Visual sensory anomaly detection (AD) is an essential problem in computer vision.
We provide a comprehensive review of visual sensory AD and category into three levels according to the form of anomalies.
- Score: 53.23336329817023
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Visual sensory anomaly detection (AD) is an essential problem in computer
vision, which is gaining momentum recently thanks to the development of AI for
good. Compared with semantic anomaly detection which detects anomaly at the
label level (semantic shift), visual sensory AD detects the abnormal part of
the sample (covariate shift). However, no thorough review has been provided to
summarize this area for the computer vision community. In this survey, we are
the first one to provide a comprehensive review of visual sensory AD and
category into three levels according to the form of anomalies. Furthermore, we
classify each kind of anomaly according to the level of supervision. Finally,
we summarize the challenges and provide open directions for this community. All
resources are available at
https://github.com/M-3LAB/awesome-visual-sensory-anomaly-detection.
Related papers
- UniMODE: Unified Monocular 3D Object Detection [70.27631528933482]
We build a detector based on the bird's-eye-view (BEV) detection paradigm.
We propose an uneven BEV grid design to handle the convergence instability caused by the challenges.
A unified detector UniMODE is derived, which surpasses the previous state-of-the-art on the challenging Omni3D dataset.
arXiv Detail & Related papers (2024-02-28T18:59:31Z) - A Survey on Visual Anomaly Detection: Challenge, Approach, and Prospect [29.006716009327032]
Visual Anomaly Detection (VAD) endeavors to pinpoint deviations from the concept of normality in visual data, widely applied across diverse domains, e.g., industrial defect inspection, and medical lesion detection.
This survey comprehensively examines recent advancements in VAD by identifying three primary challenges: 1) scarcity of training data, 2) diversity of visual modalities, and 3) complexity of hierarchical anomalies.
arXiv Detail & Related papers (2024-01-29T18:41:21Z) - Open-Vocabulary Video Anomaly Detection [57.552523669351636]
Video anomaly detection (VAD) with weak supervision has achieved remarkable performance in utilizing video-level labels to discriminate whether a video frame is normal or abnormal.
Recent studies attempt to tackle a more realistic setting, open-set VAD, which aims to detect unseen anomalies given seen anomalies and normal videos.
This paper takes a step further and explores open-vocabulary video anomaly detection (OVVAD), in which we aim to leverage pre-trained large models to detect and categorize seen and unseen anomalies.
arXiv Detail & Related papers (2023-11-13T02:54:17Z) - That's BAD: Blind Anomaly Detection by Implicit Local Feature Clustering [28.296651124677556]
Setting blind anomaly detection (BAD) can be converted into a local outlier detection problem.
We propose a novel method named PatchCluster that can accurately detect image- and pixel-level anomalies.
Experimental results show that PatchCluster shows a promising performance without the knowledge of normal data.
arXiv Detail & Related papers (2023-07-06T18:17:43Z) - Augment and Criticize: Exploring Informative Samples for Semi-Supervised
Monocular 3D Object Detection [64.65563422852568]
We improve the challenging monocular 3D object detection problem with a general semi-supervised framework.
We introduce a novel, simple, yet effective Augment and Criticize' framework that explores abundant informative samples from unlabeled data.
The two new detectors, dubbed 3DSeMo_DLE and 3DSeMo_FLEX, achieve state-of-the-art results with remarkable improvements for over 3.5% AP_3D/BEV (Easy) on KITTI.
arXiv Detail & Related papers (2023-03-20T16:28:15Z) - Explainable Anomaly Detection in Images and Videos: A Survey [49.07140708026425]
Anomaly detection and localization of visual data, including images and videos, are of great significance in machine learning academia and applied real-world scenarios.
Despite the rapid development of visual anomaly detection techniques in recent years, the interpretations of these black-box models and reasonable explanations of why anomalies can be distinguished out are scarce.
This paper provides the first survey concentrated on explainable visual anomaly detection methods.
arXiv Detail & Related papers (2023-02-13T20:17:41Z) - Self-Supervised Masked Convolutional Transformer Block for Anomaly
Detection [122.4894940892536]
We present a novel self-supervised masked convolutional transformer block (SSMCTB) that comprises the reconstruction-based functionality at a core architectural level.
In this work, we extend our previous self-supervised predictive convolutional attentive block (SSPCAB) with a 3D masked convolutional layer, a transformer for channel-wise attention, as well as a novel self-supervised objective based on Huber loss.
arXiv Detail & Related papers (2022-09-25T04:56:10Z) - The MVTec 3D-AD Dataset for Unsupervised 3D Anomaly Detection and
Localization [17.437967037670813]
We introduce the first comprehensive 3D dataset for the task of unsupervised anomaly detection and localization.
It is inspired by real-world visual inspection scenarios in which a model has to detect various types of defects on manufactured products.
arXiv Detail & Related papers (2021-12-16T17:35:51Z) - OIAD: One-for-all Image Anomaly Detection with Disentanglement Learning [23.48763375455514]
We propose a One-for-all Image Anomaly Detection system based on disentangled learning using only clean samples.
Our experiments with three datasets show that OIAD can detect over $90%$ of anomalies while maintaining a low false alarm rate.
arXiv Detail & Related papers (2020-01-18T09:57:37Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.