A Survey on Visual Anomaly Detection: Challenge, Approach, and Prospect
- URL: http://arxiv.org/abs/2401.16402v1
- Date: Mon, 29 Jan 2024 18:41:21 GMT
- Title: A Survey on Visual Anomaly Detection: Challenge, Approach, and Prospect
- Authors: Yunkang Cao, Xiaohao Xu, Jiangning Zhang, Yuqi Cheng, Xiaonan Huang,
Guansong Pang, Weiming Shen
- Abstract summary: Visual Anomaly Detection (VAD) endeavors to pinpoint deviations from the concept of normality in visual data, widely applied across diverse domains, e.g., industrial defect inspection, and medical lesion detection.
This survey comprehensively examines recent advancements in VAD by identifying three primary challenges: 1) scarcity of training data, 2) diversity of visual modalities, and 3) complexity of hierarchical anomalies.
- Score: 29.006716009327032
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Visual Anomaly Detection (VAD) endeavors to pinpoint deviations from the
concept of normality in visual data, widely applied across diverse domains,
e.g., industrial defect inspection, and medical lesion detection. This survey
comprehensively examines recent advancements in VAD by identifying three
primary challenges: 1) scarcity of training data, 2) diversity of visual
modalities, and 3) complexity of hierarchical anomalies. Starting with a brief
overview of the VAD background and its generic concept definitions, we
progressively categorize, emphasize, and discuss the latest VAD progress from
the perspective of sample number, data modality, and anomaly hierarchy. Through
an in-depth analysis of the VAD field, we finally summarize future developments
for VAD and conclude the key findings and contributions of this survey.
Related papers
- Deep Learning for Video Anomaly Detection: A Review [52.74513211976795]
Video anomaly detection (VAD) aims to discover behaviors or events deviating from the normality in videos.
In the era of deep learning, a great variety of deep learning based methods are constantly emerging for the VAD task.
This review covers the spectrum of five different categories, namely, semi-supervised, weakly supervised, fully supervised, unsupervised and open-set supervised VAD.
arXiv Detail & Related papers (2024-09-09T07:31:16Z) - Generalized Out-of-Distribution Detection and Beyond in Vision Language Model Era: A Survey [107.08019135783444]
We first present a generalized OOD detection v2, encapsulating the evolution of AD, ND, OSR, OOD detection, and OD in the VLM era.
Our framework reveals that, with some field inactivity and integration, the demanding challenges have become OOD detection and AD.
arXiv Detail & Related papers (2024-07-31T17:59:58Z) - A Comprehensive Library for Benchmarking Multi-class Visual Anomaly Detection [52.228708947607636]
This paper introduces a comprehensive visual anomaly detection benchmark, ADer, which is a modular framework for new methods.
The benchmark includes multiple datasets from industrial and medical domains, implementing fifteen state-of-the-art methods and nine comprehensive metrics.
We objectively reveal the strengths and weaknesses of different methods and provide insights into the challenges and future directions of multi-class visual anomaly detection.
arXiv Detail & Related papers (2024-06-05T13:40:07Z) - Video Anomaly Detection in 10 Years: A Survey and Outlook [10.143205531474907]
Video anomaly detection (VAD) holds immense importance across diverse domains such as surveillance, healthcare, and environmental monitoring.
This survey explores deep learning-based VAD, expanding beyond traditional supervised training paradigms to encompass emerging weakly supervised, self-supervised, and unsupervised approaches.
arXiv Detail & Related papers (2024-05-29T17:56:31Z) - RadGenome-Chest CT: A Grounded Vision-Language Dataset for Chest CT Analysis [56.57177181778517]
RadGenome-Chest CT is a large-scale, region-guided 3D chest CT interpretation dataset based on CT-RATE.
We leverage the latest powerful universal segmentation and large language models to extend the original datasets.
arXiv Detail & Related papers (2024-04-25T17:11:37Z) - A Survey on Domain Generalization for Medical Image Analysis [9.410880477358942]
Domain Generalization for MedIA aims to address the domain shift challenge by generalizing effectively and performing robustly across unknown data distributions.
We provide a formal definition of domain shift and domain generalization in medical field, and discuss several related settings.
We summarize the recent methods from three viewpoints: data manipulation level, feature representation level, and model training level, and present some algorithms in detail.
arXiv Detail & Related papers (2024-02-07T17:08:27Z) - VISION Datasets: A Benchmark for Vision-based InduStrial InspectiON [28.511625423590605]
VISION datasets are diverse collection of 14 industrial inspection datasets.
With a total of 18k images encompassing 44 defect types, VISION strives to mirror a wide range of real-world production scenarios.
arXiv Detail & Related papers (2023-06-13T16:31:02Z) - A Comprehensive Survey on Edge Data Integrity Verification: Fundamentals and Future Trends [43.174689394432804]
We show current research status, open problems, and potentially promising insights for readers to further investigate this under-explored field.
To thoroughly assess prior research efforts, we synthesize a universal criteria framework that an effective verification approach should satisfy.
We highlight intriguing research challenges and possible directions for future work, along with a discussion on how forthcoming technology, e.g., machine learning and context-aware security, can augment security in EC.
arXiv Detail & Related papers (2022-10-20T02:58:36Z) - AlignTransformer: Hierarchical Alignment of Visual Regions and Disease
Tags for Medical Report Generation [50.21065317817769]
We propose an AlignTransformer framework, which includes the Align Hierarchical Attention (AHA) and the Multi-Grained Transformer (MGT) modules.
Experiments on the public IU-Xray and MIMIC-CXR datasets show that the AlignTransformer can achieve results competitive with state-of-the-art methods on the two datasets.
arXiv Detail & Related papers (2022-03-18T13:43:53Z) - A Survey of Visual Sensory Anomaly Detection [53.23336329817023]
Visual sensory anomaly detection (AD) is an essential problem in computer vision.
We provide a comprehensive review of visual sensory AD and category into three levels according to the form of anomalies.
arXiv Detail & Related papers (2022-02-14T19:50:03Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.