Related papers: A Survey on Visual Anomaly Detection: Challenge, Approach, and Prospect

A Survey on Visual Anomaly Detection: Challenge, Approach, and Prospect

URL: http://arxiv.org/abs/2401.16402v1
Date: Mon, 29 Jan 2024 18:41:21 GMT
Title: A Survey on Visual Anomaly Detection: Challenge, Approach, and Prospect
Authors: Yunkang Cao, Xiaohao Xu, Jiangning Zhang, Yuqi Cheng, Xiaonan Huang, Guansong Pang, Weiming Shen
Abstract summary: Visual Anomaly Detection (VAD) endeavors to pinpoint deviations from the concept of normality in visual data, widely applied across diverse domains, e.g., industrial defect inspection, and medical lesion detection. This survey comprehensively examines recent advancements in VAD by identifying three primary challenges: 1) scarcity of training data, 2) diversity of visual modalities, and 3) complexity of hierarchical anomalies.
Score: 29.006716009327032
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Visual Anomaly Detection (VAD) endeavors to pinpoint deviations from the concept of normality in visual data, widely applied across diverse domains, e.g., industrial defect inspection, and medical lesion detection. This survey comprehensively examines recent advancements in VAD by identifying three primary challenges: 1) scarcity of training data, 2) diversity of visual modalities, and 3) complexity of hierarchical anomalies. Starting with a brief overview of the VAD background and its generic concept definitions, we progressively categorize, emphasize, and discuss the latest VAD progress from the perspective of sample number, data modality, and anomaly hierarchy. Through an in-depth analysis of the VAD field, we finally summarize future developments for VAD and conclude the key findings and contributions of this survey.

Related papers

3D-RAD: A Comprehensive 3D Radiology Med-VQA Dataset with Multi-Temporal Analysis and Diverse Diagnostic Tasks [14.366478737339909]
Medical Visual Question Answering (Med-VQA) holds significant potential for clinical decision support.<n>This paper presents 3D-RAD, a large-scale dataset designed to advance 3D Med-VQA using radiology CT scans.
arXiv Detail & Related papers (2025-06-11T09:55:42Z)
Out-of-Distribution Detection on Graphs: A Survey [58.47395497985277]
Graph out-of-distribution (GOOD) detection focuses on identifying graph data that deviates from the distribution seen during training. We categorize existing methods into four types: enhancement-based, reconstruction-based, information propagation-based, and classification-based approaches. We discuss practical applications and theoretical foundations, highlighting the unique challenges posed by graph data.
arXiv Detail & Related papers (2025-02-12T04:07:12Z)
Foundation Models for Anomaly Detection: Vision and Challenges [19.2255593926904]
Foundation models (FMs) have emerged as a powerful tool for advancing anomaly detection. This survey presents the first comprehensive review of recent advancements in FM-based anomaly detection.
arXiv Detail & Related papers (2025-02-10T05:01:08Z)
MG-3D: Multi-Grained Knowledge-Enhanced 3D Medical Vision-Language Pre-training [7.968487067774351]
3D medical image analysis is pivotal in numerous clinical applications. Large-scale vision-language pre-training remains underexplored in 3D medical image analysis. We propose MG-3D, pre-trained on large-scale data (47.1K)
arXiv Detail & Related papers (2024-12-08T09:45:59Z)
Deep Learning for Video Anomaly Detection: A Review [52.74513211976795]
Video anomaly detection (VAD) aims to discover behaviors or events deviating from the normality in videos. In the era of deep learning, a great variety of deep learning based methods are constantly emerging for the VAD task. This review covers the spectrum of five different categories, namely, semi-supervised, weakly supervised, fully supervised, unsupervised and open-set supervised VAD.
arXiv Detail & Related papers (2024-09-09T07:31:16Z)
Generalized Out-of-Distribution Detection and Beyond in Vision Language Model Era: A Survey [107.08019135783444]
We first present a generalized OOD detection v2, encapsulating the evolution of AD, ND, OSR, OOD detection, and OD in the VLM era. Our framework reveals that, with some field inactivity and integration, the demanding challenges have become OOD detection and AD.
arXiv Detail & Related papers (2024-07-31T17:59:58Z)
A Comprehensive Library for Benchmarking Multi-class Visual Anomaly Detection [52.228708947607636]
This paper introduces a comprehensive visual anomaly detection benchmark, ADer, which is a modular framework for new methods. The benchmark includes multiple datasets from industrial and medical domains, implementing fifteen state-of-the-art methods and nine comprehensive metrics. We objectively reveal the strengths and weaknesses of different methods and provide insights into the challenges and future directions of multi-class visual anomaly detection.
arXiv Detail & Related papers (2024-06-05T13:40:07Z)
Video Anomaly Detection in 10 Years: A Survey and Outlook [10.143205531474907]
Video anomaly detection (VAD) holds immense importance across diverse domains such as surveillance, healthcare, and environmental monitoring. This survey explores deep learning-based VAD, expanding beyond traditional supervised training paradigms to encompass emerging weakly supervised, self-supervised, and unsupervised approaches.
arXiv Detail & Related papers (2024-05-29T17:56:31Z)
RadGenome-Chest CT: A Grounded Vision-Language Dataset for Chest CT Analysis [56.57177181778517]
RadGenome-Chest CT is a large-scale, region-guided 3D chest CT interpretation dataset based on CT-RATE. We leverage the latest powerful universal segmentation and large language models to extend the original datasets.
arXiv Detail & Related papers (2024-04-25T17:11:37Z)
A Survey on Domain Generalization for Medical Image Analysis [9.410880477358942]
Domain Generalization for MedIA aims to address the domain shift challenge by generalizing effectively and performing robustly across unknown data distributions. We provide a formal definition of domain shift and domain generalization in medical field, and discuss several related settings. We summarize the recent methods from three viewpoints: data manipulation level, feature representation level, and model training level, and present some algorithms in detail.
arXiv Detail & Related papers (2024-02-07T17:08:27Z)
VISION Datasets: A Benchmark for Vision-based InduStrial InspectiON [28.511625423590605]
VISION datasets are diverse collection of 14 industrial inspection datasets. With a total of 18k images encompassing 44 defect types, VISION strives to mirror a wide range of real-world production scenarios.
arXiv Detail & Related papers (2023-06-13T16:31:02Z)
A Comprehensive Survey on Edge Data Integrity Verification: Fundamentals and Future Trends [43.174689394432804]
We show current research status, open problems, and potentially promising insights for readers to further investigate this under-explored field. To thoroughly assess prior research efforts, we synthesize a universal criteria framework that an effective verification approach should satisfy. We highlight intriguing research challenges and possible directions for future work, along with a discussion on how forthcoming technology, e.g., machine learning and context-aware security, can augment security in EC.
arXiv Detail & Related papers (2022-10-20T02:58:36Z)
AlignTransformer: Hierarchical Alignment of Visual Regions and Disease Tags for Medical Report Generation [50.21065317817769]
We propose an AlignTransformer framework, which includes the Align Hierarchical Attention (AHA) and the Multi-Grained Transformer (MGT) modules. Experiments on the public IU-Xray and MIMIC-CXR datasets show that the AlignTransformer can achieve results competitive with state-of-the-art methods on the two datasets.
arXiv Detail & Related papers (2022-03-18T13:43:53Z)
A Survey of Visual Sensory Anomaly Detection [53.23336329817023]
Visual sensory anomaly detection (AD) is an essential problem in computer vision. We provide a comprehensive review of visual sensory AD and category into three levels according to the form of anomalies.
arXiv Detail & Related papers (2022-02-14T19:50:03Z)

This list is automatically generated from the titles and abstracts of the papers in this site.