Real-IAD Variety: Pushing Industrial Anomaly Detection Dataset to a Modern Era
- URL: http://arxiv.org/abs/2511.00540v1
- Date: Sat, 01 Nov 2025 12:58:02 GMT
- Title: Real-IAD Variety: Pushing Industrial Anomaly Detection Dataset to a Modern Era
- Authors: Wenbing Zhu, Chengjie Wang, Bin-Bin Gao, Jiangning Zhang, Guannan Jiang, Jie Hu, Zhenye Gan, Lidong Wang, Ziqing Zhou, Linjie Cheng, Yurui Pan, Bo Peng, Mingmin Chi, Lizhuang Ma,
- Abstract summary: Real-IAD Variety is the largest and most diverse IAD benchmark, comprising 198,960 high-resolution images across 160 distinct object categories.<n>Its diversity is ensured through comprehensive coverage of 28 industries, 24 material types, and 22 color variations.<n>Real-IAD Variety will be made publicly available to facilitate innovation in this critical field.
- Score: 110.83702639978469
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Industrial Anomaly Detection (IAD) is critical for enhancing operational safety, ensuring product quality, and optimizing manufacturing efficiency across global industries. However, the IAD algorithms are severely constrained by the limitations of existing public benchmarks. Current datasets exhibit restricted category diversity and insufficient scale, frequently resulting in metric saturation and limited model transferability to real-world scenarios. To address this gap, we introduce Real-IAD Variety, the largest and most diverse IAD benchmark, comprising 198,960 high-resolution images across 160 distinct object categories. Its diversity is ensured through comprehensive coverage of 28 industries, 24 material types, and 22 color variations. Our comprehensive experimental analysis validates the benchmark's substantial challenge: state-of-the-art multi-class unsupervised anomaly detection methods experience significant performance degradation when scaled from 30 to 160 categories. Crucially, we demonstrate that vision-language models exhibit remarkable robustness to category scale-up, with minimal performance variation across different category counts, significantly enhancing generalization capabilities in diverse industrial contexts. The unprecedented scale and complexity of Real-IAD Variety position it as an essential resource for training and evaluating next-generation foundation models for anomaly detection. By providing this comprehensive benchmark with rigorous evaluation protocols across multi-class unsupervised, multi-view, and zero-/few-shot settings, we aim to accelerate research beyond domain-specific constraints, enabling the development of scalable, general-purpose anomaly detection systems. Real-IAD Variety will be made publicly available to facilitate innovation in this critical field.
Related papers
- Can AI Generate more Comprehensive Test Scenarios? Review on Automated Driving Systems Test Scenario Generation Methods [19.39586739934126]
This review systematically analyzes 31 primary studies,and 10 surveys identified through a comprehensive search spanning 20152025.<n>Traditional approaches rely on expert knowledge,ontologies,and naturalistic driving or accident data,while recent developments leverage generative models,including large language models, adversarial networks,diffusion models,and reinforcement learning frameworks,to synthesize diverse and safety-critical scenarios.
arXiv Detail & Related papers (2025-12-17T13:14:15Z) - ADNet: A Large-Scale and Extensible Multi-Domain Benchmark for Anomaly Detection Across 380 Real-World Categories [26.951550574484553]
We introduce ADNet, a large-scale, multi-domain benchmark for anomaly detection.<n>The benchmark includes a total of 196,294 RGB images, consisting of 116,192 normal samples for training and 80,102 test images, of which 60,311 are anomalous.<n>Dinomaly-m is a context-guided Mixture-of-Experts that expands decoder capacity without increasing inference cost.
arXiv Detail & Related papers (2025-11-25T10:47:48Z) - SVC 2025: the First Multimodal Deception Detection Challenge [16.070848946361696]
The SVC 2025 Multimodal Deception Detection Challenge is a new benchmark designed to evaluate cross-domain generalization in audio-visual deception detection.<n>We aim to foster the development of more adaptable, explainable, and practically deployable deception detection systems.
arXiv Detail & Related papers (2025-08-06T06:56:39Z) - Unsupervised Anomaly Detection in Multivariate Time Series across Heterogeneous Domains [0.8427519082414066]
We introduce a unifying framework for benchmarking unsupervised anomaly detection methods.<n>We then highlight the problem of shifts in normal behaviors that can occur in practical AIOps scenarios.<n>To tackle anomaly detection under domain shift, we propose a novel approach, Domain-Invariant VAE for Anomaly Detection.
arXiv Detail & Related papers (2025-03-29T12:38:28Z) - EIAD: Explainable Industrial Anomaly Detection Via Multi-Modal Large Language Models [23.898938659720503]
Industrial Anomaly Detection (IAD) is critical to ensure product quality during manufacturing.<n>We propose a novel approach that introduces a dedicated multi-modal defect localization module to decouple the dialog functionality from the core feature extraction.<n>We also contribute to the first multi-modal industrial anomaly detection training dataset, named Defect Detection Question Answering (DDQA)
arXiv Detail & Related papers (2025-03-18T11:33:29Z) - A Comprehensive Library for Benchmarking Multi-class Visual Anomaly Detection [89.92916473403108]
This paper proposes a comprehensive visual anomaly detection benchmark, ADer, which is a modular framework for new methods.<n>The benchmark includes multiple datasets from industrial and medical domains, implementing fifteen state-of-the-art methods and nine comprehensive metrics.<n>We objectively reveal the strengths and weaknesses of different methods and provide insights into the challenges and future directions of multi-class visual anomaly detection.
arXiv Detail & Related papers (2024-06-05T13:40:07Z) - Learning Feature Inversion for Multi-class Anomaly Detection under General-purpose COCO-AD Benchmark [101.23684938489413]
Anomaly detection (AD) is often focused on detecting anomalies for industrial quality inspection and medical lesion examination.
This work first constructs a large-scale and general-purpose COCO-AD dataset by extending COCO to the AD field.
Inspired by the metrics in the segmentation field, we propose several more practical threshold-dependent AD-specific metrics.
arXiv Detail & Related papers (2024-04-16T17:38:26Z) - Real-IAD: A Real-World Multi-View Dataset for Benchmarking Versatile Industrial Anomaly Detection [46.495442380849894]
We propose a large-scale, Real-world, and multi-view Industrial Anomaly Detection dataset, named Real-IAD.
It contains 150K high-resolution images of 30 different objects, an order of magnitude larger than existing datasets.
To make the dataset closer to real application scenarios, we adopted a multi-view shooting method and proposed sample-level evaluation metrics.
arXiv Detail & Related papers (2024-03-19T09:44:41Z) - Wild Face Anti-Spoofing Challenge 2023: Benchmark and Results [73.98594459933008]
Face anti-spoofing (FAS) is an essential mechanism for safeguarding the integrity of automated face recognition systems.
This limitation can be attributed to the scarcity and lack of diversity in publicly available FAS datasets.
We introduce the Wild Face Anti-Spoofing dataset, a large-scale, diverse FAS dataset collected in unconstrained settings.
arXiv Detail & Related papers (2023-04-12T10:29:42Z) - Anomaly Detection Based on Selection and Weighting in Latent Space [73.01328671569759]
We propose a novel selection-and-weighting-based anomaly detection framework called SWAD.
Experiments on both benchmark and real-world datasets have shown the effectiveness and superiority of SWAD.
arXiv Detail & Related papers (2021-03-08T10:56:38Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.