Kaputt: A Large-Scale Dataset for Visual Defect Detection
- URL: http://arxiv.org/abs/2510.05903v1
- Date: Tue, 07 Oct 2025 13:13:18 GMT
- Title: Kaputt: A Large-Scale Dataset for Visual Defect Detection
- Authors: Sebastian Höfer, Dorian Henning, Artemij Amiranashvili, Douglas Morrison, Mariliza Tzes, Ingmar Posner, Marc Matvienko, Alessandro Rennola, Anton Milan,
- Abstract summary: We present a novel large-scale dataset for defect detection in a logistics setting.<n>With over 230,000 images (and more than 29,000 defective instances), it is 40 times larger than MVTec-AD.
- Score: 41.85463954775384
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: We present a novel large-scale dataset for defect detection in a logistics setting. Recent work on industrial anomaly detection has primarily focused on manufacturing scenarios with highly controlled poses and a limited number of object categories. Existing benchmarks like MVTec-AD [6] and VisA [33] have reached saturation, with state-of-the-art methods achieving up to 99.9% AUROC scores. In contrast to manufacturing, anomaly detection in retail logistics faces new challenges, particularly in the diversity and variability of object pose and appearance. Leading anomaly detection methods fall short when applied to this new setting. To bridge this gap, we introduce a new benchmark that overcomes the current limitations of existing datasets. With over 230,000 images (and more than 29,000 defective instances), it is 40 times larger than MVTec-AD and contains more than 48,000 distinct objects. To validate the difficulty of the problem, we conduct an extensive evaluation of multiple state-of-the-art anomaly detection methods, demonstrating that they do not surpass 56.96% AUROC on our dataset. Further qualitative analysis confirms that existing methods struggle to leverage normal samples under heavy pose and appearance variation. With our large-scale dataset, we set a new benchmark and encourage future research towards solving this challenging problem in retail logistics anomaly detection. The dataset is available for download under https://www.kaputt-dataset.com.
Related papers
- Track Any Anomalous Object: A Granular Video Anomaly Detection Pipeline [63.96226274616927]
A new framework called Track Any Anomalous Object (TAO) introduces a granular video anomaly detection pipeline.<n>Unlike methods that assign anomaly scores to every pixel, our approach transforms the problem into pixel-level tracking of anomalous objects.<n>Experiments demonstrate that TAO sets new benchmarks in accuracy and robustness.
arXiv Detail & Related papers (2025-06-05T15:49:39Z) - The MVTec AD 2 Dataset: Advanced Scenarios for Unsupervised Anomaly Detection [3.9682699334026563]
We present MVTec AD 2, a collection of eight anomaly detection scenarios with more than 8000 high-resolution images.<n>It comprises challenging and highly relevant industrial inspection use cases that have not been considered in previous datasets.<n>Our dataset provides test scenarios with lighting condition changes to assess the robustness of methods under real-world distribution shifts.
arXiv Detail & Related papers (2025-03-27T15:41:46Z) - CableInspect-AD: An Expert-Annotated Anomaly Detection Dataset [14.246172794156987]
$textitCableInspect-AD$ is a high-quality dataset created and annotated by domain experts from Hydro-Qu'ebec, a Canadian public utility.
This dataset includes high-resolution images with challenging real-world anomalies, covering defects with varying severity levels.
We present a comprehensive evaluation protocol based on cross-validation to assess models' performances.
arXiv Detail & Related papers (2024-09-30T14:50:13Z) - A Comprehensive Library for Benchmarking Multi-class Visual Anomaly Detection [89.92916473403108]
This paper proposes a comprehensive visual anomaly detection benchmark, ADer, which is a modular framework for new methods.<n>The benchmark includes multiple datasets from industrial and medical domains, implementing fifteen state-of-the-art methods and nine comprehensive metrics.<n>We objectively reveal the strengths and weaknesses of different methods and provide insights into the challenges and future directions of multi-class visual anomaly detection.
arXiv Detail & Related papers (2024-06-05T13:40:07Z) - Real-IAD: A Real-World Multi-View Dataset for Benchmarking Versatile Industrial Anomaly Detection [46.495442380849894]
We propose a large-scale, Real-world, and multi-view Industrial Anomaly Detection dataset, named Real-IAD.
It contains 150K high-resolution images of 30 different objects, an order of magnitude larger than existing datasets.
To make the dataset closer to real application scenarios, we adopted a multi-view shooting method and proposed sample-level evaluation metrics.
arXiv Detail & Related papers (2024-03-19T09:44:41Z) - Self-supervised Feature Adaptation for 3D Industrial Anomaly Detection [59.41026558455904]
We focus on multi-modal anomaly detection. Specifically, we investigate early multi-modal approaches that attempted to utilize models pre-trained on large-scale visual datasets.
We propose a Local-to-global Self-supervised Feature Adaptation (LSFA) method to finetune the adaptors and learn task-oriented representation toward anomaly detection.
arXiv Detail & Related papers (2024-01-06T07:30:41Z) - PKU-GoodsAD: A Supermarket Goods Dataset for Unsupervised Anomaly
Detection and Segmentation [6.950686169363205]
This dataset contains 6124 high-resolution images of 484 different appearance goods divided into 6 categories.
It follows the unsupervised setting and only normal (defect-free) images are used for training.
We also conduct a thorough evaluation of current state-of-the-art unsupervised anomaly detection methods.
arXiv Detail & Related papers (2023-07-11T01:17:00Z) - An Outlier Exposure Approach to Improve Visual Anomaly Detection
Performance for Mobile Robots [76.36017224414523]
We consider the problem of building visual anomaly detection systems for mobile robots.
Standard anomaly detection models are trained using large datasets composed only of non-anomalous data.
We tackle the problem of exploiting these data to improve the performance of a Real-NVP anomaly detection model.
arXiv Detail & Related papers (2022-09-20T15:18:13Z) - Empirical Upper Bound, Error Diagnosis and Invariance Analysis of Modern
Object Detectors [47.64219291655723]
We employ 2 state-of-the-art object detection benchmarks, and analyze more than 15 models over 4 large scale datasets.
We find that models generate a lot of boxes on empty regions and that context is more important for detecting small objects than larger ones.
arXiv Detail & Related papers (2020-04-05T06:19:43Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.