IM-IAD: Industrial Image Anomaly Detection Benchmark in Manufacturing
- URL: http://arxiv.org/abs/2301.13359v5
- Date: Sun, 28 Jan 2024 02:20:41 GMT
- Title: IM-IAD: Industrial Image Anomaly Detection Benchmark in Manufacturing
- Authors: Guoyang Xie, Jinbao Wang, Jiaqi Liu, Jiayi Lyu, Yong Liu, Chengjie
Wang, Feng Zheng, Yaochu Jin
- Abstract summary: Image anomaly detection (IAD) is an emerging and vital computer vision task in industrial manufacturing.
The lack of a uniform IM benchmark is hindering the development and usage of IAD methods in real-world applications.
We construct a comprehensive image anomaly detection benchmark (IM-IAD), which includes 19 algorithms on seven major datasets.
- Score: 88.35145788575348
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Image anomaly detection (IAD) is an emerging and vital computer vision task
in industrial manufacturing (IM). Recently, many advanced algorithms have been
reported, but their performance deviates considerably with various IM settings.
We realize that the lack of a uniform IM benchmark is hindering the development
and usage of IAD methods in real-world applications. In addition, it is
difficult for researchers to analyze IAD algorithms without a uniform
benchmark. To solve this problem, we propose a uniform IM benchmark, for the
first time, to assess how well these algorithms perform, which includes various
levels of supervision (unsupervised versus fully supervised), learning
paradigms (few-shot, continual and noisy label), and efficiency (memory usage
and inference speed). Then, we construct a comprehensive image anomaly
detection benchmark (IM-IAD), which includes 19 algorithms on seven major
datasets with a uniform setting. Extensive experiments (17,017 total) on IM-IAD
provide in-depth insights into IAD algorithm redesign or selection. Moreover,
the proposed IM-IAD benchmark challenges existing algorithms and suggests
future research directions. To foster reproducibility and accessibility, the
source code of IM-IAD is uploaded on the website,
https://github.com/M-3LAB/IM-IAD.
Related papers
- Segmenting Medical Images with Limited Data [7.83073107607804]
We present a semi-supervised, consistency-based approach termed the data-efficient medical segmenter (DEMS)
The DEMS features an encoder-decoder architecture and incorporates the developed online automatic augmenter (OAA) and residual robustness enhancement (RRE) blocks.
Under extreme data shortage scenarios, our DEMS achieves 16.85% and 10.37% improvement in dice score compared with the U-Net and top-performed state-of-the-art method, respectively.
arXiv Detail & Related papers (2024-07-12T11:41:18Z) - CANDY: A Benchmark for Continuous Approximate Nearest Neighbor Search with Dynamic Data Ingestion [8.036012885171166]
We introduce CANDY, a benchmark tailored for Continuous Approximate Nearest Neighbor Search with Dynamic Data Ingestion.
CANDY comprehensively assesses a wide range of AKNN algorithms, integrating advanced optimizations such as machine learning-driven inference.
Our evaluations across diverse datasets demonstrate that simpler AKNN baselines often surpass more complex alternatives in terms of recall and latency.
arXiv Detail & Related papers (2024-06-28T04:46:11Z) - A Comprehensive Library for Benchmarking Multi-class Visual Anomaly Detection [52.228708947607636]
This paper introduces a comprehensive visual anomaly detection benchmark, ADer, which is a modular framework for new methods.
The benchmark includes multiple datasets from industrial and medical domains, implementing fifteen state-of-the-art methods and nine comprehensive metrics.
We objectively reveal the strengths and weaknesses of different methods and provide insights into the challenges and future directions of multi-class visual anomaly detection.
arXiv Detail & Related papers (2024-06-05T13:40:07Z) - A-Bench: Are LMMs Masters at Evaluating AI-generated Images? [78.3699767628502]
A-Bench is a benchmark designed to diagnose whether multi-modal models (LMMs) are masters at evaluating AI-generated images (AIGIs)
Ultimately, 2,864 AIGIs from 16 text-to-image models are sampled, each paired with question-answers annotated by human experts, and tested across 18 leading LMMs.
arXiv Detail & Related papers (2024-06-05T08:55:02Z) - A Weight-aware-based Multi-source Unsupervised Domain Adaptation Method for Human Motion Intention Recognition [11.78805948637625]
unsupervised domain adaptation (UDA) method has become an effective way to this problem.
The labeled data are collected from multiple source subjects that might be different not only from the target subject but also from each other.
This paper develops a novel theory and algorithm for UDA to recognize HMI, where the margin disparity discrepancy (MDD) is extended to multi-source UDA theory.
The developed multi-source UDA theory is theoretical and the generalization error on target subject is guaranteed.
arXiv Detail & Related papers (2024-04-19T03:49:54Z) - Learning Feature Inversion for Multi-class Anomaly Detection under General-purpose COCO-AD Benchmark [101.23684938489413]
Anomaly detection (AD) is often focused on detecting anomalies for industrial quality inspection and medical lesion examination.
This work first constructs a large-scale and general-purpose COCO-AD dataset by extending COCO to the AD field.
Inspired by the metrics in the segmentation field, we propose several more practical threshold-dependent AD-specific metrics.
arXiv Detail & Related papers (2024-04-16T17:38:26Z) - Assessor360: Multi-sequence Network for Blind Omnidirectional Image
Quality Assessment [50.82681686110528]
Blind Omnidirectional Image Quality Assessment (BOIQA) aims to objectively assess the human perceptual quality of omnidirectional images (ODIs)
The quality assessment of ODIs is severely hampered by the fact that the existing BOIQA pipeline lacks the modeling of the observer's browsing process.
We propose a novel multi-sequence network for BOIQA called Assessor360, which is derived from the realistic multi-assessor ODI quality assessment procedure.
arXiv Detail & Related papers (2023-05-18T13:55:28Z) - Anomaly Detection Based on Selection and Weighting in Latent Space [73.01328671569759]
We propose a novel selection-and-weighting-based anomaly detection framework called SWAD.
Experiments on both benchmark and real-world datasets have shown the effectiveness and superiority of SWAD.
arXiv Detail & Related papers (2021-03-08T10:56:38Z) - Morphing Attack Detection -- Database, Evaluation Platform and
Benchmarking [16.77282920396874]
Morphing attacks have posed a severe threat to Face Recognition System (FRS)
Despite the number of advancements reported in recent works, we note serious open issues such as independent benchmarking, generalizability challenges and considerations to age, gender, ethnicity that are inadequately addressed.
In this work, we present a new sequestered dataset for facilitating the advancements of MAD.
arXiv Detail & Related papers (2020-06-11T14:11:09Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.