Attention Modules Improve Image-Level Anomaly Detection for Industrial
Inspection: A DifferNet Case Study
- URL: http://arxiv.org/abs/2311.02747v2
- Date: Tue, 7 Nov 2023 15:54:41 GMT
- Title: Attention Modules Improve Image-Level Anomaly Detection for Industrial
Inspection: A DifferNet Case Study
- Authors: Andr\'e Luiz Buarque Vieira e Silva, Francisco Sim\~oes, Danny
Kowerko, Tobias Schlosser, Felipe Battisti, Veronica Teichrieb
- Abstract summary: We propose a DifferNet-based solution enhanced with attention modules: AttentDifferNet.
It improves image-level detection and classification capabilities on three visual anomaly detection datasets for industrial inspection.
Our evaluation shows an average improvement - compared to DifferNet - of 1.77 +/- 0.25 percentage points in overall AUROC.
- Score: 2.2942964892621807
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Within (semi-)automated visual industrial inspection, learning-based
approaches for assessing visual defects, including deep neural networks, enable
the processing of otherwise small defect patterns in pixel size on
high-resolution imagery. The emergence of these often rarely occurring defect
patterns explains the general need for labeled data corpora. To alleviate this
issue and advance the current state of the art in unsupervised visual
inspection, this work proposes a DifferNet-based solution enhanced with
attention modules: AttentDifferNet. It improves image-level detection and
classification capabilities on three visual anomaly detection datasets for
industrial inspection: InsPLAD-fault, MVTec AD, and Semiconductor Wafer. In
comparison to the state of the art, AttentDifferNet achieves improved results,
which are, in turn, highlighted throughout our quali-quantitative study. Our
quantitative evaluation shows an average improvement - compared to DifferNet -
of 1.77 +/- 0.25 percentage points in overall AUROC considering all three
datasets, reaching SOTA results in InsPLAD-fault, an industrial inspection
in-the-wild dataset. As our variants to AttentDifferNet show great prospects in
the context of currently investigated approaches, a baseline is formulated,
emphasizing the importance of attention for industrial anomaly detection both
in the wild and in controlled environments.
Related papers
- Uncertainty Estimation for 3D Object Detection via Evidential Learning [63.61283174146648]
We introduce a framework for quantifying uncertainty in 3D object detection by leveraging an evidential learning loss on Bird's Eye View representations in the 3D detector.
We demonstrate both the efficacy and importance of these uncertainty estimates on identifying out-of-distribution scenes, poorly localized objects, and missing (false negative) detections.
arXiv Detail & Related papers (2024-10-31T13:13:32Z) - CatFree3D: Category-agnostic 3D Object Detection with Diffusion [63.75470913278591]
We introduce a novel pipeline that decouples 3D detection from 2D detection and depth prediction.
We also introduce the Normalised Hungarian Distance (NHD) metric for an accurate evaluation of 3D detection results.
arXiv Detail & Related papers (2024-08-22T22:05:57Z) - Attention Modules Improve Modern Image-Level Anomaly Detection: A
DifferNet Case Study [2.2942964892621807]
This paper proposes a DifferNet-based solution enhanced with attention modules utilizing SENet and CBAM as backbone - AttentDifferNet.
In comparison to the current state of the art, it is shown that AttentDifferNet achieves improved results, which are, in turn, highlighted throughout our quantitative as well as qualitative evaluation.
arXiv Detail & Related papers (2024-01-13T03:09:47Z) - Self-supervised Feature Adaptation for 3D Industrial Anomaly Detection [59.41026558455904]
We focus on multi-modal anomaly detection. Specifically, we investigate early multi-modal approaches that attempted to utilize models pre-trained on large-scale visual datasets.
We propose a Local-to-global Self-supervised Feature Adaptation (LSFA) method to finetune the adaptors and learn task-oriented representation toward anomaly detection.
arXiv Detail & Related papers (2024-01-06T07:30:41Z) - Augment and Criticize: Exploring Informative Samples for Semi-Supervised
Monocular 3D Object Detection [64.65563422852568]
We improve the challenging monocular 3D object detection problem with a general semi-supervised framework.
We introduce a novel, simple, yet effective Augment and Criticize' framework that explores abundant informative samples from unlabeled data.
The two new detectors, dubbed 3DSeMo_DLE and 3DSeMo_FLEX, achieve state-of-the-art results with remarkable improvements for over 3.5% AP_3D/BEV (Easy) on KITTI.
arXiv Detail & Related papers (2023-03-20T16:28:15Z) - Deep Industrial Image Anomaly Detection: A Survey [85.44223757234671]
Recent rapid development of deep learning has laid a milestone in industrial Image Anomaly Detection (IAD)
In this paper, we provide a comprehensive review of deep learning-based image anomaly detection techniques.
We highlight several opening challenges for image anomaly detection.
arXiv Detail & Related papers (2023-01-27T03:18:09Z) - ORA3D: Overlap Region Aware Multi-view 3D Object Detection [11.58746596768273]
Current multi-view 3D object detection methods often fail to detect objects in the overlap region properly.
We propose using the following two main modules: (1) Stereo Disparity Estimation for Weak Depth Supervision and (2) Adrial Overlap Region Discriversaminator.
Our proposed method outperforms current state-of-the-art models, i.e., DETR3D and BEVDet.
arXiv Detail & Related papers (2022-07-02T15:28:44Z) - See Eye to Eye: A Lidar-Agnostic 3D Detection Framework for Unsupervised
Multi-Target Domain Adaptation [7.489722641968593]
We propose an unsupervised multi-target domain adaptation framework, SEE, for transferring the performance of state-of-the-art 3D detectors across lidars.
Our approach interpolates the underlying geometry and normalizes the scan pattern of objects from different lidars before passing them to the detection network.
We demonstrate the effectiveness of SEE on public datasets, achieving state-of-the-art results, and additionally provide quantitative results on a novel high-resolution lidar to prove the industry applications of our framework.
arXiv Detail & Related papers (2021-11-17T23:46:47Z) - Zero-sample surface defect detection and classification based on
semantic feedback neural network [13.796631421521765]
We propose an Ensemble Co-training algorithm, which adaptively reduces the prediction error in image tag embedding from multiple angles.
Various experiments conducted on the zero-shot dataset and the cylinder liner dataset in the industrial field provide competitive results.
arXiv Detail & Related papers (2021-06-15T08:26:36Z) - Adversarial Feature Augmentation and Normalization for Visual
Recognition [109.6834687220478]
Recent advances in computer vision take advantage of adversarial data augmentation to ameliorate the generalization ability of classification models.
Here, we present an effective and efficient alternative that advocates adversarial augmentation on intermediate feature embeddings.
We validate the proposed approach across diverse visual recognition tasks with representative backbone networks.
arXiv Detail & Related papers (2021-03-22T20:36:34Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.