Unsupervised Visual Defect Detection with Score-Based Generative Model
- URL: http://arxiv.org/abs/2211.16092v1
- Date: Tue, 29 Nov 2022 11:06:29 GMT
- Title: Unsupervised Visual Defect Detection with Score-Based Generative Model
- Authors: Yapeng Teng, Haoyang Li, Fuzhen Cai, Ming Shao, Siyu Xia
- Abstract summary: We focus on the unsupervised visual defect detection and localization tasks.
We propose a novel framework based on the recent score-based generative models.
We evaluate our method on several datasets to demonstrate its effectiveness.
- Score: 17.610722842950555
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Anomaly Detection (AD), as a critical problem, has been widely discussed. In
this paper, we specialize in one specific problem, Visual Defect Detection
(VDD), in many industrial applications. And in practice, defect image samples
are very rare and difficult to collect. Thus, we focus on the unsupervised
visual defect detection and localization tasks and propose a novel framework
based on the recent score-based generative models, which synthesize the real
image by iterative denoising through stochastic differential equations (SDEs).
Our work is inspired by the fact that with noise injected into the original
image, the defects may be changed into normal cases in the denoising process
(i.e., reconstruction). First, based on the assumption that the anomalous data
lie in the low probability density region of the normal data distribution, we
explain a common phenomenon that occurs when reconstruction-based approaches
are applied to VDD: normal pixels also change during the reconstruction
process. Second, due to the differences in normal pixels between the
reconstructed and original images, a time-dependent gradient value (i.e.,
score) of normal data distribution is utilized as a metric, rather than
reconstruction loss, to gauge the defects. Third, a novel $T$ scales approach
is developed to dramatically reduce the required number of iterations,
accelerating the inference process. These practices allow our model to
generalize VDD in an unsupervised manner while maintaining reasonably good
performance. We evaluate our method on several datasets to demonstrate its
effectiveness.
Related papers
- Leveraging Latent Diffusion Models for Training-Free In-Distribution Data Augmentation for Surface Defect Detection [9.784793380119806]
We introduce DIAG, a training-free Diffusion-based In-distribution Anomaly Generation pipeline for data augmentation.
Unlike conventional image generation techniques, we implement a human-in-the-loop pipeline, where domain experts provide multimodal guidance to the model.
We demonstrate the efficacy and versatility of DIAG with respect to state-of-the-art data augmentation approaches on the challenging KSDD2 dataset.
arXiv Detail & Related papers (2024-07-04T14:28:52Z) - GLAD: Towards Better Reconstruction with Global and Local Adaptive Diffusion Models for Unsupervised Anomaly Detection [60.78684630040313]
Diffusion models tend to reconstruct normal counterparts of test images with certain noises added.
From the global perspective, the difficulty of reconstructing images with different anomalies is uneven.
We propose a global and local adaptive diffusion model (abbreviated to GLAD) for unsupervised anomaly detection.
arXiv Detail & Related papers (2024-06-11T17:27:23Z) - Video Anomaly Detection via Spatio-Temporal Pseudo-Anomaly Generation : A Unified Approach [49.995833831087175]
This work proposes a novel method for generating generic Video-temporal PAs by inpainting a masked out region of an image.
In addition, we present a simple unified framework to detect real-world anomalies under the OCC setting.
Our method performs on par with other existing state-of-the-art PAs generation and reconstruction based methods under the OCC setting.
arXiv Detail & Related papers (2023-11-27T13:14:06Z) - The role of noise in denoising models for anomaly detection in medical
images [62.0532151156057]
Pathological brain lesions exhibit diverse appearance in brain images.
Unsupervised anomaly detection approaches have been proposed using only normal data for training.
We show that optimization of the spatial resolution and magnitude of the noise improves the performance of different model training regimes.
arXiv Detail & Related papers (2023-01-19T21:39:38Z) - CRADL: Contrastive Representations for Unsupervised Anomaly Detection
and Localization [2.8659934481869715]
Unsupervised anomaly detection in medical imaging aims to detect and localize arbitrary anomalies without requiring anomalous data during training.
Most current state-of-the-art methods use latent variable generative models operating directly on the images.
We propose CRADL whose core idea is to model the distribution of normal samples directly in the low-dimensional representation space of an encoder trained with a contrastive pretext-task.
arXiv Detail & Related papers (2023-01-05T16:07:49Z) - Self-Supervised Training with Autoencoders for Visual Anomaly Detection [61.62861063776813]
We focus on a specific use case in anomaly detection where the distribution of normal samples is supported by a lower-dimensional manifold.
We adapt a self-supervised learning regime that exploits discriminative information during training but focuses on the submanifold of normal examples.
We achieve a new state-of-the-art result on the MVTec AD dataset -- a challenging benchmark for visual anomaly detection in the manufacturing domain.
arXiv Detail & Related papers (2022-06-23T14:16:30Z) - Denoising Diffusion Restoration Models [110.1244240726802]
Denoising Diffusion Restoration Models (DDRM) is an efficient, unsupervised posterior sampling method.
We demonstrate DDRM's versatility on several image datasets for super-resolution, deblurring, inpainting, and colorization.
arXiv Detail & Related papers (2022-01-27T20:19:07Z) - DASVDD: Deep Autoencoding Support Vector Data Descriptor for Anomaly
Detection [9.19194451963411]
Semi-supervised anomaly detection aims to detect anomalies from normal samples using a model that is trained on normal data.
We propose a method, DASVDD, that jointly learns the parameters of an autoencoder while minimizing the volume of an enclosing hyper-sphere on its latent representation.
arXiv Detail & Related papers (2021-06-09T21:57:41Z) - Deep Variational Network Toward Blind Image Restoration [60.45350399661175]
Blind image restoration is a common yet challenging problem in computer vision.
We propose a novel blind image restoration method, aiming to integrate both the advantages of them.
Experiments on two typical blind IR tasks, namely image denoising and super-resolution, demonstrate that the proposed method achieves superior performance over current state-of-the-arts.
arXiv Detail & Related papers (2020-08-25T03:30:53Z) - Anomaly localization by modeling perceptual features [3.04585143845864]
Feature-Augmented VAE is trained by reconstructing the input image in pixel space, and also in several different feature spaces.
It achieves clear improvement over state-of-the-art methods on the MVTec anomaly detection and localization datasets.
arXiv Detail & Related papers (2020-08-12T15:09:13Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.