Unsupervised Anomaly Detection in Medical Images with a Memory-augmented
Multi-level Cross-attentional Masked Autoencoder
- URL: http://arxiv.org/abs/2203.11725v2
- Date: Tue, 22 Aug 2023 02:16:37 GMT
- Title: Unsupervised Anomaly Detection in Medical Images with a Memory-augmented
Multi-level Cross-attentional Masked Autoencoder
- Authors: Yu Tian and Guansong Pang and Yuyuan Liu and Chong Wang and Yuanhong
Chen and Fengbei Liu and Rajvinder Singh and Johan W Verjans and Mengyu Wang
and Gustavo Carneiro
- Abstract summary: Unsupervised anomaly detection (UAD) aims to find anomalous images by optimising a detector using a training set that contains only normal images.
UAD approaches can be based on reconstruction methods, self-supervised approaches, and Imagenet pre-trained models.
We introduce a new reconstruction-based UAD approach that addresses the low reconstruction error issue for anomalous images.
- Score: 33.5760501931736
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Unsupervised anomaly detection (UAD) aims to find anomalous images by
optimising a detector using a training set that contains only normal images.
UAD approaches can be based on reconstruction methods, self-supervised
approaches, and Imagenet pre-trained models. Reconstruction methods, which
detect anomalies from image reconstruction errors, are advantageous because
they do not rely on the design of problem-specific pretext tasks needed by
self-supervised approaches, and on the unreliable translation of models
pre-trained from non-medical datasets. However, reconstruction methods may fail
because they can have low reconstruction errors even for anomalous images. In
this paper, we introduce a new reconstruction-based UAD approach that addresses
this low-reconstruction error issue for anomalous images. Our UAD approach, the
memory-augmented multi-level cross-attentional masked autoencoder (MemMC-MAE),
is a transformer-based approach, consisting of a novel memory-augmented
self-attention operator for the encoder and a new multi-level cross-attention
operator for the decoder. MemMCMAE masks large parts of the input image during
its reconstruction, reducing the risk that it will produce low reconstruction
errors because anomalies are likely to be masked and cannot be reconstructed.
However, when the anomaly is not masked, then the normal patterns stored in the
encoder's memory combined with the decoder's multi-level cross attention will
constrain the accurate reconstruction of the anomaly. We show that our method
achieves SOTA anomaly detection and localisation on colonoscopy, pneumonia, and
covid-19 chest x-ray datasets.
Related papers
- DiAD: A Diffusion-based Framework for Multi-class Anomaly Detection [55.48770333927732]
We propose a Difusion-based Anomaly Detection (DiAD) framework for multi-class anomaly detection.
It consists of a pixel-space autoencoder, a latent-space Semantic-Guided (SG) network with a connection to the stable diffusion's denoising network, and a feature-space pre-trained feature extractor.
Experiments on MVTec-AD and VisA datasets demonstrate the effectiveness of our approach.
arXiv Detail & Related papers (2023-12-11T18:38:28Z) - Patch-wise Auto-Encoder for Visual Anomaly Detection [1.7546477549938133]
We propose a novel patch-wise auto-encoder framework, which aims at enhancing the reconstruction ability of AE to anomalies instead of weakening it.
Our method is simple and efficient. It advances the state-of-the-art performances on Mvtec AD benchmark, which proves the effectiveness of our model.
arXiv Detail & Related papers (2023-08-01T10:15:15Z) - ReContrast: Domain-Specific Anomaly Detection via Contrastive
Reconstruction [29.370142078092375]
Most advanced unsupervised anomaly detection (UAD) methods rely on modeling feature representations of frozen encoder networks pre-trained on large-scale datasets.
We propose a novel epistemic UAD method, namely ReContrast, which optimize the entire network to reduce biases towards the pre-trained image domain.
We conduct experiments across two popular industrial defect detection benchmarks and three medical image UAD tasks, which shows our superiority over current state-of-the-art methods.
arXiv Detail & Related papers (2023-06-05T05:21:15Z) - Making Reconstruction-based Method Great Again for Video Anomaly
Detection [64.19326819088563]
Anomaly detection in videos is a significant yet challenging problem.
Existing reconstruction-based methods rely on old-fashioned convolutional autoencoders.
We propose a new autoencoder model for enhanced consecutive frame reconstruction.
arXiv Detail & Related papers (2023-01-28T01:57:57Z) - Self-Supervised Masked Convolutional Transformer Block for Anomaly
Detection [122.4894940892536]
We present a novel self-supervised masked convolutional transformer block (SSMCTB) that comprises the reconstruction-based functionality at a core architectural level.
In this work, we extend our previous self-supervised predictive convolutional attentive block (SSPCAB) with a 3D masked convolutional layer, a transformer for channel-wise attention, as well as a novel self-supervised objective based on Huber loss.
arXiv Detail & Related papers (2022-09-25T04:56:10Z) - Self-Supervised Predictive Convolutional Attentive Block for Anomaly
Detection [97.93062818228015]
We propose to integrate the reconstruction-based functionality into a novel self-supervised predictive architectural building block.
Our block is equipped with a loss that minimizes the reconstruction error with respect to the masked area in the receptive field.
We demonstrate the generality of our block by integrating it into several state-of-the-art frameworks for anomaly detection on image and video.
arXiv Detail & Related papers (2021-11-17T13:30:31Z) - Divide-and-Assemble: Learning Block-wise Memory for Unsupervised Anomaly
Detection [40.778313918994996]
Reconstruction-based methods play an important role in unsupervised anomaly detection in images.
In this work, we interpret the reconstruction of an image as a divide-and-assemble procedure.
We achieve state-of-the-art performance on the challenging MVTec AD dataset.
arXiv Detail & Related papers (2021-07-28T01:14:32Z) - Anomaly Detection By Autoencoder Based On Weighted Frequency Domain Loss [1.0312968200748116]
In image anomaly detection, Autoencoders are the popular methods that reconstruct the input image that might contain anomalies and output a clean image with no abnormalities.
In this paper, we show our method's superiority over the conventional Autoencoder methods by comparing it with AUROC on the MVTec AD dataset.
arXiv Detail & Related papers (2021-05-21T09:10:36Z) - CutPaste: Self-Supervised Learning for Anomaly Detection and
Localization [59.719925639875036]
We propose a framework for building anomaly detectors using normal training data only.
We first learn self-supervised deep representations and then build a generative one-class classifier on learned representations.
Our empirical study on MVTec anomaly detection dataset demonstrates the proposed algorithm is general to be able to detect various types of real-world defects.
arXiv Detail & Related papers (2021-04-08T19:04:55Z) - Improved anomaly detection by training an autoencoder with skip
connections on images corrupted with Stain-shaped noise [25.85927871251385]
anomaly detection relies on the reconstruction residual or, alternatively, on the reconstruction uncertainty.
We consider an autoencoder architecture with skip connections to improve the sharpness of the reconstruction.
We show that this model favors the reconstruction of clean images from arbitrary real-world images, regardless of the actual defects appearance.
arXiv Detail & Related papers (2020-08-29T13:50:49Z) - Improved Slice-wise Tumour Detection in Brain MRIs by Computing
Dissimilarities between Latent Representations [68.8204255655161]
Anomaly detection for Magnetic Resonance Images (MRIs) can be solved with unsupervised methods.
We have proposed a slice-wise semi-supervised method for tumour detection based on the computation of a dissimilarity function in the latent space of a Variational AutoEncoder.
We show that by training the models on higher resolution images and by improving the quality of the reconstructions, we obtain results which are comparable with different baselines.
arXiv Detail & Related papers (2020-07-24T14:02:09Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.