ReContrast: Domain-Specific Anomaly Detection via Contrastive
Reconstruction
- URL: http://arxiv.org/abs/2306.02602v3
- Date: Tue, 24 Oct 2023 05:39:45 GMT
- Title: ReContrast: Domain-Specific Anomaly Detection via Contrastive
Reconstruction
- Authors: Jia Guo, Shuai Lu, Lize Jia, Weihang Zhang, Huiqi Li
- Abstract summary: Most advanced unsupervised anomaly detection (UAD) methods rely on modeling feature representations of frozen encoder networks pre-trained on large-scale datasets.
We propose a novel epistemic UAD method, namely ReContrast, which optimize the entire network to reduce biases towards the pre-trained image domain.
We conduct experiments across two popular industrial defect detection benchmarks and three medical image UAD tasks, which shows our superiority over current state-of-the-art methods.
- Score: 29.370142078092375
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Most advanced unsupervised anomaly detection (UAD) methods rely on modeling
feature representations of frozen encoder networks pre-trained on large-scale
datasets, e.g. ImageNet. However, the features extracted from the encoders that
are borrowed from natural image domains coincide little with the features
required in the target UAD domain, such as industrial inspection and medical
imaging. In this paper, we propose a novel epistemic UAD method, namely
ReContrast, which optimizes the entire network to reduce biases towards the
pre-trained image domain and orients the network in the target domain. We start
with a feature reconstruction approach that detects anomalies from errors.
Essentially, the elements of contrastive learning are elegantly embedded in
feature reconstruction to prevent the network from training instability,
pattern collapse, and identical shortcut, while simultaneously optimizing both
the encoder and decoder on the target domain. To demonstrate our transfer
ability on various image domains, we conduct extensive experiments across two
popular industrial defect detection benchmarks and three medical image UAD
tasks, which shows our superiority over current state-of-the-art methods.
Related papers
- Source-free Domain Adaptive Object Detection in Remote Sensing Images [11.19538606490404]
We propose a source-free object detection (SFOD) setting for RS images.
It aims to perform target domain adaptation using only the source pre-trained model.
Our method does not require access to source domain RS images.
arXiv Detail & Related papers (2024-01-31T15:32:44Z) - DiAD: A Diffusion-based Framework for Multi-class Anomaly Detection [55.48770333927732]
We propose a Difusion-based Anomaly Detection (DiAD) framework for multi-class anomaly detection.
It consists of a pixel-space autoencoder, a latent-space Semantic-Guided (SG) network with a connection to the stable diffusion's denoising network, and a feature-space pre-trained feature extractor.
Experiments on MVTec-AD and VisA datasets demonstrate the effectiveness of our approach.
arXiv Detail & Related papers (2023-12-11T18:38:28Z) - Affine-Consistent Transformer for Multi-Class Cell Nuclei Detection [76.11864242047074]
We propose a novel Affine-Consistent Transformer (AC-Former), which directly yields a sequence of nucleus positions.
We introduce an Adaptive Affine Transformer (AAT) module, which can automatically learn the key spatial transformations to warp original images for local network training.
Experimental results demonstrate that the proposed method significantly outperforms existing state-of-the-art algorithms on various benchmarks.
arXiv Detail & Related papers (2023-10-22T02:27:02Z) - In-Domain GAN Inversion for Faithful Reconstruction and Editability [132.68255553099834]
We propose in-domain GAN inversion, which consists of a domain-guided domain-regularized and a encoder to regularize the inverted code in the native latent space of the pre-trained GAN model.
We make comprehensive analyses on the effects of the encoder structure, the starting inversion point, as well as the inversion parameter space, and observe the trade-off between the reconstruction quality and the editing property.
arXiv Detail & Related papers (2023-09-25T08:42:06Z) - Unsupervised Industrial Anomaly Detection via Pattern Generative and Contrastive Networks [6.393288885927437]
We propose Vision Transformer based (VIT) unsupervised anomaly detection network.
It utilizes a hierarchical task learning and human experience to enhance its interpretability.
Our method achieves 99.8% AUC, which surpasses previous state-of-the-art methods.
arXiv Detail & Related papers (2022-07-20T10:09:53Z) - InDuDoNet+: A Model-Driven Interpretable Dual Domain Network for Metal
Artifact Reduction in CT Images [53.4351366246531]
We construct a novel interpretable dual domain network, termed InDuDoNet+, into which CT imaging process is finely embedded.
We analyze the CT values among different tissues, and merge the prior observations into a prior network for our InDuDoNet+, which significantly improve its generalization performance.
arXiv Detail & Related papers (2021-12-23T15:52:37Z) - REPLICA: Enhanced Feature Pyramid Network by Local Image Translation and
Conjunct Attention for High-Resolution Breast Tumor Detection [6.112883009328882]
We call our method enhanced featuREsynthesis network by Local Image translation and Conjunct Attention, or REPLICA.
We use a convolutional autoencoder as a generator to create new images by injecting objects into images via local Pyramid and reconstruction of their features extracted in hidden layers.
Then due to the larger number of simulated images, we use a visual transformer to enhance outputs of each ResNet layer that serve as inputs to a feature pyramid network.
arXiv Detail & Related papers (2021-11-22T21:33:02Z) - Adversarial Domain Feature Adaptation for Bronchoscopic Depth Estimation [111.89519571205778]
In this work, we propose an alternative domain-adaptive approach to depth estimation.
Our novel two-step structure first trains a depth estimation network with labeled synthetic images in a supervised manner.
The results of our experiments show that the proposed method improves the network's performance on real images by a considerable margin.
arXiv Detail & Related papers (2021-09-24T08:11:34Z) - Unsupervised Domain Adaptation from Synthetic to Real Images for
Anchorless Object Detection [0.0]
This paper implements unsupervised domain adaptation methods on an anchorless object detector.
In our work, we use CenterNet, one of the most recent anchorless architectures, for a domain adaptation problem involving synthetic images.
arXiv Detail & Related papers (2020-12-15T10:51:43Z) - D-Unet: A Dual-encoder U-Net for Image Splicing Forgery Detection and
Localization [108.8592577019391]
Image splicing forgery detection is a global binary classification task that distinguishes the tampered and non-tampered regions by image fingerprints.
We propose a novel network called dual-encoder U-Net (D-Unet) for image splicing forgery detection, which employs an unfixed encoder and a fixed encoder.
In an experimental comparison study of D-Unet and state-of-the-art methods, D-Unet outperformed the other methods in image-level and pixel-level detection.
arXiv Detail & Related papers (2020-12-03T10:54:02Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.