Omni-frequency Channel-selection Representations for Unsupervised
Anomaly Detection
- URL: http://arxiv.org/abs/2203.00259v2
- Date: Mon, 3 Jul 2023 09:54:11 GMT
- Title: Omni-frequency Channel-selection Representations for Unsupervised
Anomaly Detection
- Authors: Yufei Liang, Jiangning Zhang, Shiwei Zhao, Runze Wu, Yong Liu, and
Shuwen Pan
- Abstract summary: We propose a novel Omni-frequency Channel-selection Reconstruction (OCR-GAN) network to handle the anomaly detection task from a frequency perspective.
Our approach reaches 98.3 detection AUC on the MVTec AD dataset, surpassing the reconstruction-based baseline by +38.1 and the current SOTA method by +0.3.
- Score: 11.926787216956459
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Density-based and classification-based methods have dominated unsupervised
anomaly detection in recent years, while reconstruction-based methods are
rarely mentioned because of their poor reconstruction ability and low performance.
However, the latter require no costly extra training samples for
unsupervised training, which makes them more practical, so this paper focuses on
improving this kind of method and proposes a novel Omni-frequency
Channel-selection Reconstruction (OCR-GAN) network to handle the anomaly detection
task from a frequency perspective. Concretely, we propose a Frequency
Decoupling (FD) module to decouple the input image into different frequency
components and model the reconstruction process as a combination of parallel
omni-frequency image restorations, as we observe a significant difference in
the frequency distribution of normal and abnormal images. Given the correlation
among multiple frequencies, we further propose a Channel Selection (CS) module
that performs frequency interaction among different encoders by adaptively
selecting different channels. Extensive experiments demonstrate the
effectiveness and superiority of our approach over different kinds of methods,
e.g., achieving a new state-of-the-art 98.3 detection AUC on the MVTec AD
dataset without extra training data, which markedly surpasses the
reconstruction-based baseline by +38.1 and the current SOTA method by +0.3.
Source code is available at https://github.com/zhangzjn/OCR-GAN.
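The abstract does not say how the Frequency Decoupling (FD) module splits an image, so the following is only a minimal sketch of the general idea, not the OCR-GAN implementation. It assumes a grayscale image and uses hard radial masks in the Fourier domain; the function name `frequency_decouple` and the `cutoffs` values are hypothetical choices made for illustration. Because the masks partition the frequency plane, the returned bands sum back to the input image.

```python
import numpy as np


def frequency_decouple(image, cutoffs=(0.1, 0.3)):
    """Split a 2-D grayscale image into len(cutoffs) + 1 frequency bands.

    Illustrative only: hard radial band masks in the Fourier domain.
    `cutoffs` are in cycles per pixel and are arbitrary example values.
    """
    height, width = image.shape
    # Per-axis frequencies in cycles per pixel, broadcast to a 2-D radius map.
    freq_y = np.fft.fftfreq(height)[:, None]
    freq_x = np.fft.fftfreq(width)[None, :]
    radius = np.sqrt(freq_x ** 2 + freq_y ** 2)

    spectrum = np.fft.fft2(image)
    edges = (0.0, *cutoffs, float("inf"))

    bands = []
    for low, high in zip(edges[:-1], edges[1:]):
        # Each frequency bin falls into exactly one band.
        mask = (radius >= low) & (radius < high)
        bands.append(np.fft.ifft2(spectrum * mask).real)
    return bands


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    demo_image = rng.random((16, 16))
    low, mid, high = frequency_decouple(demo_image)
    # The bands are a lossless decomposition of the input.
    print(np.allclose(low + mid + high, demo_image))
```

In a reconstruction-based detector along the lines the abstract describes, each band would be restored by its own branch and the restored bands recombined. A learned or smoother decomposition (for example Gaussian filtering) could replace the hard masks used here.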
Related papers
- CATCH: Channel-Aware multivariate Time Series Anomaly Detection via Frequency Patching [24.927390742543707]
We introduce CATCH, a framework based on frequency patching.
We propose a Channel Fusion Module (CFM) which features a patch-wise mask generator and a masked-attention mechanism.
Experiments on 9 real-world datasets and 12 synthetic datasets demonstrate that CATCH achieves state-of-the-art performance.
arXiv Detail & Related papers (2024-10-16T05:58:55Z)
- M3DM-NR: RGB-3D Noisy-Resistant Industrial Anomaly Detection via Multimodal Denoising [63.39134873744748]
Existing industrial anomaly detection methods primarily concentrate on unsupervised learning with pristine RGB images.
This paper proposes a novel noise-resistant M3DM-NR framework to leverage strong multi-modal discriminative capabilities of CLIP.
Extensive experiments show that M3DM-NR outperforms state-of-the-art methods in 3D-RGB multi-modal noisy anomaly detection.
arXiv Detail & Related papers (2024-06-04T12:33:02Z)
- Deep OFDM Channel Estimation: Capturing Frequency Recurrence [10.76835122839777]
We propose a deep-learning-based channel estimation scheme in an OFDM system.
We employ recurrent neural network techniques within a single OFDM slot, thus overcoming the latency and memory constraints.
The proposed SisRafNet delivers superior estimation performance compared to existing deep-learning-based channel estimation techniques.
arXiv Detail & Related papers (2024-01-07T14:13:08Z)
- DiAD: A Diffusion-based Framework for Multi-class Anomaly Detection [55.48770333927732]
We propose a Diffusion-based Anomaly Detection (DiAD) framework for multi-class anomaly detection.
It consists of a pixel-space autoencoder, a latent-space Semantic-Guided (SG) network with a connection to the stable diffusion's denoising network, and a feature-space pre-trained feature extractor.
Experiments on MVTec-AD and VisA datasets demonstrate the effectiveness of our approach.
arXiv Detail & Related papers (2023-12-11T18:38:28Z)
- Implicit neural representation for change detection [15.741202788959075]
Most commonly used approaches to detecting changes in point clouds are based on supervised methods.
We propose an unsupervised approach that comprises two components: Implicit Neural Representation (INR) for continuous shape reconstruction and a Gaussian Mixture Model for categorising changes.
We apply our method to a benchmark dataset comprising simulated LiDAR point clouds for urban sprawling.
arXiv Detail & Related papers (2023-07-28T09:26:00Z)
- Score-based Source Separation with Applications to Digital Communication Signals [72.6570125649502]
We propose a new method for separating superimposed sources using diffusion-based generative models.
Motivated by applications in radio-frequency (RF) systems, we are interested in sources with underlying discrete nature.
Our method can be viewed as a multi-source extension to the recently proposed score distillation sampling scheme.
arXiv Detail & Related papers (2023-06-26T04:12:40Z)
- ChannelAugment: Improving generalization of multi-channel ASR by training with input channel randomization [6.42706307642403]
End-to-end (E2E) multi-channel ASR systems show state-of-the-art performance in far-field ASR tasks.
The main limitation of such systems is that they are usually trained with data from a fixed array geometry.
We present a simple and effective data augmentation technique, which is based on randomly dropping channels in the multi-channel audio input during training.
arXiv Detail & Related papers (2021-09-23T09:13:47Z)
- Generalizing Face Forgery Detection with High-frequency Features [63.33397573649408]
Current CNN-based detectors tend to overfit to method-specific color textures and thus fail to generalize.
We propose to utilize the high-frequency noises for face forgery detection.
One proposed module is the multi-scale high-frequency feature extraction module, which extracts high-frequency noises at multiple scales.
Another is the residual-guided spatial attention module, which guides the low-level RGB feature extractor to concentrate more on forgery traces from a new perspective.
arXiv Detail & Related papers (2021-03-23T08:19:21Z)
- Robust Unsupervised Video Anomaly Detection by Multi-Path Frame Prediction [61.17654438176999]
We propose a novel and robust unsupervised video anomaly detection method by frame prediction with proper design.
Our proposed method obtains the frame-level AUROC score of 88.3% on the CUHK Avenue dataset.
arXiv Detail & Related papers (2020-11-05T11:34:12Z)
- Improved Slice-wise Tumour Detection in Brain MRIs by Computing Dissimilarities between Latent Representations [68.8204255655161]
Anomaly detection for Magnetic Resonance Images (MRIs) can be solved with unsupervised methods.
We have proposed a slice-wise semi-supervised method for tumour detection based on the computation of a dissimilarity function in the latent space of a Variational AutoEncoder.
We show that by training the models on higher resolution images and by improving the quality of the reconstructions, we obtain results which are comparable with different baselines.
arXiv Detail & Related papers (2020-07-24T14:02:09Z)