Analysis of Feature Representations for Anomalous Sound Detection
- URL: http://arxiv.org/abs/2012.06282v1
- Date: Fri, 11 Dec 2020 12:31:50 GMT
- Title: Analysis of Feature Representations for Anomalous Sound Detection
- Authors: Robert M\"uller, Steffen Illium, Fabian Ritz, Kyrill Schmid
- Abstract summary: We evaluate the efficacy of pretrained neural networks as feature extractors for anomalous sound detection.
We leverage the knowledge that is contained in these neural networks to extract semantically rich features.
Our approach is evaluated on recordings from factory machinery such as valves, pumps, sliders and fans.
- Score: 3.4782990087904597
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: In this work, we thoroughly evaluate the efficacy of pretrained neural
networks as feature extractors for anomalous sound detection. In doing so, we
leverage the knowledge that is contained in these neural networks to extract
semantically rich features (representations) that serve as input to a Gaussian
Mixture Model which is used as a density estimator to model normality. We
compare feature extractors that were trained on data from various domains,
namely: images, environmental sounds and music. Our approach is evaluated on
recordings from factory machinery such as valves, pumps, sliders and fans. All
of the evaluated representations outperform the autoencoder baseline with music
based representations yielding the best performance in most cases. These
results challenge the common assumption that closely matching the domain of the
feature extractor and the downstream task results in better downstream task
performance.
Related papers
- PREMAP: A Unifying PREiMage APproximation Framework for Neural Networks [30.701422594374456]
We present a framework for preimage abstraction that produces under- and over-approximations of any polyhedral output set.
We evaluate our method on a range of tasks, demonstrating significant improvement in efficiency and scalability to high-input-dimensional image classification tasks.
arXiv Detail & Related papers (2024-08-17T17:24:47Z) - Noise-Resilient Unsupervised Graph Representation Learning via Multi-Hop Feature Quality Estimation [53.91958614666386]
Unsupervised graph representation learning (UGRL) based on graph neural networks (GNNs)
We propose a novel UGRL method based on Multi-hop feature Quality Estimation (MQE)
arXiv Detail & Related papers (2024-07-29T12:24:28Z) - Learning with Noisy Foundation Models [95.50968225050012]
This paper is the first work to comprehensively understand and analyze the nature of noise in pre-training datasets.
We propose a tuning method (NMTune) to affine the feature space to mitigate the malignant effect of noise and improve generalization.
arXiv Detail & Related papers (2024-03-11T16:22:41Z) - Frequency and Scale Perspectives of Feature Extraction [5.081561820537235]
We analyze the sensitivity of neural networks to frequencies and scales.
We find that neural networks have low- and medium-frequency biases but also prefer different frequency bands for different classes.
These observations lead to the hypothesis that neural networks must learn the ability to extract features at various scales and frequencies.
arXiv Detail & Related papers (2023-02-24T06:37:36Z) - Improving the Robustness of Summarization Models by Detecting and
Removing Input Noise [50.27105057899601]
We present a large empirical study quantifying the sometimes severe loss in performance from different types of input noise for a range of datasets and model sizes.
We propose a light-weight method for detecting and removing such noise in the input during model inference without requiring any training, auxiliary models, or even prior knowledge of the type of noise.
arXiv Detail & Related papers (2022-12-20T00:33:11Z) - Self-supervised Audiovisual Representation Learning for Remote Sensing Data [96.23611272637943]
We propose a self-supervised approach for pre-training deep neural networks in remote sensing.
By exploiting the correspondence between geo-tagged audio recordings and remote sensing, this is done in a completely label-free manner.
We show that our approach outperforms existing pre-training strategies for remote sensing imagery.
arXiv Detail & Related papers (2021-08-02T07:50:50Z) - Generalizing Face Forgery Detection with High-frequency Features [63.33397573649408]
Current CNN-based detectors tend to overfit to method-specific color textures and thus fail to generalize.
We propose to utilize the high-frequency noises for face forgery detection.
The first is the multi-scale high-frequency feature extraction module that extracts high-frequency noises at multiple scales.
The second is the residual-guided spatial attention module that guides the low-level RGB feature extractor to concentrate more on forgery traces from a new perspective.
arXiv Detail & Related papers (2021-03-23T08:19:21Z) - Local Competition and Stochasticity for Adversarial Robustness in Deep
Learning [8.023314613846418]
This work addresses adversarial robustness in deep learning by considering deep networks with local winner-takes-all activations.
This type of network units result in sparse representations from each model layer, as the units are organized in blocks where only one unit generates a non-zero output.
arXiv Detail & Related papers (2021-01-04T17:40:52Z) - Score-informed Networks for Music Performance Assessment [64.12728872707446]
Deep neural network-based methods incorporating score information into MPA models have not yet been investigated.
We introduce three different models capable of score-informed performance assessment.
arXiv Detail & Related papers (2020-08-01T07:46:24Z) - Acoustic Anomaly Detection for Machine Sounds based on Image Transfer
Learning [8.828131257265369]
In this paper, we consider acoustic malfunction detection via transfer learning.
We use neural networks that were pretrained on the task of image classification.
We find that features extracted from ResNet based networks yield better results than those from AlexNet and Squeezenet.
arXiv Detail & Related papers (2020-06-05T13:29:12Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.