Anomalous Sound Detection using unsupervised and semi-supervised
autoencoders and gammatone audio representation
- URL: http://arxiv.org/abs/2006.15321v1
- Date: Sat, 27 Jun 2020 08:25:47 GMT
- Title: Anomalous Sound Detection using unsupervised and semi-supervised
autoencoders and gammatone audio representation
- Authors: Sergi Perez-Castanos, Javier Naranjo-Alcazar, Pedro Zuccarello and
Maximo Cobos
- Abstract summary: This paper proposes a novel framework based on convolutional autoencoders and a Gammatone-based representation of the audio.
The early detection of malfunctions or damage in machines can mean great savings and an improvement in the efficiency of industrial processes.
- Score: 4.591851728010269
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Anomalous sound detection (ASD) is, nowadays, one of the topical subjects in
machine listening discipline. Unsupervised detection is attracting a lot of
interest due to its immediate applicability in many fields. For example,
related to industrial processes, the early detection of malfunctions or damage
in machines can mean great savings and an improvement in the efficiency of
industrial processes. This problem can be solved with an unsupervised ASD
solution since industrial machines will not be damaged simply by having this
audio data in the training stage. This paper proposes a novel framework based
on convolutional autoencoders (both unsupervised and semi-supervised) and a
Gammatone-based representation of the audio. The results obtained by these
architectures substantially exceed the results presented as a baseline.
Related papers
- Activity-Guided Industrial Anomalous Sound Detection against Interferences [8.864726245462908]
We propose SSAD, a framework of source separation (SS) followed by anomaly detection (AD)
SSAD consists of two components: (i) activity-informed SS, enabling effective source separation even given interference with similar timbre, and (ii) two-step masking, robustifying anomaly detection by emphasizing anomalies aligned with the machine activity.
Our experiments demonstrate that SSAD achieves comparable accuracy to a baseline with full access to clean signals, while SSAD is provided only a corrupted signal and activity information.
arXiv Detail & Related papers (2024-09-03T13:26:25Z) - GeneralAD: Anomaly Detection Across Domains by Attending to Distorted Features [68.14842693208465]
GeneralAD is an anomaly detection framework designed to operate in semantic, near-distribution, and industrial settings.
We propose a novel self-supervised anomaly generation module that employs straightforward operations like noise addition and shuffling to patch features.
We extensively evaluated our approach on ten datasets, achieving state-of-the-art results in six and on-par performance in the remaining.
arXiv Detail & Related papers (2024-07-17T09:27:41Z) - SoftPatch: Unsupervised Anomaly Detection with Noisy Data [67.38948127630644]
This paper considers label-level noise in image sensory anomaly detection for the first time.
We propose a memory-based unsupervised AD method, SoftPatch, which efficiently denoises the data at the patch level.
Compared with existing methods, SoftPatch maintains a strong modeling ability of normal data and alleviates the overconfidence problem in coreset.
arXiv Detail & Related papers (2024-03-21T08:49:34Z) - Acoustic Signal Analysis with Deep Neural Network for Detecting Fault
Diagnosis in Industrial Machines [0.0]
In this study, a deep learning-based system was designed to analyze the sound signals produced by industrial machines.
The proposed method reached an accuracy rate varying between 97.17% and 99.87% at different Sound Noise Rate levels.
arXiv Detail & Related papers (2023-12-02T08:09:27Z) - Unsupervised Anomaly Detection and Localization of Machine Audio: A
GAN-based Approach [17.85309428707623]
AEGAN-AD is a totally unsupervised approach in which the generator is trained to reconstruct input spectrograms.
The performance of AEGAN-AD on the dataset of DCASE 2022 Challenge TASK 2 demonstrates the state-of-the-art result.
arXiv Detail & Related papers (2023-03-31T10:27:36Z) - Deepfake audio detection by speaker verification [79.99653758293277]
We propose a new detection approach that leverages only the biometric characteristics of the speaker, with no reference to specific manipulations.
The proposed approach can be implemented based on off-the-shelf speaker verification tools.
We test several such solutions on three popular test sets, obtaining good performance, high generalization ability, and high robustness to audio impairment.
arXiv Detail & Related papers (2022-09-28T13:46:29Z) - Should All Proposals be Treated Equally in Object Detection? [110.27485090952385]
The complexity-precision trade-off of an object detector is a critical problem for resource constrained vision tasks.
It is hypothesized that improved detection efficiency requires a paradigm shift, towards the unequal processing of proposals.
This results in better utilization of available computational budget, enabling higher accuracy for the same FLOPS.
arXiv Detail & Related papers (2022-07-07T18:26:32Z) - Canonical Polyadic Decomposition and Deep Learning for Machine Fault
Detection [0.0]
It is impossible to collect enough data to learn all types of faults from a machine.
New algorithms, trained using data from healthy conditions only, were developed to perform unsupervised anomaly detection.
A key issue in the development of these algorithms is the noise in the signals, as it impacts the anomaly detection performance.
arXiv Detail & Related papers (2021-07-20T14:06:50Z) - Spotting adversarial samples for speaker verification by neural vocoders [102.1486475058963]
We adopt neural vocoders to spot adversarial samples for automatic speaker verification (ASV)
We find that the difference between the ASV scores for the original and re-synthesize audio is a good indicator for discrimination between genuine and adversarial samples.
Our codes will be made open-source for future works to do comparison.
arXiv Detail & Related papers (2021-07-01T08:58:16Z) - Anomaly Detection Based on Selection and Weighting in Latent Space [73.01328671569759]
We propose a novel selection-and-weighting-based anomaly detection framework called SWAD.
Experiments on both benchmark and real-world datasets have shown the effectiveness and superiority of SWAD.
arXiv Detail & Related papers (2021-03-08T10:56:38Z) - Acoustic Anomaly Detection for Machine Sounds based on Image Transfer
Learning [8.828131257265369]
In this paper, we consider acoustic malfunction detection via transfer learning.
We use neural networks that were pretrained on the task of image classification.
We find that features extracted from ResNet based networks yield better results than those from AlexNet and Squeezenet.
arXiv Detail & Related papers (2020-06-05T13:29:12Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.