Learnable Wavelet Packet Transform for Data-Adapted Spectrograms
- URL: http://arxiv.org/abs/2201.11069v1
- Date: Wed, 26 Jan 2022 17:28:17 GMT
- Title: Learnable Wavelet Packet Transform for Data-Adapted Spectrograms
- Authors: Frusque Gaetan and Fink Olga
- Abstract summary: We propose a framework for learnable wavelet packet transforms, enabling to learn features automatically from data.
The learned features can be represented as a spectrogram, containing the important time-frequency information of the dataset.
We evaluate the properties and performance of the proposed approach by evaluating its improved spectral leakage and by applying it to an anomaly detection task for acoustic monitoring.
- Score: 0.0
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Capturing high-frequency data concerning the condition of complex systems,
e.g. by acoustic monitoring, has become increasingly prevalent. Such
high-frequency signals typically contain time dependencies ranging over
different time scales and different types of cyclic behaviors. Processing such
signals requires careful feature engineering, particularly the extraction of
meaningful time-frequency features. This can be time-consuming and the
performance is often dependent on the choice of parameters. To address these
limitations, we propose a deep learning framework for learnable wavelet packet
transforms, enabling to learn features automatically from data and optimise
them with respect to the defined objective function. The learned features can
be represented as a spectrogram, containing the important time-frequency
information of the dataset. We evaluate the properties and performance of the
proposed approach by evaluating its improved spectral leakage and by applying
it to an anomaly detection task for acoustic monitoring.
Related papers
- Investigation of Time-Frequency Feature Combinations with Histogram Layer Time Delay Neural Networks [37.95478822883338]
Methods by which audio signals are converted into time-frequency representations can significantly impact performance.
This work demonstrates the performance impact of using different combinations of time-frequency features in a histogram layer time delay neural network.
arXiv Detail & Related papers (2024-09-20T20:22:24Z) - TSLANet: Rethinking Transformers for Time Series Representation Learning [19.795353886621715]
Time series data is characterized by its intrinsic long and short-range dependencies.
We introduce a novel Time Series Lightweight Network (TSLANet) as a universal convolutional model for diverse time series tasks.
Our experiments demonstrate that TSLANet outperforms state-of-the-art models in various tasks spanning classification, forecasting, and anomaly detection.
arXiv Detail & Related papers (2024-04-12T13:41:29Z) - Frequency-Aware Deepfake Detection: Improving Generalizability through
Frequency Space Learning [81.98675881423131]
This research addresses the challenge of developing a universal deepfake detector that can effectively identify unseen deepfake images.
Existing frequency-based paradigms have relied on frequency-level artifacts introduced during the up-sampling in GAN pipelines to detect forgeries.
We introduce a novel frequency-aware approach called FreqNet, centered around frequency domain learning, specifically designed to enhance the generalizability of deepfake detectors.
arXiv Detail & Related papers (2024-03-12T01:28:00Z) - Topology combined machine learning for consonant recognition [8.188982461393278]
TopCap is capable of capturing features rarely detected in datasets with low dimensional intrinsicity.
In classifying voiced and voiceless consonants, TopCap achieves an accuracy exceeding 96%.
TopCap is geared towards designing topological convolutional layers for deep learning of speech and audio signals.
arXiv Detail & Related papers (2023-11-26T06:53:56Z) - State Sequences Prediction via Fourier Transform for Representation
Learning [111.82376793413746]
We propose State Sequences Prediction via Fourier Transform (SPF), a novel method for learning expressive representations efficiently.
We theoretically analyze the existence of structural information in state sequences, which is closely related to policy performance and signal regularity.
Experiments demonstrate that the proposed method outperforms several state-of-the-art algorithms in terms of both sample efficiency and performance.
arXiv Detail & Related papers (2023-10-24T14:47:02Z) - Deep Spectro-temporal Artifacts for Detecting Synthesized Speech [57.42110898920759]
This paper provides an overall assessment of track 1 (Low-quality Fake Audio Detection) and track 2 (Partially Fake Audio Detection)
In this paper, spectro-temporal artifacts were detected using raw temporal signals, spectral features, as well as deep embedding features.
We ranked 4th and 5th in track 1 and track 2, respectively.
arXiv Detail & Related papers (2022-10-11T08:31:30Z) - A Robust and Explainable Data-Driven Anomaly Detection Approach For
Power Electronics [56.86150790999639]
We present two anomaly detection and classification approaches, namely the Matrix Profile algorithm and anomaly transformer.
The Matrix Profile algorithm is shown to be well suited as a generalizable approach for detecting real-time anomalies in streaming time-series data.
A series of custom filters is created and added to the detector to tune its sensitivity, recall, and detection accuracy.
arXiv Detail & Related papers (2022-09-23T06:09:35Z) - Learning summary features of time series for likelihood free inference [93.08098361687722]
We present a data-driven strategy for automatically learning summary features from time series data.
Our results indicate that learning summary features from data can compete and even outperform LFI methods based on hand-crafted values.
arXiv Detail & Related papers (2020-12-04T19:21:37Z) - Benchmarking Deep Learning Interpretability in Time Series Predictions [41.13847656750174]
Saliency methods are used extensively to highlight the importance of input features in model predictions.
We set out to extensively compare the performance of various saliency-based interpretability methods across diverse neural architectures.
arXiv Detail & Related papers (2020-10-26T22:07:53Z) - Change Point Detection in Time Series Data using Autoencoders with a
Time-Invariant Representation [69.34035527763916]
Change point detection (CPD) aims to locate abrupt property changes in time series data.
Recent CPD methods demonstrated the potential of using deep learning techniques, but often lack the ability to identify more subtle changes in the autocorrelation statistics of the signal.
We employ an autoencoder-based methodology with a novel loss function, through which the used autoencoders learn a partially time-invariant representation that is tailored for CPD.
arXiv Detail & Related papers (2020-08-21T15:03:21Z) - RobustPeriod: Time-Frequency Mining for Robust Multiple Periodicity
Detection [36.254037216142244]
We propose a robust and general framework for multiple periodicity detection.
Our algorithm applies maximal overlap discrete wavelet transform to transform the time series into multiple temporal-frequency scales.
Experiments on synthetic and real-world datasets show that our algorithm outperforms other popular ones for both single and multiple periodicity detection.
arXiv Detail & Related papers (2020-02-21T20:10:36Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.