ML-ASPA: A Contemplation of Machine Learning-based Acoustic Signal
Processing Analysis for Sounds, & Strains Emerging Technology
- URL: http://arxiv.org/abs/2402.10005v1
- Date: Mon, 18 Dec 2023 03:04:42 GMT
- Title: ML-ASPA: A Contemplation of Machine Learning-based Acoustic Signal
Processing Analysis for Sounds, & Strains Emerging Technology
- Authors: Ratul Ali, Aktarul Islam, Md. Shohel Rana, Saila Nasrin, Sohel Afzal
Shajol and Professor Dr. A.H.M. Saifullah Sadi
- Abstract summary: This inquiry explores recent advancements and transformative potential within the domain of acoustics, specifically focusing on machine learning (ML) and deep learning.
ML adopts a data-driven approach, unveiling intricate relationships between features and desired labels or actions, as well as among features themselves.
The application of ML to expansive sets of training data facilitates the discovery of models elucidating complex acoustic phenomena such as human speech and reverberation.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Acoustic data serves as a fundamental cornerstone in advancing scientific and
engineering understanding across diverse disciplines, spanning biology,
communications, and ocean and Earth science. This inquiry meticulously explores
recent advancements and transformative potential within the domain of
acoustics, specifically focusing on machine learning (ML) and deep learning.
ML, comprising an extensive array of statistical techniques, proves
indispensable for autonomously discerning and leveraging patterns within data.
In contrast to traditional acoustics and signal processing, ML adopts a
data-driven approach, unveiling intricate relationships between features and
desired labels or actions, as well as among features themselves, given ample
training data. The application of ML to expansive sets of training data
facilitates the discovery of models elucidating complex acoustic phenomena such
as human speech and reverberation. The dynamic evolution of ML in acoustics
yields compelling results and holds substantial promise for the future. The
advent of electronic stethoscopes and analogous recording and data logging
devices has expanded the application of acoustic signal processing concepts to
the analysis of bowel sounds. This paper critically reviews existing literature
on acoustic signal processing for bowel sound analysis, outlining fundamental
approaches and applicable machine learning principles. It chronicles historical
progress in signal processing techniques that have facilitated the extraction
of valuable information from bowel sounds, emphasizing advancements in noise
reduction, segmentation, signal enhancement, feature extraction, sound
localization, and machine learning techniques...
Related papers
- Artificial Intelligence for Cochlear Implants: Review of Strategies, Challenges, and Perspectives [2.608119698700597]
This review aims to comprehensively cover advancements in CI-based ASR and speech enhancement, among other related aspects.
The review will delve into potential applications and suggest future directions to bridge existing research gaps in this domain.
arXiv Detail & Related papers (2024-03-17T11:28:23Z) - Probing the Information Encoded in Neural-based Acoustic Models of
Automatic Speech Recognition Systems [7.207019635697126]
This article aims to determine which and where information is located in an automatic speech recognition acoustic model (AM)
Experiments are performed on speaker verification, acoustic environment classification, gender classification, tempo-distortion detection systems and speech sentiment/emotion identification.
Analysis showed that neural-based AMs hold heterogeneous information that seems surprisingly uncorrelated with phoneme recognition.
arXiv Detail & Related papers (2024-02-29T18:43:53Z) - Informed Meta-Learning [55.2480439325792]
Meta-learning and informed ML stand out as two approaches for incorporating prior knowledge into ML pipelines.
We formalise a hybrid paradigm, informed meta-learning, facilitating the incorporation of priors from unstructured knowledge representations.
We demonstrate the potential benefits of informed meta-learning in improving data efficiency, robustness to observational noise and task distribution shifts.
arXiv Detail & Related papers (2024-02-25T15:08:37Z) - A Review of Machine Learning Methods Applied to Structural Dynamics and
Vibroacoustic [0.0]
Three main applications in Vibroacoustic (SD&V) have taken advantage of Machine Learning (ML)
In Structural Health Monitoring, ML detection and prognosis lead to safe operation and optimized maintenance schedules.
System identification and control design are leveraged by ML techniques in Active Noise Control and Active Vibration Control.
The so-called ML-based surrogate models provide fast alternatives to costly simulations, enabling robust and optimized product design.
arXiv Detail & Related papers (2022-04-13T13:16:21Z) - Physics-driven Synthetic Data Learning for Biomedical Magnetic Resonance [29.338413545265364]
Imaging physics-based data synthesis (IPADS) can provide huge training data in biomedical magnetic resonance without or with few real data.
IPADS generates signals from differential equations or analytical solution models, making the learning more scalable, explainable, and better protecting privacy.
arXiv Detail & Related papers (2022-03-21T17:56:12Z) - Using growth transform dynamical systems for spatio-temporal data
sonification [9.721342507747158]
Sonification, or encoding information in meaningful audio signatures, has several advantages in augmenting or replacing traditional visualization methods for human-in-the-loop decision-making.
This paper presents a novel framework for sonifying high-dimensional data using a complex growth transform dynamical system model.
Our algorithm takes as input the data and optimization parameters underlying the learning or prediction task and combines it with the psycho parameters defined by the user.
arXiv Detail & Related papers (2021-08-21T16:25:59Z) - Improved Speech Emotion Recognition using Transfer Learning and
Spectrogram Augmentation [56.264157127549446]
Speech emotion recognition (SER) is a challenging task that plays a crucial role in natural human-computer interaction.
One of the main challenges in SER is data scarcity.
We propose a transfer learning strategy combined with spectrogram augmentation.
arXiv Detail & Related papers (2021-08-05T10:39:39Z) - Signal Processing and Machine Learning Techniques for Terahertz Sensing:
An Overview [89.09270073549182]
Terahertz (THz) signal generation and radiation methods are shaping the future of wireless systems.
THz-specific signal processing techniques should complement this re-surged interest in THz sensing for efficient utilization of the THz band.
We present an overview of these techniques, with an emphasis on signal pre-processing.
We also address the effectiveness of deep learning techniques by exploring their promising sensing capabilities at the THz band.
arXiv Detail & Related papers (2021-04-09T01:38:34Z) - Federated Learning: A Signal Processing Perspective [144.63726413692876]
Federated learning is an emerging machine learning paradigm for training models across multiple edge devices holding local datasets, without explicitly exchanging the data.
This article provides a unified systematic framework for federated learning in a manner that encapsulates and highlights the main challenges that are natural to treat using signal processing tools.
arXiv Detail & Related papers (2021-03-31T15:14:39Z) - An Overview of Deep-Learning-Based Audio-Visual Speech Enhancement and
Separation [57.68765353264689]
Speech enhancement and speech separation are two related tasks.
Traditionally, these tasks have been tackled using signal processing and machine learning techniques.
Deep learning has been exploited to achieve strong performance.
arXiv Detail & Related papers (2020-08-21T17:24:09Z) - Multilinear Compressive Learning with Prior Knowledge [106.12874293597754]
Multilinear Compressive Learning (MCL) framework combines Multilinear Compressive Sensing and Machine Learning into an end-to-end system.
Key idea behind MCL is the assumption of the existence of a tensor subspace which can capture the essential features from the signal for the downstream learning task.
In this paper, we propose a novel solution to address both of the aforementioned requirements, i.e., How to find those tensor subspaces in which the signals of interest are highly separable?
arXiv Detail & Related papers (2020-02-17T19:06:05Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.