Acoustic-Net: A Novel Neural Network for Sound Localization and
Quantification
- URL: http://arxiv.org/abs/2203.16988v1
- Date: Thu, 31 Mar 2022 12:20:09 GMT
- Title: Acoustic-Net: A Novel Neural Network for Sound Localization and
Quantification
- Authors: Guanxing Zhou, Hao Liang, Xinghao Ding, Yue Huang, Xiaotong Tu, and
Saqlain Abbas
- Abstract summary: A novel neural network, termed the Acoustic-Net, is proposed to locate and quantify the sound source simply using the original signals.
The experiments demonstrate that the proposed method significantly improves the accuracy of sound source prediction and the computing speed.
- Score: 28.670240455952317
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Acoustic source localization has been applied in different fields, such as
aeronautics and ocean science, generally using multiple microphones array data
to reconstruct the source location. However, the model-based beamforming
methods fail to achieve the high-resolution of conventional beamforming maps.
Deep neural networks are also appropriate to locate the sound source, but in
general, these methods with complex network structures are hard to be
recognized by hardware. In this paper, a novel neural network, termed the
Acoustic-Net, is proposed to locate and quantify the sound source simply using
the original signals. The experiments demonstrate that the proposed method
significantly improves the accuracy of sound source prediction and the
computing speed, which may generalize well to real data. The code and trained
models are available at https://github.com/JoaquinChou/Acoustic-Net.
Related papers
- Real Acoustic Fields: An Audio-Visual Room Acoustics Dataset and Benchmark [65.79402756995084]
Real Acoustic Fields (RAF) is a new dataset that captures real acoustic room data from multiple modalities.
RAF is the first dataset to provide densely captured room acoustic data.
arXiv Detail & Related papers (2024-03-27T17:59:56Z) - Dual input neural networks for positional sound source localization [19.07039703121673]
We introduce Dual Input Neural Networks (DI-NNs) as a simple and effective way to model these two data types in a neural network.
We train and evaluate our proposed DI-NN on scenarios of varying difficulty and realism and compare it against an alternative architecture.
Our results show that the DI-NN significantly outperforms the baselines, achieving a five times lower localization error than the LS method and two times lower than the CRNN in a test dataset of real recordings.
arXiv Detail & Related papers (2023-08-08T09:59:56Z) - Histogram Layer Time Delay Neural Networks for Passive Sonar
Classification [58.720142291102135]
A novel method combines a time delay neural network and histogram layer to incorporate statistical contexts for improved feature learning and underwater acoustic target classification.
The proposed method outperforms the baseline model, demonstrating the utility in incorporating statistical contexts for passive sonar target recognition.
arXiv Detail & Related papers (2023-07-25T19:47:26Z) - Millimeter Wave Localization with Imperfect Training Data using Shallow
Neural Networks [13.454939391912095]
We propose a shallow neural network model to localize mmWave devices indoors.
This model requires significantly fewer weights than those proposed in the literature.
We also propose to relieve training data collection efforts by retrieving location estimates from geometry-based mmWave localization algorithms.
arXiv Detail & Related papers (2021-12-09T16:03:30Z) - Three-Way Deep Neural Network for Radio Frequency Map Generation and
Source Localization [67.93423427193055]
Monitoring wireless spectrum over spatial, temporal, and frequency domains will become a critical feature in beyond-5G and 6G communication technologies.
In this paper, we present a Generative Adversarial Network (GAN) machine learning model to interpolate irregularly distributed measurements across the spatial domain.
arXiv Detail & Related papers (2021-11-23T22:25:10Z) - A Review of Sound Source Localization with Deep Learning Methods [71.18444724397486]
This article is a review on deep learning methods for single and multiple sound source localization.
We provide an exhaustive topography of the neural-based localization literature in this context.
Tables summarizing the literature review are provided at the end of the review for a quick search of methods with a given set of target characteristics.
arXiv Detail & Related papers (2021-09-08T07:25:39Z) - Voice Activity Detection for Transient Noisy Environment Based on
Diffusion Nets [13.558688470594674]
We address voice activity detection in acoustic environments of transients and stationary noises.
We exploit unique spatial patterns of speech and non-speech audio frames by independently learning their underlying geometric structure.
A deep neural network is trained to separate speech from non-speech frames.
arXiv Detail & Related papers (2021-06-25T17:05:26Z) - PILOT: Introducing Transformers for Probabilistic Sound Event
Localization [107.78964411642401]
This paper introduces a novel transformer-based sound event localization framework, where temporal dependencies in the received multi-channel audio signals are captured via self-attention mechanisms.
The framework is evaluated on three publicly available multi-source sound event localization datasets and compared against state-of-the-art methods in terms of localization error and event detection accuracy.
arXiv Detail & Related papers (2021-06-07T18:29:19Z) - AutoSpeech: Neural Architecture Search for Speaker Recognition [108.69505815793028]
We propose the first neural architecture search approach approach for the speaker recognition tasks, named as AutoSpeech.
Our algorithm first identifies the optimal operation combination in a neural cell and then derives a CNN model by stacking the neural cell for multiple times.
Results demonstrate that the derived CNN architectures significantly outperform current speaker recognition systems based on VGG-M, ResNet-18, and ResNet-34 back-bones, while enjoying lower model complexity.
arXiv Detail & Related papers (2020-05-07T02:53:47Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.