Iterative Sound Source Localization for Unknown Number of Sources
- URL: http://arxiv.org/abs/2206.12273v1
- Date: Fri, 24 Jun 2022 13:19:44 GMT
- Title: Iterative Sound Source Localization for Unknown Number of Sources
- Authors: Yanjie Fu, Meng Ge, Haoran Yin, Xinyuan Qian, Longbiao Wang, Gaoyan
Zhang, Jianwu Dang
- Abstract summary: We propose an iterative sound source localization approach called ISSL, which can iteratively extract each source's DOA without threshold until the termination criterion is met.
Our ISSL achieves significant performance improvements in both DOA estimation and source number detection compared with the existing threshold-based algorithms.
- Score: 57.006589498243336
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Sound source localization aims to seek the direction of arrival (DOA) of all
sound sources from the observed multi-channel audio. For the practical problem
of unknown number of sources, existing localization algorithms attempt to
predict a likelihood-based coding (i.e., spatial spectrum) and employ a
pre-determined threshold to detect the source number and corresponding DOA
value. However, these threshold-based algorithms are not stable since they are
limited by the careful choice of threshold. To address this problem, we propose
an iterative sound source localization approach called ISSL, which can
iteratively extract each source's DOA without threshold until the termination
criterion is met. Unlike threshold-based algorithms, ISSL designs an active
source detector network based on binary classifier to accept residual spatial
spectrum and decide whether to stop the iteration. By doing so, our ISSL can
deal with an arbitrary number of sources, even more than the number of sources
seen during the training stage. The experimental results show that our ISSL
achieves significant performance improvements in both DOA estimation and source
number detection compared with the existing threshold-based algorithms.
Related papers
- Learning to Visually Localize Sound Sources from Mixtures without Prior Source Knowledge [14.801564966406486]
The goal of the multi-sound source localization task is to localize sound sources from the mixture individually.
We present a novel multi-sound source localization method that can perform localization without prior knowledge of the number of sound sources.
arXiv Detail & Related papers (2024-03-26T06:27:50Z) - Nonparametric Evaluation of Noisy ICA Solutions [5.749787074942513]
Independent Component Analysis (ICA) was introduced in the 1980's as a model for Blind Source Separation (BSS)
We develop a nonparametric score to adaptively pick the right algorithm for ICA with arbitrary Gaussian noise.
arXiv Detail & Related papers (2024-01-16T16:18:17Z) - Score-based Source Separation with Applications to Digital Communication
Signals [72.6570125649502]
We propose a new method for separating superimposed sources using diffusion-based generative models.
Motivated by applications in radio-frequency (RF) systems, we are interested in sources with underlying discrete nature.
Our method can be viewed as a multi-source extension to the recently proposed score distillation sampling scheme.
arXiv Detail & Related papers (2023-06-26T04:12:40Z) - Position tracking of a varying number of sound sources with sliding
permutation invariant training [19.873949136858354]
We present a new training strategy for deep learning sound source localization models.
It is based on the mean squared error of the optimal association between estimated and reference positions.
It minimizes identity switches without compromising frame-wise localization accuracy.
arXiv Detail & Related papers (2022-10-26T07:54:47Z) - MIMO-DoAnet: Multi-channel Input and Multiple Outputs DoA Network with
Unknown Number of Sound Sources [56.41687729076406]
Recent neural network based Direction of Arrival (DoA) estimation algorithms have performed well on unknown number of sound sources scenarios.
These algorithms are usually achieved by mapping the multi-channel audio input to the single output (i.e. overall spatial pseudo-spectrum (SPS) of all sources) that is called MISO.
We propose a novel multi-channel input and multiple outputs DoA network called SPS SPIE-DoAnet to address these limitations.
arXiv Detail & Related papers (2022-07-15T06:18:00Z) - Learning based signal detection for MIMO systems with unknown noise
statistics [84.02122699723536]
This paper aims to devise a generalized maximum likelihood (ML) estimator to robustly detect signals with unknown noise statistics.
In practice, there is little or even no statistical knowledge on the system noise, which in many cases is non-Gaussian, impulsive and not analyzable.
Our framework is driven by an unsupervised learning approach, where only the noise samples are required.
arXiv Detail & Related papers (2021-01-21T04:48:15Z) - A Discriminative Technique for Multiple-Source Adaptation [55.5865665284915]
We present a new discriminative technique for the multiple-source adaptation, MSA, problem.
Our solution only requires conditional probabilities that can easily be accurately estimated from unlabeled data from the source domains.
Our experiments with real-world applications further demonstrate that our new discriminative MSA algorithm outperforms the previous generative solution.
arXiv Detail & Related papers (2020-08-25T14:06:15Z) - Resource Allocation via Model-Free Deep Learning in Free Space Optical
Communications [119.81868223344173]
The paper investigates the general problem of resource allocation for mitigating channel fading effects in Free Space Optical (FSO) communications.
Under this framework, we propose two algorithms that solve FSO resource allocation problems.
arXiv Detail & Related papers (2020-07-27T17:38:51Z) - Unsupervised Audio Source Separation using Generative Priors [43.35195236159189]
We propose a novel approach for audio source separation based on generative priors trained on individual sources.
Our approach simultaneously searches in the source-specific latent spaces to effectively recover the constituent sources.
arXiv Detail & Related papers (2020-05-28T03:57:16Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.