Deep Residual Echo Suppression with A Tunable Tradeoff Between Signal
Distortion and Echo Suppression
- URL: http://arxiv.org/abs/2106.13531v1
- Date: Fri, 25 Jun 2021 09:49:18 GMT
- Title: Deep Residual Echo Suppression with A Tunable Tradeoff Between Signal
Distortion and Echo Suppression
- Authors: Amir Ivry, Israel Cohen, Baruch Berdugo
- Abstract summary: A UNet neural network maps the outputs of a linear acoustic echo canceler to the desired signal in the spectral domain.
The system employs 136 thousand parameters, and requires 1.6 Giga floating-point operations per second and 10 Mega-bytes of memory.
- Score: 13.558688470594676
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In this paper, we propose a residual echo suppression method using a UNet
neural network that directly maps the outputs of a linear acoustic echo
canceler to the desired signal in the spectral domain. This system embeds a
design parameter that allows a tunable tradeoff between the desired-signal
distortion and residual echo suppression in double-talk scenarios. The system
employs 136 thousand parameters, and requires 1.6 Giga floating-point
operations per second and 10 Mega-bytes of memory. The implementation satisfies
both the timing requirements of the AEC challenge and the computational and
memory limitations of on-device applications. Experiments are conducted with
161~h of data from the AEC challenge database and from real independent
recordings. We demonstrate the performance of the proposed system in real-life
conditions and compare it with two competing methods regarding echo suppression
and desired-signal distortion, generalization to various environments, and
robustness to high echo levels.
Related papers
- RF Challenge: The Data-Driven Radio Frequency Signal Separation Challenge [66.33067693672696]
This paper addresses the critical problem of interference rejection in radio-frequency (RF) signals using a novel, data-driven approach.
First, we present an insightful signal model that serves as a foundation for developing and analyzing interference rejection algorithms.
Second, we introduce the RF Challenge, a publicly available dataset featuring diverse RF signals along with code templates.
Third, we propose novel AI-based rejection algorithms, specifically architectures like UNet and WaveNet, and evaluate their performance across eight different signal mixture types.
arXiv Detail & Related papers (2024-09-13T13:53:41Z) - DGSD: Dynamical Graph Self-Distillation for EEG-Based Auditory Spatial
Attention Detection [49.196182908826565]
Auditory Attention Detection (AAD) aims to detect target speaker from brain signals in a multi-speaker environment.
Current approaches primarily rely on traditional convolutional neural network designed for processing Euclidean data like images.
This paper proposes a dynamical graph self-distillation (DGSD) approach for AAD, which does not require speech stimuli as input.
arXiv Detail & Related papers (2023-09-07T13:43:46Z) - Simultaneous source separation of unknown numbers of single-channel underwater acoustic signals based on deep neural networks with separator-decoder structure [0.0]
We propose a deep learning-based simultaneous separating solution with a fixed number of output channels equal to the maximum number of possible targets.
This solution avoids the dimensional disaster caused by the permutation problem induced by the alignment of outputs to targets.
Experiments conducted on simulated mixtures of radiated ship noise show that the proposed solution can achieve similar separation performance to that attained with a known number of signals.
arXiv Detail & Related papers (2022-07-24T14:04:34Z) - End-to-End Complex-Valued Multidilated Convolutional Neural Network for
Joint Acoustic Echo Cancellation and Noise Suppression [25.04740291728234]
In this paper, we exploit the offset-compensating ability of complex time-frequency masks and propose an end-to-end complex neural network architecture.
We also propose a dual-mask technique for joint echo and noise suppression with simultaneous speech enhancement.
arXiv Detail & Related papers (2021-10-02T07:41:41Z) - Energy-Efficient Model Compression and Splitting for Collaborative
Inference Over Time-Varying Channels [52.60092598312894]
We propose a technique to reduce the total energy bill at the edge device by utilizing model compression and time-varying model split between the edge and remote nodes.
Our proposed solution results in minimal energy consumption and $CO$ emission compared to the considered baselines.
arXiv Detail & Related papers (2021-06-02T07:36:27Z) - Discriminative Singular Spectrum Classifier with Applications on
Bioacoustic Signal Recognition [67.4171845020675]
We present a bioacoustic signal classifier equipped with a discriminative mechanism to extract useful features for analysis and classification efficiently.
Unlike current bioacoustic recognition methods, which are task-oriented, the proposed model relies on transforming the input signals into vector subspaces.
The validity of the proposed method is verified using three challenging bioacoustic datasets containing anuran, bee, and mosquito species.
arXiv Detail & Related papers (2021-03-18T11:01:21Z) - Conditioning Trick for Training Stable GANs [70.15099665710336]
We propose a conditioning trick, called difference departure from normality, applied on the generator network in response to instability issues during GAN training.
We force the generator to get closer to the departure from normality function of real samples computed in the spectral domain of Schur decomposition.
arXiv Detail & Related papers (2020-10-12T16:50:22Z) - Residual acoustic echo suppression based on efficient multi-task
convolutional neural network [0.0]
We propose a real-time residual acoustic echo suppression (RAES) method using an efficient convolutional neural network.
The training criterion is based on a novel loss function, which we call as the suppression loss, to balance the suppression of residual echo and the distortion of near-end signals.
arXiv Detail & Related papers (2020-09-29T11:26:25Z) - Nonlinear Residual Echo Suppression Based on Multi-stream Conv-TasNet [22.56178941790508]
We propose a residual echo suppression method based on the modification of fully convolutional time-domain audio separation network (Conv-TasNet)
Both the residual signal of the linear acoustic echo cancellation system, and the output of the adaptive filter are adopted to form multiple streams for the Conv-TasNet.
arXiv Detail & Related papers (2020-05-15T16:41:16Z) - Temporal-Spatial Neural Filter: Direction Informed End-to-End
Multi-channel Target Speech Separation [66.46123655365113]
Target speech separation refers to extracting the target speaker's speech from mixed signals.
Two main challenges are the complex acoustic environment and the real-time processing requirement.
We propose a temporal-spatial neural filter, which directly estimates the target speech waveform from multi-speaker mixture.
arXiv Detail & Related papers (2020-01-02T11:12:50Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.