Online neural fusion of distortionless differential beamformers for robust speech enhancement
- URL: http://arxiv.org/abs/2510.24497v1
- Date: Tue, 28 Oct 2025 15:12:48 GMT
- Title: Online neural fusion of distortionless differential beamformers for robust speech enhancement
- Authors: Yuanhang Qian, Kunlong Zhao, Jilu Jin, Xueqin Luo, Gongping Huang, Jingdong Chen, Jacob Benesty,
- Abstract summary: adaptive convex combination (ACC) algorithms have been introduced, where the outputs of multiple fixed beamformers are linearly combined to improve robustness.<n>ACC often fails in highly non-stationary scenarios, such as rapidly moving interference, since its adaptive updates cannot reliably track rapid changes.<n>We propose a frame-online neural fusion framework for multiple distortionless differential beamformers, which estimates the combination weights through a neural network.
- Score: 50.063071950377086
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Fixed beamforming is widely used in practice since it does not depend on the estimation of noise statistics and provides relatively stable performance. However, a single beamformer cannot adapt to varying acoustic conditions, which limits its interference suppression capability. To address this, adaptive convex combination (ACC) algorithms have been introduced, where the outputs of multiple fixed beamformers are linearly combined to improve robustness. Nevertheless, ACC often fails in highly non-stationary scenarios, such as rapidly moving interference, since its adaptive updates cannot reliably track rapid changes. To overcome this limitation, we propose a frame-online neural fusion framework for multiple distortionless differential beamformers, which estimates the combination weights through a neural network. Compared with conventional ACC, the proposed method adapts more effectively to dynamic acoustic environments, achieving stronger interference suppression while maintaining the distortionless constraint.
Related papers
- Adaptive Benign Overfitting (ABO): Overparameterized RLS for Online Learning in Non-stationary Time-series [0.0]
ABO is highly accurate (comparable to baseline kernel methods) while achieving speed improvements of between 20 and 40 percent.<n>Results provide a unified view linking adaptive filtering, kernel approximation, and benign overfitting within a stable online learning framework.
arXiv Detail & Related papers (2026-01-29T15:58:01Z) - Test-time Adaptive Hierarchical Co-enhanced Denoising Network for Reliable Multimodal Classification [55.56234913868664]
We propose Test-time Adaptive Hierarchical Co-enhanced Denoising Network (TAHCD) for reliable learning on multimodal data.<n>The proposed method achieves superior classification performance, robustness, and generalization compared with state-of-the-art reliable multimodal learning approaches.
arXiv Detail & Related papers (2026-01-12T03:14:12Z) - Iterative Inference-time Scaling with Adaptive Frequency Steering for Image Super-Resolution [75.3690742776891]
We propose Iterative Diffusion Inference-Time Scaling with Adaptive Frequency Steering (IAFS)<n>IAFS addresses the challenge of balancing perceptual quality and structural fidelity by progressively refining the generated image through iterative correction of structural deviations.<n>Experiments show that IAFS effectively resolves the perception-fidelity conflict, yielding consistently improved perceptual detail and structural accuracy, and outperforming existing inference-time scaling methods.
arXiv Detail & Related papers (2025-12-29T15:09:20Z) - Neural Beamforming with Doppler-Aware Sparse Attention for High Mobility Environments [35.90167049594104]
Deep learning-based beamforming offers an alternative with nonlinear mappings from channel state information to beamforming weights.<n> Transformer-based models are particularly effective due to their ability to model long-range dependencies across time and frequency.<n>Recent studies address this issue through sparse attention mechanisms that reduce complexity while maintaining, yet often employ patterns that disregard channel dynamics.
arXiv Detail & Related papers (2025-11-05T16:51:26Z) - BADiff: Bandwidth Adaptive Diffusion Model [55.10134744772338]
Traditional diffusion models produce high-fidelity images by performing a fixed number of denoising steps, regardless of downstream transmission limitations.<n>In practical cloud-to-device scenarios, limited bandwidth often necessitates heavy compression, leading to loss of fine textures and wasted computation.<n>We introduce a joint end-to-end training strategy where the diffusion model is conditioned on a target quality level derived from the available bandwidth.
arXiv Detail & Related papers (2025-10-24T11:50:03Z) - Adaptive Nonlinear Vector Autoregression: Robust Forecasting for Noisy Chaotic Time Series [0.0]
vector autoregression and reservoir computing have shown promise in forecasting chaotic dynamical systems.<n>We propose an adaptive N model that combines delay-embedded linear inputs with features generated by a shallow, learnable multi-layer perceptron.
arXiv Detail & Related papers (2025-07-11T16:40:10Z) - Diffusion-Based Limited-Angle CT Reconstruction under Noisy Conditions [10.287171164361608]
Missing angular projections lead to incomplete sinograms and artifacts in reconstructed images.<n>We propose a diffusion-based framework that completes missing angular views using a Mean-Reverting Differential Equation (MR-SDE) formulation.<n>To improve robustness under realistic noise, we propose a novel noise-aware mechanism that explicitly models inference-time uncertainty.
arXiv Detail & Related papers (2025-07-08T03:58:52Z) - A TRPCA-Inspired Deep Unfolding Network for Hyperspectral Image Denoising via Thresholded t-SVD and Top-K Sparse Transformer [20.17660504535571]
We propose a novel deep unfolding network (DU-TRPCA) that enforces stage-wise alternation between two tightly integrated modules: low-rank and sparse.<n>Experiments on synthetic and real-world HSIs demonstrate that DU-TRPCA surpasses state-of-the-art methods under severe mixed noise.
arXiv Detail & Related papers (2025-06-03T02:01:39Z) - Layered KIK quantum error mitigation for dynamic circuits and error correction [0.0]
Quantum Error Mitigation is essential for enhancing the reliability of quantum computing experiments.<n>The adaptive KIK error mitigation method has demonstrated significant advantages, including resilience to noise parameter time-drift.<n>However, its reliance on global noise amplification introduces limitations, such as incompatibility with mid-circuit measurements.<n>We propose a layer-based noise amplification approach that overcomes these challenges without incurring additional overhead or experimental complexity.
arXiv Detail & Related papers (2025-04-16T19:47:11Z) - One-Step Diffusion Model for Image Motion-Deblurring [85.76149042561507]
We propose a one-step diffusion model for deblurring (OSDD), a novel framework that reduces the denoising process to a single step.<n>To tackle fidelity loss in diffusion models, we introduce an enhanced variational autoencoder (eVAE), which improves structural restoration.<n>Our method achieves strong performance on both full and no-reference metrics.
arXiv Detail & Related papers (2025-03-09T09:39:57Z) - HarmonICA: Neural non-stationarity correction and source separation for motor neuron interfaces [37.0279893661798]
We propose a potential solution, using an unsupervised learning algorithm to blindly correct for the effects of latent processes which drive the signal non-stationarities.
The proposed design, HarmonICA, sidesteps the identifiability problems of nonlinear ICA.
We test HarmonICA on both invasive and non-invasive recordings both simulated and real.
arXiv Detail & Related papers (2024-06-28T00:08:13Z) - Stable Neighbor Denoising for Source-free Domain Adaptive Segmentation [91.83820250747935]
Pseudo-label noise is mainly contained in unstable samples in which predictions of most pixels undergo significant variations during self-training.
We introduce the Stable Neighbor Denoising (SND) approach, which effectively discovers highly correlated stable and unstable samples.
SND consistently outperforms state-of-the-art methods in various SFUDA semantic segmentation settings.
arXiv Detail & Related papers (2024-06-10T21:44:52Z) - Reliable Beamforming at Terahertz Bands: Are Causal Representations the
Way Forward? [85.06664206117088]
Multi-user wireless systems can meet metaverse requirements by utilizing terahertz bandwidth with massive number of antennas.
Existing solutions lack proper modeling of channel dynamics, resulting in inaccurate beamforming solutions in high-mobility scenarios.
Herein, a dynamic, semantically aware beamforming solution is proposed for the first time, utilizing novel artificial intelligence algorithms in variational causal inference.
arXiv Detail & Related papers (2023-03-14T16:02:46Z) - Improve Noise Tolerance of Robust Loss via Noise-Awareness [60.34670515595074]
We propose a meta-learning method which is capable of adaptively learning a hyper parameter prediction function, called Noise-Aware-Robust-Loss-Adjuster (NARL-Adjuster for brevity)
Four SOTA robust loss functions are attempted to be integrated with our algorithm, and comprehensive experiments substantiate the general availability and effectiveness of the proposed method in both its noise tolerance and performance.
arXiv Detail & Related papers (2023-01-18T04:54:58Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.