A Two-Stage Hierarchical Deep Filtering Framework for Real-Time Speech Enhancement
- URL: http://arxiv.org/abs/2506.01023v1
- Date: Sun, 01 Jun 2025 14:09:27 GMT
- Title: A Two-Stage Hierarchical Deep Filtering Framework for Real-Time Speech Enhancement
- Authors: Shenghui Lu, Hukai Huang, Jinanglong Yao, Kaidi Wang, Qingyang Hong, Lin Li,
- Abstract summary: This paper proposes a model that integrates sub-band processing and deep filtering to exploit information from the target time-frequency (TF) bin and its surrounding TF bins for single-channel speech enhancement.<n>The sub-band module captures surrounding frequency bin information at the input, while the deep filtering module applies filtering at the output to both the target TF bin and its surrounding TF bins.
- Score: 12.634298353225455
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: This paper proposes a model that integrates sub-band processing and deep filtering to fully exploit information from the target time-frequency (TF) bin and its surrounding TF bins for single-channel speech enhancement. The sub-band module captures surrounding frequency bin information at the input, while the deep filtering module applies filtering at the output to both the target TF bin and its surrounding TF bins. To further improve the model performance, we decouple deep filtering into temporal and frequency components and introduce a two-stage framework, reducing the complexity of filter coefficient prediction at each stage. Additionally, we propose the TAConv module to strengthen convolutional feature extraction. Experimental results demonstrate that the proposed hierarchical deep filtering network (HDF-Net) effectively utilizes surrounding TF bin information and outperforms other advanced systems while using fewer resources.
Related papers
- Understanding Dataset Distillation via Spectral Filtering [69.07076441512612]
We introduce UniDD, a spectral filtering framework that unifies diverse DD objectives.<n>UniDD reveals that the essence of DD fundamentally lies in matching frequency-specific features.<n>To address this limitation, we propose Curriculum Frequency Matching (CFM), which gradually adjusts the filter parameter to cover both low- and high-frequency information.
arXiv Detail & Related papers (2025-03-03T06:22:34Z) - FilterNet: Harnessing Frequency Filters for Time Series Forecasting [34.83702192033196]
FilterNet is built upon our proposed learnable frequency filters to extract key informative temporal patterns by selectively passing or attenuating certain components of time series signals.
equipped with the two filters, FilterNet can approximately surrogate the linear and attention mappings widely adopted in time series literature.
arXiv Detail & Related papers (2024-11-03T16:20:41Z) - Multi-Mode Array Filtering of Resonance Fluorescence [0.0]
We present a novel frequency-filtering method for measuring and calculating frequency-filtered photon-correlations.
By introducing a mode-dependent phase modulation, we produce a near rectangular frequency response.
We present results for the single-mode and multi-mode-array filtered second-order auto- and cross-correlation functions.
arXiv Detail & Related papers (2024-05-06T23:07:19Z) - Decomposed Guided Dynamic Filters for Efficient RGB-Guided Depth
Completion [46.04264366475848]
RGB-guided depth completion aims at predicting dense depth maps from sparse depth measurements and corresponding RGB images.
Guided dynamic filters generate spatially-variant depth-wise separable convolutional filters from RGB features to guide depth features.
We propose to decompose the guided dynamic filters into a spatially-shared component multiplied by content-adaptive adaptors at each spatial location.
arXiv Detail & Related papers (2023-09-05T08:37:58Z) - Filter Pruning for Efficient CNNs via Knowledge-driven Differential
Filter Sampler [103.97487121678276]
Filter pruning simultaneously accelerates the computation and reduces the memory overhead of CNNs.
We propose a novel Knowledge-driven Differential Filter Sampler(KDFS) with Masked Filter Modeling(MFM) framework for filter pruning.
arXiv Detail & Related papers (2023-07-01T02:28:41Z) - Focus Your Attention (with Adaptive IIR Filters) [62.80628327613344]
We present a new layer in which dynamic (i.e.,input-dependent) Infinite Impulse Response (IIR) filters of order two are used to process the input sequence.
Despite their relatively low order, the causal adaptive filters are shown to focus attention on the relevant sequence elements.
arXiv Detail & Related papers (2023-05-24T09:42:30Z) - Adaptive Frequency Learning in Two-branch Face Forgery Detection [66.91715092251258]
We propose Adaptively learn Frequency information in the two-branch Detection framework, dubbed AFD.
We liberate our network from the fixed frequency transforms, and achieve better performance with our data- and task-dependent transform layers.
arXiv Detail & Related papers (2022-03-27T14:25:52Z) - Filter-enhanced MLP is All You Need for Sequential Recommendation [89.0974365344997]
In online platforms, logged user behavior data is inevitable to contain noise.
We borrow the idea of filtering algorithms from signal processing that attenuates the noise in the frequency domain.
We propose textbfFMLP-Rec, an all-MLP model with learnable filters for sequential recommendation task.
arXiv Detail & Related papers (2022-02-28T05:49:35Z) - DeepFilterNet: A Low Complexity Speech Enhancement Framework for
Full-Band Audio based on Deep Filtering [9.200520879361916]
We propose DeepFilterNet, a two stage speech enhancement framework utilizing deep filtering.
First, we enhance the spectral envelope using ERB-scaled gains modeling the human frequency perception.
The second stage employs deep filtering to enhance the periodic components of speech.
arXiv Detail & Related papers (2021-10-11T20:03:52Z) - Unsharp Mask Guided Filtering [53.14430987860308]
The goal of this paper is guided image filtering, which emphasizes the importance of structure transfer during filtering.
We propose a new and simplified formulation of the guided filter inspired by unsharp masking.
Our formulation enjoys a filtering prior to a low-pass filter and enables explicit structure transfer by estimating a single coefficient.
arXiv Detail & Related papers (2021-06-02T19:15:34Z) - On Filter Generalization for Music Bandwidth Extension Using Deep Neural
Networks [0.40611352512781856]
We formulate the bandwidth extension problem using deep neural networks, where a band-limited signal is provided as input to the network.
Our main contribution centers on the impact of the choice of low pass filter when training and subsequently testing the network.
We propose a data augmentation strategy which utilizes multiple low pass filters during training and leads to improved generalization to unseen filtering conditions at test time.
arXiv Detail & Related papers (2020-11-14T11:41:28Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.