Formant Tracking Using Quasi-Closed Phase Forward-Backward Linear
Prediction Analysis and Deep Neural Networks
- URL: http://arxiv.org/abs/2201.01525v1
- Date: Wed, 5 Jan 2022 10:27:07 GMT
- Title: Formant Tracking Using Quasi-Closed Phase Forward-Backward Linear
Prediction Analysis and Deep Neural Networks
- Authors: Dhananjaya Gowda, Bajibabu Bollepalli, Sudarsana Reddy Kadiri, Paavo
Alku
- Abstract summary: Formant tracking is investigated by using trackers based on dynamic programming (DP) and deep neural nets (DNNs)
The six methods include linear prediction (LP) algorithms, weighted LP algorithms and the recently developed quasi-closed phase forward-backward (QCP-FB) method.
A novel formant tracking approach, which combines benefits of deep learning and signal processing based on QCP-FB, was proposed.
- Score: 48.98397553726019
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Formant tracking is investigated in this study by using trackers based on
dynamic programming (DP) and deep neural nets (DNNs). Using the DP approach,
six formant estimation methods were first compared. The six methods include
linear prediction (LP) algorithms, weighted LP algorithms and the recently
developed quasi-closed phase forward-backward (QCP-FB) method. QCP-FB gave the
best performance in the comparison. Therefore, a novel formant tracking
approach, which combines benefits of deep learning and signal processing based
on QCP-FB, was proposed. In this approach, the formants predicted by a
DNN-based tracker from a speech frame are refined using the peaks of the
all-pole spectrum computed by QCP-FB from the same frame. Results show that the
proposed DNN-based tracker performed better both in detection rate and
estimation error for the lowest three formants compared to reference formant
trackers. Compared to the popular Wavesurfer, for example, the proposed tracker
gave a reduction of 29%, 48% and 35% in the estimation error for the lowest
three formants, respectively.
Related papers
- Unrolled denoising networks provably learn optimal Bayesian inference [54.79172096306631]
We prove the first rigorous learning guarantees for neural networks based on unrolling approximate message passing (AMP)
For compressed sensing, we prove that when trained on data drawn from a product prior, the layers of the network converge to the same denoisers used in Bayes AMP.
arXiv Detail & Related papers (2024-09-19T17:56:16Z) - Time-Varying Quasi-Closed-Phase Analysis for Accurate Formant Tracking
in Speech Signals [17.69029813982043]
We propose a new method for the accurate estimation and tracking of formants in speech signals.
TVQCP analysis combines three approaches to improve formant estimation and tracking.
The proposed TVQCP method performs better than conventional and popular formant tracking tools.
arXiv Detail & Related papers (2023-08-31T08:30:20Z) - Quantitative Susceptibility Mapping through Model-based Deep Image Prior
(MoDIP) [10.230055884828445]
We propose a training-free model-based unsupervised method called MoDIP (Model-based Deep Image Prior)
MoDIP comprises a small, untrained network and a Data Fidelity Optimization (DFO) module.
It is 33% more computationally efficient and runs 4 times faster than conventional DIP-based approaches.
arXiv Detail & Related papers (2023-08-18T11:07:39Z) - Refining a Deep Learning-based Formant Tracker using Linear Prediction
Methods [19.88212227822267]
Two refined DeepFormants trackers were compared with the original DeepFormants and with five known traditional trackers.
The results indicated that the data-driven DeepFormants trackers outperformed the conventional trackers and that the best performance was obtained by refining the formants predicted by DeepFormants using QCP-FB analysis.
arXiv Detail & Related papers (2023-08-17T15:32:32Z) - The Cascaded Forward Algorithm for Neural Network Training [61.06444586991505]
We propose a new learning framework for neural networks, namely Cascaded Forward (CaFo) algorithm, which does not rely on BP optimization as that in FF.
Unlike FF, our framework directly outputs label distributions at each cascaded block, which does not require generation of additional negative samples.
In our framework each block can be trained independently, so it can be easily deployed into parallel acceleration systems.
arXiv Detail & Related papers (2023-03-17T02:01:11Z) - Deep learning via message passing algorithms based on belief propagation [2.931240348160871]
We present a family of BP-based message-passing algorithms with a reinforcement field that biases towards locally entropic distributions.
These algorithms are capable of training multi-layer neural networks with discrete weights and activations with performance comparable to SGD-inspired solutions.
arXiv Detail & Related papers (2021-10-27T16:52:26Z) - Multi-Agent Reinforcement Learning in NOMA-aided UAV Networks for
Cellular Offloading [59.32570888309133]
A novel framework is proposed for cellular offloading with the aid of multiple unmanned aerial vehicles (UAVs)
Non-orthogonal multiple access (NOMA) technique is employed at each UAV to further improve the spectrum efficiency of the wireless network.
A mutual deep Q-network (MDQN) algorithm is proposed to jointly determine the optimal 3D trajectory and power allocation of UAVs.
arXiv Detail & Related papers (2020-10-18T20:22:05Z) - NOMA in UAV-aided cellular offloading: A machine learning approach [59.32570888309133]
A novel framework is proposed for cellular offloading with the aid of multiple unmanned aerial vehicles (UAVs)
Non-orthogonal multiple access (NOMA) technique is employed at each UAV to further improve the spectrum efficiency of the wireless network.
A mutual deep Q-network (MDQN) algorithm is proposed to jointly determine the optimal 3D trajectory and power allocation of UAVs.
arXiv Detail & Related papers (2020-10-18T17:38:48Z) - AIN: Fast and Accurate Sequence Labeling with Approximate Inference
Network [75.44925576268052]
The linear-chain Conditional Random Field (CRF) model is one of the most widely-used neural sequence labeling approaches.
Exact probabilistic inference algorithms are typically applied in training and prediction stages of the CRF model.
We propose to employ a parallelizable approximate variational inference algorithm for the CRF model.
arXiv Detail & Related papers (2020-09-17T12:18:43Z) - Using Machine Learning for Particle Track Identification in the CLAS12
Detector [0.0]
This article describes the development of four machine learning (ML) models that assist the tracking by identifying valid track candidates.
An algorithm with recommended track candidates was implemented as part of the CLAS12 reconstruction software.
The resulting software achieved accuracy of greater than 99% and resulted in an end-to-end speedup of 35% compared to existing algorithms.
arXiv Detail & Related papers (2020-08-28T21:57:38Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.