Identification of Hypokinetic Dysarthria Using Acoustic Analysis of Poem
Recitation
- URL: http://arxiv.org/abs/2203.09880v1
- Date: Fri, 18 Mar 2022 11:45:01 GMT
- Title: Identification of Hypokinetic Dysarthria Using Acoustic Analysis of Poem
Recitation
- Authors: Jan Mucha, Zoltan Galaz, Jiri Mekyska, Tomas Kiska, Vojtech Zvoncak,
Zdenek Smekal, Ilona Eliasova, Martina Mrackova, Milena Kostalova, Irena
Rektorova, Marcos Faundez-Zanuy and Jesus B. Alonso-Hernandez
- Abstract summary: Up to 90 % of patients with Parkinson's disease (PD) suffer from hypokinetic dysarthria (HD)
We analysed the power of conventional speech features imprecise articulation, dysprosody, speech dysfluency and speech quality deterioration extracted from a specialized poem recitation task to discriminate HD.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Up to 90 % of patients with Parkinson's disease (PD) suffer from hypokinetic
dysarthria (HD). In this work, we analysed the power of conventional speech
features quantifying imprecise articulation, dysprosody, speech dysfluency and
speech quality deterioration extracted from a specialized poem recitation task
to discriminate dysarthric and healthy speech. For this purpose, 152 speakers
(53 healthy speakers, 99 PD patients) were examined. Only mildly strong
correlation between speech features and clinical status of the speakers was
observed. In the case of univariate classification analysis, sensitivity of
62.63% (imprecise articulation), 61.62% (dysprosody), 71.72% (speech
dysfluency) and 59.60% (speech quality deterioration) was achieved.
Multivariate classification analysis improved the classification performance.
Sensitivity of 83.42% using only two features describing imprecise articulation
and speech quality deterioration in HD was achieved. We showed the promising
potential of the selected speech features and especially the use of poem
recitation task to quantify and identify HD in PD.
Related papers
- Perceiver-Prompt: Flexible Speaker Adaptation in Whisper for Chinese Disordered Speech Recognition [40.44769351506048]
Perceiver-Prompt is a method for speaker adaptation that utilizes P-Tuning on the Whisper large-scale model.
We first fine-tune Whisper using LoRA and then integrate a trainable Perceiver to generate fixed-length speaker prompts from variable-length inputs.
arXiv Detail & Related papers (2024-06-14T09:36:46Z) - Perceptual Features as Markers of Parkinson's Disease: The Issue of
Clinical Interpretability [0.0]
Up to 90% of patients with Parkinson's disease (PD) suffer from hypokinetic dysathria (HD)
This paper provides large and robust insight into analysis of 5 Czech vowels of 84 PD patients.
arXiv Detail & Related papers (2022-03-21T09:46:48Z) - Speaker Adaptation Using Spectro-Temporal Deep Features for Dysarthric
and Elderly Speech Recognition [48.33873602050463]
Speaker adaptation techniques play a key role in personalization of ASR systems for such users.
Motivated by the spectro-temporal level differences between dysarthric, elderly and normal speech.
Novel spectrotemporal subspace basis deep embedding features derived using SVD speech spectrum.
arXiv Detail & Related papers (2022-02-21T15:11:36Z) - Speaker Identity Preservation in Dysarthric Speech Reconstruction by
Adversarial Speaker Adaptation [59.41186714127256]
Dysarthric speech reconstruction (DSR) aims to improve the quality of dysarthric speech.
Speaker encoder (SE) optimized for speaker verification has been explored to control the speaker identity.
We propose a novel multi-task learning strategy, i.e., adversarial speaker adaptation (ASA)
arXiv Detail & Related papers (2022-02-18T08:59:36Z) - Investigation of Data Augmentation Techniques for Disordered Speech
Recognition [69.50670302435174]
This paper investigates a set of data augmentation techniques for disordered speech recognition.
Both normal and disordered speech were exploited in the augmentation process.
The final speaker adapted system constructed using the UASpeech corpus and the best augmentation approach based on speed perturbation produced up to 2.92% absolute word error rate (WER)
arXiv Detail & Related papers (2022-01-14T17:09:22Z) - Spectro-Temporal Deep Features for Disordered Speech Assessment and
Recognition [65.25325641528701]
Motivated by the spectro-temporal level differences between disordered and normal speech that systematically manifest in articulatory imprecision, decreased volume and clarity, slower speaking rates and increased dysfluencies, novel spectro-temporal subspace basis embedding deep features derived by SVD decomposition of speech spectrum are proposed.
Experiments conducted on the UASpeech corpus suggest the proposed spectro-temporal deep feature adapted systems consistently outperformed baseline i- adaptation by up to 263% absolute (8.6% relative) reduction in word error rate (WER) with or without data augmentation.
arXiv Detail & Related papers (2022-01-14T16:56:43Z) - The Phonetic Footprint of Parkinson's Disease [16.64383793837174]
Parkinson's disease (PD) has a significant impact on the fine motor skills of patients.
Characteristic patterns such as vowel instability, slurred pronunciation and slow speech can often be observed in the affected individuals.
We used a phonetic recognizer trained exclusively on healthy speech data to investigate how PD affected the phonetic footprint of patients.
arXiv Detail & Related papers (2021-12-21T20:44:21Z) - A Preliminary Study of a Two-Stage Paradigm for Preserving Speaker
Identity in Dysarthric Voice Conversion [50.040466658605524]
We propose a new paradigm for maintaining speaker identity in dysarthric voice conversion (DVC)
The poor quality of dysarthric speech can be greatly improved by statistical VC.
But as the normal speech utterances of a dysarthria patient are nearly impossible to collect, previous work failed to recover the individuality of the patient.
arXiv Detail & Related papers (2021-06-02T18:41:03Z) - Parkinsonian Chinese Speech Analysis towards Automatic Classification of
Parkinson's Disease [31.431256876809343]
Speech disorders often occur at the early stage of Parkinson's disease (PD)
We constructed a new speech corpus of Mandarin Chinese and addressed classification of patients with PD.
Our classification accuracy significantly surpassed state-of-the-art studies.
arXiv Detail & Related papers (2021-05-31T04:51:44Z) - Bulbar ALS Detection Based on Analysis of Voice Perturbation and Vibrato [68.97335984455059]
The purpose of this work was to verify the sutability of the sustain vowel phonation test for automatic detection of patients with ALS.
We proposed enhanced procedure for separation of voice signal into fundamental periods that requires for calculation of measurements.
arXiv Detail & Related papers (2020-03-24T12:49:25Z) - Identification of Dementia Using Audio Biomarkers [15.740689461116762]
The objective of this work is to use speech processing and machine learning techniques to automatically identify the stage of dementia.
Non-linguistic acoustic parameters are used for this purpose, making this a language independent approach.
We analyze the contribution of various types of acoustic features such as spectral, temporal, cepstral their feature-level fusion and selection towards the identification of dementia stage.
arXiv Detail & Related papers (2020-02-27T13:54:00Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.