The Phonetic Footprint of Parkinson's Disease
- URL: http://arxiv.org/abs/2112.11514v1
- Date: Tue, 21 Dec 2021 20:44:21 GMT
- Title: The Phonetic Footprint of Parkinson's Disease
- Authors: Philipp Klumpp, Tom\'as Arias-Vergara, Juan Camilo V\'asquez-Correa,
Paula Andrea P\'erez-Toro, Juan Rafael Orozco-Arroyave, Anton Batliner, Elmar
N\"oth
- Abstract summary: Parkinson's disease (PD) has a significant impact on the fine motor skills of patients.
Characteristic patterns such as vowel instability, slurred pronunciation and slow speech can often be observed in the affected individuals.
We used a phonetic recognizer trained exclusively on healthy speech data to investigate how PD affected the phonetic footprint of patients.
- Score: 16.64383793837174
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: As one of the most prevalent neurodegenerative disorders, Parkinson's disease
(PD) has a significant impact on the fine motor skills of patients. The complex
interplay of different articulators during speech production and realization of
required muscle tension become increasingly difficult, thus leading to a
dysarthric speech. Characteristic patterns such as vowel instability, slurred
pronunciation and slow speech can often be observed in the affected individuals
and were analyzed in previous studies to determine the presence and progression
of PD. In this work, we used a phonetic recognizer trained exclusively on
healthy speech data to investigate how PD affected the phonetic footprint of
patients. We rediscovered numerous patterns that had been described in previous
contributions although our system had never seen any pathological speech
previously. Furthermore, we could show that intermediate activations from the
neural network could serve as feature vectors encoding information related to
the disease state of individuals. We were also able to directly correlate the
expert-rated intelligibility of a speaker with the mean confidence of phonetic
predictions. Our results support the assumption that pathological data is not
necessarily required to train systems that are capable of analyzing PD speech.
Related papers
- Self-supervised Speech Models for Word-Level Stuttered Speech Detection [66.46810024006712]
We introduce a word-level stuttering speech detection model leveraging self-supervised speech models.
Our evaluation demonstrates that our model surpasses previous approaches in word-level stuttering speech detection.
arXiv Detail & Related papers (2024-09-16T20:18:20Z) - Selfsupervised learning for pathological speech detection [0.0]
Speech production is susceptible to influence and disruption by various neurodegenerative pathological speech disorders.
These disorders lead to pathological speech characterized by abnormal speech patterns and imprecise articulation.
Unlike neurotypical speakers, patients with speech pathologies or impairments are unable to access various virtual assistants such as Alexa, Siri, etc.
arXiv Detail & Related papers (2024-05-16T07:12:47Z) - Exploring Speech Pattern Disorders in Autism using Machine Learning [12.469348589699766]
This study presents a comprehensive approach to identify distinctive speech patterns through the analysis of examiner-patient dialogues.
We extracted 40 speech-related features, categorized into frequency, zero-crossing rate, energy, spectral characteristics, Mel Frequency Cepstral Coefficients (MFCCs) and balance.
The classification model aimed to differentiate between ASD and non-ASD cases, achieving an accuracy of 87.75%.
arXiv Detail & Related papers (2024-05-03T02:59:15Z) - NeuroVoz: a Castillian Spanish corpus of parkinsonian speech [36.23298373892936]
NeuroVoz is composed by 2,903 audio recordings averaging $26.88 pm 3.35$ recordings per participant.
This dataset has already underpinned several studies, achieving a benchmark accuracy of 89% in PD speech pattern identification.
arXiv Detail & Related papers (2024-03-04T16:17:39Z) - Decoding speech perception from non-invasive brain recordings [48.46819575538446]
We introduce a model trained with contrastive-learning to decode self-supervised representations of perceived speech from non-invasive recordings.
Our model can identify, from 3 seconds of MEG signals, the corresponding speech segment with up to 41% accuracy out of more than 1,000 distinct possibilities.
arXiv Detail & Related papers (2022-08-25T10:01:43Z) - Investigation of Data Augmentation Techniques for Disordered Speech
Recognition [69.50670302435174]
This paper investigates a set of data augmentation techniques for disordered speech recognition.
Both normal and disordered speech were exploited in the augmentation process.
The final speaker adapted system constructed using the UASpeech corpus and the best augmentation approach based on speed perturbation produced up to 2.92% absolute word error rate (WER)
arXiv Detail & Related papers (2022-01-14T17:09:22Z) - Perception Point: Identifying Critical Learning Periods in Speech for
Bilingual Networks [58.24134321728942]
We compare and identify cognitive aspects on deep neural-based visual lip-reading models.
We observe a strong correlation between these theories in cognitive psychology and our unique modeling.
arXiv Detail & Related papers (2021-10-13T05:30:50Z) - A Longitudinal Multi-modal Dataset for Dementia Monitoring and Diagnosis [22.672055089496972]
We introduce a novel fine-grained longitudinal multi-modal corpus collected from healthy controls and people with dementia.
The corpus consists of spoken conversations, a subset of which are transcribed, as well as typed and written thoughts and associated extra-linguistic information.
arXiv Detail & Related papers (2021-09-03T14:02:12Z) - A Preliminary Study of a Two-Stage Paradigm for Preserving Speaker
Identity in Dysarthric Voice Conversion [50.040466658605524]
We propose a new paradigm for maintaining speaker identity in dysarthric voice conversion (DVC)
The poor quality of dysarthric speech can be greatly improved by statistical VC.
But as the normal speech utterances of a dysarthria patient are nearly impossible to collect, previous work failed to recover the individuality of the patient.
arXiv Detail & Related papers (2021-06-02T18:41:03Z) - Parkinsonian Chinese Speech Analysis towards Automatic Classification of
Parkinson's Disease [31.431256876809343]
Speech disorders often occur at the early stage of Parkinson's disease (PD)
We constructed a new speech corpus of Mandarin Chinese and addressed classification of patients with PD.
Our classification accuracy significantly surpassed state-of-the-art studies.
arXiv Detail & Related papers (2021-05-31T04:51:44Z) - Silent Speech Interfaces for Speech Restoration: A Review [59.68902463890532]
Silent speech interface (SSI) research aims to provide alternative and augmentative communication methods for persons with severe speech disorders.
SSIs rely on non-acoustic biosignals generated by the human body during speech production to enable communication.
Most present-day SSIs have only been validated in laboratory settings for healthy users.
arXiv Detail & Related papers (2020-09-04T11:05:50Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.