Parkinson's disease diagnostics using AI and natural language knowledge
transfer
- URL: http://arxiv.org/abs/2204.12559v1
- Date: Tue, 26 Apr 2022 19:39:29 GMT
- Title: Parkinson's disease diagnostics using AI and natural language knowledge
transfer
- Authors: Maurycy Chronowski, Maciej Klaczynski, Malgorzata Dec-Cwiek, Karolina
Porebska
- Abstract summary: Deep learning approach for classification of raw speech recordings in patients with diagnosed PD was proposed.
Method was tested on a group of 38 PD patients and 10 healthy persons above the age of 50.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In this work, the issue of Parkinson's disease (PD) diagnostics using
non-invasive antemortem techniques was tackled. A deep learning approach for
classification of raw speech recordings in patients with diagnosed PD was
proposed. The core of proposed method is an audio classifier using knowledge
transfer from a pretrained natural language model, namely \textit{wav2vec 2.0}.
Method was tested on a group of 38 PD patients and 10 healthy persons above the
age of 50. A dataset of speech recordings acquired using a smartphone recorder
was constructed and the recordings were label as PD/non-PD with severity of the
disease additionally rated using Hoehn-Yahr scale. The audio recordings were
cut into 2141 samples that include sentences, syllables, vowels and sustained
phonation. The classifier scores up to 97.92\% of cross-validated accuracy.
Additionally, paper presents results of a human-level performance assessment
questionnaire, which was consulted with the neurology professionals
Related papers
- Self-supervised Speech Models for Word-Level Stuttered Speech Detection [66.46810024006712]
We introduce a word-level stuttering speech detection model leveraging self-supervised speech models.
Our evaluation demonstrates that our model surpasses previous approaches in word-level stuttering speech detection.
arXiv Detail & Related papers (2024-09-16T20:18:20Z) - NeuroVoz: a Castillian Spanish corpus of parkinsonian speech [36.23298373892936]
NeuroVoz is composed by 2,903 audio recordings averaging $26.88 pm 3.35$ recordings per participant.
This dataset has already underpinned several studies, achieving a benchmark accuracy of 89% in PD speech pattern identification.
arXiv Detail & Related papers (2024-03-04T16:17:39Z) - Automatically measuring speech fluency in people with aphasia: first
achievements using read-speech data [55.84746218227712]
This study aims at assessing the relevance of a signalprocessingalgorithm, initially developed in the field of language acquisition, for the automatic measurement of speech fluency.
arXiv Detail & Related papers (2023-08-09T07:51:40Z) - Exploring Multimodal Approaches for Alzheimer's Disease Detection Using
Patient Speech Transcript and Audio Data [10.782153332144533]
Alzheimer's disease (AD) is a common form of dementia that severely impacts patient health.
This study investigates various methods for detecting AD using patients' speech and transcripts data from the DementiaBank Pitt database.
arXiv Detail & Related papers (2023-07-05T12:40:11Z) - Exploiting prompt learning with pre-trained language models for
Alzheimer's Disease detection [70.86672569101536]
Early diagnosis of Alzheimer's disease (AD) is crucial in facilitating preventive care and to delay further progression.
This paper investigates the use of prompt-based fine-tuning of PLMs that consistently uses AD classification errors as the training objective function.
arXiv Detail & Related papers (2022-10-29T09:18:41Z) - Decoding speech perception from non-invasive brain recordings [48.46819575538446]
We introduce a model trained with contrastive-learning to decode self-supervised representations of perceived speech from non-invasive recordings.
Our model can identify, from 3 seconds of MEG signals, the corresponding speech segment with up to 41% accuracy out of more than 1,000 distinct possibilities.
arXiv Detail & Related papers (2022-08-25T10:01:43Z) - Investigation of Data Augmentation Techniques for Disordered Speech
Recognition [69.50670302435174]
This paper investigates a set of data augmentation techniques for disordered speech recognition.
Both normal and disordered speech were exploited in the augmentation process.
The final speaker adapted system constructed using the UASpeech corpus and the best augmentation approach based on speed perturbation produced up to 2.92% absolute word error rate (WER)
arXiv Detail & Related papers (2022-01-14T17:09:22Z) - The Phonetic Footprint of Parkinson's Disease [16.64383793837174]
Parkinson's disease (PD) has a significant impact on the fine motor skills of patients.
Characteristic patterns such as vowel instability, slurred pronunciation and slow speech can often be observed in the affected individuals.
We used a phonetic recognizer trained exclusively on healthy speech data to investigate how PD affected the phonetic footprint of patients.
arXiv Detail & Related papers (2021-12-21T20:44:21Z) - Parkinsonian Chinese Speech Analysis towards Automatic Classification of
Parkinson's Disease [31.431256876809343]
Speech disorders often occur at the early stage of Parkinson's disease (PD)
We constructed a new speech corpus of Mandarin Chinese and addressed classification of patients with PD.
Our classification accuracy significantly surpassed state-of-the-art studies.
arXiv Detail & Related papers (2021-05-31T04:51:44Z) - NUVA: A Naming Utterance Verifier for Aphasia Treatment [49.114436579008476]
Assessment of speech performance using picture naming tasks is a key method for both diagnosis and monitoring of responses to treatment interventions by people with aphasia (PWA)
Here we present NUVA, an utterance verification system incorporating a deep learning element that classifies 'correct' versus'incorrect' naming attempts from aphasic stroke patients.
When tested on eight native British-English speaking PWA the system's performance accuracy ranged between 83.6% to 93.6%, with a 10-fold cross-validation mean of 89.5%.
arXiv Detail & Related papers (2021-02-10T13:00:29Z) - Multi-Modal Detection of Alzheimer's Disease from Speech and Text [3.702631194466718]
We propose a deep learning method that utilizes speech and the corresponding transcript simultaneously to detect Alzheimer's disease (AD)
The proposed method achieves 85.3% 10-fold cross-validation accuracy when trained and evaluated on the Dementiabank Pitt corpus.
arXiv Detail & Related papers (2020-11-30T21:18:17Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.