Exploring linguistic feature and model combination for speech
recognition based automatic AD detection
- URL: http://arxiv.org/abs/2206.13758v1
- Date: Tue, 28 Jun 2022 05:09:01 GMT
- Title: Exploring linguistic feature and model combination for speech
recognition based automatic AD detection
- Authors: Yi Wang, Tianzi Wang, Zi Ye, Lingwei Meng, Shoukang Hu, Xixin Wu,
Xunying Liu, Helen Meng
- Abstract summary: Speech based automatic AD screening systems provide a non-intrusive and more scalable alternative to other clinical screening techniques.
Scarcity of specialist data leads to uncertainty in both model selection and feature learning when developing such systems.
This paper investigates the use of feature and model combination approaches to improve the robustness of domain fine-tuning of BERT and RoBERTa pre-trained text encoders.
- Score: 61.91708957996086
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Early diagnosis of Alzheimer's disease (AD) is crucial in facilitating
preventive care and delaying progression. Speech based automatic AD screening
systems provide a non-intrusive and more scalable alternative to other clinical
screening techniques. Scarcity of such specialist data leads to uncertainty in
both model selection and feature learning when developing such systems. To this
end, this paper investigates the use of feature and model combination
approaches to improve the robustness of domain fine-tuning of BERT and RoBERTa
pre-trained text encoders on limited data, before the resulting embedding
features are fed into an ensemble of backend classifiers to produce the final
AD detection decision via majority voting. Experiments conducted on the
ADReSS20 Challenge dataset suggest consistent performance improvements were
obtained using model and feature combination in system development.
State-of-the-art AD detection accuracies of 91.67 percent and 93.75 percent
were obtained using manual and ASR speech transcripts respectively on the
ADReSS20 test set consisting of 48 elderly speakers.
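The majority-voting step and the reported accuracies can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the "AD"/"HC" labels, and the three toy classifier outputs are all hypothetical, and the tie-breaking rule (first label seen wins) is an assumption the abstract does not specify. The second helper only converts a reported accuracy into a count of correctly classified speakers on a 48-speaker test set.

```python
from collections import Counter


def majority_vote(votes):
    """Return the most frequent label among one speaker's classifier votes.

    Ties fall to the label seen first (an assumption made for this sketch).
    """
    return Counter(votes).most_common(1)[0][0]


def ensemble_decisions(per_classifier_preds):
    """Combine aligned per-classifier prediction lists into one decision per speaker."""
    return [majority_vote(votes) for votes in zip(*per_classifier_preds)]


def correct_count(accuracy_percent, n_speakers):
    """Convert a reported accuracy (in percent) into a number of correct decisions."""
    return round(accuracy_percent / 100 * n_speakers)


# Hypothetical outputs of three backend classifiers for four speakers.
clf_a = ["AD", "HC", "AD", "HC"]
clf_b = ["AD", "AD", "HC", "HC"]
clf_c = ["HC", "AD", "AD", "HC"]

decisions = ensemble_decisions([clf_a, clf_b, clf_c])
print(decisions)

# Reported accuracies on the 48-speaker ADReSS20 test set.
print(correct_count(91.67, 48), "of 48 correct with manual transcripts")
print(correct_count(93.75, 48), "of 48 correct with ASR transcripts")
```

Under these assumptions, the reported 91.67 percent and 93.75 percent correspond to 44 and 45 correctly classified speakers out of 48, so the two systems differ by a single speaker.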
Related papers
- Hyper-parameter Adaptation of Conformer ASR Systems for Elderly and
Dysarthric Speech Recognition [64.9816313630768]
Fine-tuning is often used to exploit models pre-trained on large quantities of non-aged and healthy speech.
This paper investigates hyper-parameter adaptation for Conformer ASR systems that are pre-trained on the Librispeech corpus.
arXiv Detail & Related papers (2023-06-27T07:49:35Z)
- Leveraging Pretrained Representations with Task-related Keywords for
Alzheimer's Disease Detection [69.53626024091076]
Alzheimer's disease (AD) is particularly prominent in older adults.
Recent advances in pre-trained models motivate AD detection modeling to shift from low-level features to high-level representations.
This paper presents several efficient methods to extract better AD-related cues from high-level acoustic and linguistic features.
arXiv Detail & Related papers (2023-03-14T16:03:28Z)
- Exploiting prompt learning with pre-trained language models for
Alzheimer's Disease detection [70.86672569101536]
Early diagnosis of Alzheimer's disease (AD) is crucial in facilitating preventive care and delaying further progression.
This paper investigates the use of prompt-based fine-tuning of PLMs that consistently uses AD classification errors as the training objective function.
arXiv Detail & Related papers (2022-10-29T09:18:41Z)
- Conformer Based Elderly Speech Recognition System for Alzheimer's
Disease Detection [62.23830810096617]
Early diagnosis of Alzheimer's disease (AD) is crucial in facilitating preventive care to delay further progression.
This paper presents the development of a state-of-the-art Conformer based speech recognition system built on the DementiaBank Pitt corpus for automatic AD detection.
arXiv Detail & Related papers (2022-06-23T12:50:55Z)
- NUVA: A Naming Utterance Verifier for Aphasia Treatment [49.114436579008476]
Assessment of speech performance using picture naming tasks is a key method for both diagnosis and monitoring of responses to treatment interventions by people with aphasia (PWA).
Here we present NUVA, an utterance verification system incorporating a deep learning element that classifies 'correct' versus 'incorrect' naming attempts from aphasic stroke patients.
When tested on eight native British-English speaking PWA, the system's accuracy ranged from 83.6% to 93.6%, with a 10-fold cross-validation mean of 89.5%.
arXiv Detail & Related papers (2021-02-10T13:00:29Z)
- Combining Prosodic, Voice Quality and Lexical Features to Automatically
Detect Alzheimer's Disease [0.0]
This paper is a contribution to the ADReSS Challenge, aiming at improving Alzheimer's automatic detection from spontaneous speech.
Recordings from 108 participants, which are age-, gender-, and AD condition-balanced, were used as the training set.
Both tasks were performed by extracting 28 features from speech based on prosody and voice quality.
arXiv Detail & Related papers (2020-11-18T13:37:27Z)
- To BERT or Not To BERT: Comparing Speech and Language-based Approaches
for Alzheimer's Disease Detection [17.99855227184379]
Natural language processing and machine learning provide promising techniques for reliably detecting Alzheimer's disease (AD).
We compare and contrast the performance of two such approaches for AD detection on the recent ADReSS challenge dataset.
We observe that fine-tuned BERT models, given the relative importance of linguistics in cognitive impairment detection, outperform feature-based approaches on the AD detection task.
arXiv Detail & Related papers (2020-07-26T04:50:47Z)