Cross-lingual Alzheimer's Disease detection based on paralinguistic and
pre-trained features
- URL: http://arxiv.org/abs/2303.07650v1
- Date: Tue, 14 Mar 2023 06:34:18 GMT
- Title: Cross-lingual Alzheimer's Disease detection based on paralinguistic and
pre-trained features
- Authors: Xuchu Chen, Yu Pu, Jinpeng Li, Wei-Qiang Zhang
- Abstract summary: We present our submission to the ICASSP-SPGC-2023 ADReSS-M Challenge Task.
This task aims to investigate which acoustic features can be generalized and transferred across languages for Alzheimer's Disease prediction.
We extract paralinguistic features using openSmile toolkit and acoustic features using XLSR-53.
Our method achieves an accuracy of 69.6% on the classification task and a root mean squared error (RMSE) of 4.788 on the regression task.
- Score: 6.928826160866143
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: We present our submission to the ICASSP-SPGC-2023 ADReSS-M Challenge Task,
which aims to investigate which acoustic features can be generalized and
transferred across languages for Alzheimer's Disease (AD) prediction. The
challenge consists of two tasks: one is to classify the speech of AD patients
and healthy individuals, and the other is to infer Mini Mental State
Examination (MMSE) score based on speech only. The difficulty is mainly
embodied in the mismatch of the dataset, in which the training set is in
English while the test set is in Greek. We extract paralinguistic features
using openSmile toolkit and acoustic features using XLSR-53. In addition, we
extract linguistic features after transcribing the speech into text. These
features are used as indicators for AD detection in our method. Our method
achieves an accuracy of 69.6% on the classification task and a root mean
squared error (RMSE) of 4.788 on the regression task. The results show that our
proposed method is expected to achieve automatic multilingual Alzheimer's
Disease detection through spontaneous speech.
Related papers
- Leveraging Pretrained Representations with Task-related Keywords for
Alzheimer's Disease Detection [69.53626024091076]
Alzheimer's disease (AD) is particularly prominent in older adults.
Recent advances in pre-trained models motivate AD detection modeling to shift from low-level features to high-level representations.
This paper presents several efficient methods to extract better AD-related cues from high-level acoustic and linguistic features.
arXiv Detail & Related papers (2023-03-14T16:03:28Z) - Multilingual Alzheimer's Dementia Recognition through Spontaneous
Speech: a Signal Processing Grand Challenge [18.684024762601215]
This Signal Processing Grand Challenge (SPGC) targets a difficult automatic prediction problem of societal and medical relevance.
The Challenge has been designed to assess the extent to which predictive models built based on speech in one language (English) generalise to another language (Greek)
arXiv Detail & Related papers (2023-01-13T14:09:13Z) - Exploiting prompt learning with pre-trained language models for
Alzheimer's Disease detection [70.86672569101536]
Early diagnosis of Alzheimer's disease (AD) is crucial in facilitating preventive care and to delay further progression.
This paper investigates the use of prompt-based fine-tuning of PLMs that consistently uses AD classification errors as the training objective function.
arXiv Detail & Related papers (2022-10-29T09:18:41Z) - Exploring linguistic feature and model combination for speech
recognition based automatic AD detection [61.91708957996086]
Speech based automatic AD screening systems provide a non-intrusive and more scalable alternative to other clinical screening techniques.
Scarcity of specialist data leads to uncertainty in both model selection and feature learning when developing such systems.
This paper investigates the use of feature and model combination approaches to improve the robustness of domain fine-tuning of BERT and Roberta pre-trained text encoders.
arXiv Detail & Related papers (2022-06-28T05:09:01Z) - Exploiting Cross-domain And Cross-Lingual Ultrasound Tongue Imaging
Features For Elderly And Dysarthric Speech Recognition [55.25565305101314]
Articulatory features are invariant to acoustic signal distortion and have been successfully incorporated into automatic speech recognition systems.
This paper presents a cross-domain and cross-lingual A2A inversion approach that utilizes the parallel audio and ultrasound tongue imaging (UTI) data of the 24-hour TaL corpus in A2A model pre-training.
Experiments conducted on three tasks suggested incorporating the generated articulatory features consistently outperformed the baseline TDNN and Conformer ASR systems.
arXiv Detail & Related papers (2022-06-15T07:20:28Z) - Influence of ASR and Language Model on Alzheimer's Disease Detection [2.4698886064068555]
We analyse the usage of a SotA ASR system to transcribe participant's spoken descriptions from a picture.
We study the influence of a language model -- which tends to correct non-standard sequences of words -- with the lack of language model to decode the hypothesis from the ASR.
The proposed system combines acoustic -- based on prosody and voice quality -- and lexical features based on the first occurrence of the most common words.
arXiv Detail & Related papers (2021-09-20T10:41:39Z) - Explainable Identification of Dementia from Transcripts using
Transformer Networks [0.0]
Alzheimer's disease (AD) is the main cause of dementia which is accompanied by loss of memory and may lead to severe consequences in peoples' everyday life if not diagnosed on time.
We introduce two multi-task learning models, where the main task refers to the identification of dementia (binary classification) and the auxiliary one corresponds to the identification of the severity of dementia (multiclass classification)
Our model obtains accuracy equal to 84.99% on the detection of AD patients in the multi-task learning setting.
arXiv Detail & Related papers (2021-09-14T21:49:05Z) - NUVA: A Naming Utterance Verifier for Aphasia Treatment [49.114436579008476]
Assessment of speech performance using picture naming tasks is a key method for both diagnosis and monitoring of responses to treatment interventions by people with aphasia (PWA)
Here we present NUVA, an utterance verification system incorporating a deep learning element that classifies 'correct' versus'incorrect' naming attempts from aphasic stroke patients.
When tested on eight native British-English speaking PWA the system's performance accuracy ranged between 83.6% to 93.6%, with a 10-fold cross-validation mean of 89.5%.
arXiv Detail & Related papers (2021-02-10T13:00:29Z) - Combining Prosodic, Voice Quality and Lexical Features to Automatically
Detect Alzheimer's Disease [0.0]
This paper is a contribution to the ADReSS Challenge, aiming at improving Alzheimer's automatic detection from spontaneous speech.
Recordings from 108 participants, which are age-, gender-, and AD condition-balanced, have been used as training set.
Both tasks have been performed extracting 28 features from speech based on prosody and voice quality.
arXiv Detail & Related papers (2020-11-18T13:37:27Z) - The NTT DCASE2020 Challenge Task 6 system: Automated Audio Captioning
with Keywords and Sentence Length Estimation [49.41766997393417]
This report describes the system participating to the Detection and Classification of Acoustic Scenes and Events (DCASE) 2020 Challenge, Task 6.
Our submission focuses on solving two indeterminacy problems in automated audio captioning: word selection indeterminacy and sentence length indeterminacy.
We simultaneously solve the main caption generation and sub indeterminacy problems by estimating keywords and sentence length through multi-task learning.
arXiv Detail & Related papers (2020-07-01T04:26:27Z) - Alzheimer's Dementia Recognition through Spontaneous Speech: The ADReSS
Challenge [10.497861245133086]
The ADReSS Challenge at INTERSPEECH 2020 defines a shared task through which different approaches to the automated recognition of Alzheimer's dementia can be compared.
ADReSS provides researchers with a benchmark speech dataset which has been acoustically pre-processed and balanced in terms of age and gender.
arXiv Detail & Related papers (2020-04-14T23:25:09Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.