To BERT or Not To BERT: Comparing Speech and Language-based Approaches
for Alzheimer's Disease Detection
- URL: http://arxiv.org/abs/2008.01551v1
- Date: Sun, 26 Jul 2020 04:50:47 GMT
- Title: To BERT or Not To BERT: Comparing Speech and Language-based Approaches
for Alzheimer's Disease Detection
- Authors: Aparna Balagopalan, Benjamin Eyre, Frank Rudzicz, Jekaterina Novikova
- Abstract summary: Natural language processing and machine learning provide promising techniques for reliably detecting Alzheimer's disease (AD)
We compare and contrast the performance of two such approaches for AD detection on the recent ADReSS challenge dataset.
We observe that fine-tuned BERT models, given the relative importance of linguistics in cognitive impairment detection, outperform feature-based approaches on the AD detection task.
- Score: 17.99855227184379
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Research related to automatically detecting Alzheimer's disease (AD) is
important, given the high prevalence of AD and the high cost of traditional
methods. Since AD significantly affects the content and acoustics of
spontaneous speech, natural language processing and machine learning provide
promising techniques for reliably detecting AD. We compare and contrast the
performance of two such approaches for AD detection on the recent ADReSS
challenge dataset: 1) using domain knowledge-based hand-crafted features that
capture linguistic and acoustic phenomena, and 2) fine-tuning Bidirectional
Encoder Representations from Transformer (BERT)-based sequence classification
models. We also compare multiple feature-based regression models for a
neuropsychological score task in the challenge. We observe that fine-tuned BERT
models, given the relative importance of linguistics in cognitive impairment
detection, outperform feature-based approaches on the AD detection task.
Related papers
- Towards Within-Class Variation in Alzheimer's Disease Detection from Spontaneous Speech [60.08015780474457]
Alzheimer's Disease (AD) detection has emerged as a promising research area that employs machine learning classification models.
We identify within-class variation as a critical challenge in AD detection: individuals with AD exhibit a spectrum of cognitive impairments.
We propose two novel methods: Soft Target Distillation (SoTD) and Instance-level Re-balancing (InRe), targeting two problems respectively.
arXiv Detail & Related papers (2024-09-22T02:06:05Z) - Leveraging Pretrained Representations with Task-related Keywords for
Alzheimer's Disease Detection [69.53626024091076]
Alzheimer's disease (AD) is particularly prominent in older adults.
Recent advances in pre-trained models motivate AD detection modeling to shift from low-level features to high-level representations.
This paper presents several efficient methods to extract better AD-related cues from high-level acoustic and linguistic features.
arXiv Detail & Related papers (2023-03-14T16:03:28Z) - Multilingual Alzheimer's Dementia Recognition through Spontaneous
Speech: a Signal Processing Grand Challenge [18.684024762601215]
This Signal Processing Grand Challenge (SPGC) targets a difficult automatic prediction problem of societal and medical relevance.
The Challenge has been designed to assess the extent to which predictive models built based on speech in one language (English) generalise to another language (Greek)
arXiv Detail & Related papers (2023-01-13T14:09:13Z) - Acoustic-Linguistic Features for Modeling Neurological Task Score in
Alzheimer's [1.290382979353427]
Natural language processing and machine learning provide promising techniques for reliably detecting Alzheimer's disease.
We compare and contrast the performance of ten linear regression models for predicting Mini-Mental Status exam scores.
We find that, for the given task, handcrafted linguistic features are more significant than acoustic and learned features.
arXiv Detail & Related papers (2022-09-13T15:35:31Z) - Exploring linguistic feature and model combination for speech
recognition based automatic AD detection [61.91708957996086]
Speech based automatic AD screening systems provide a non-intrusive and more scalable alternative to other clinical screening techniques.
Scarcity of specialist data leads to uncertainty in both model selection and feature learning when developing such systems.
This paper investigates the use of feature and model combination approaches to improve the robustness of domain fine-tuning of BERT and Roberta pre-trained text encoders.
arXiv Detail & Related papers (2022-06-28T05:09:01Z) - Data Augmentation for Dementia Detection in Spoken Language [1.7324358447544175]
Recent deep-learning techniques can offer a faster diagnosis and have shown promising results.
They require large amounts of labelled data which is not easily available for the task of dementia detection.
One effective solution to sparse data problems is data augmentation, though the exact methods need to be selected carefully.
arXiv Detail & Related papers (2022-06-26T13:40:25Z) - Conformer Based Elderly Speech Recognition System for Alzheimer's
Disease Detection [62.23830810096617]
Early diagnosis of Alzheimer's disease (AD) is crucial in facilitating preventive care to delay further progression.
This paper presents the development of a state-of-the-art Conformer based speech recognition system built on the DementiaBank Pitt corpus for automatic AD detection.
arXiv Detail & Related papers (2022-06-23T12:50:55Z) - The Overlooked Classifier in Human-Object Interaction Recognition [82.20671129356037]
We encode the semantic correlation among classes into the classification head by initializing the weights with language embeddings of HOIs.
We propose a new loss named LSE-Sign to enhance multi-label learning on a long-tailed dataset.
Our simple yet effective method enables detection-free HOI classification, outperforming the state-of-the-arts that require object detection and human pose by a clear margin.
arXiv Detail & Related papers (2022-03-10T23:35:00Z) - Unsupervised Domain Adaptation for Dysarthric Speech Detection via
Domain Adversarial Training and Mutual Information Minimization [52.82138296332476]
This paper makes a first attempt to formulate cross-domain Dysarthric speech detection (DSD) as an unsupervised domain adaptation problem.
We propose a multi-task learning strategy, including dysarthria presence classification (DPC), domain adversarial training ( DAT) and mutual information minimization (MIM)
Experiments show that the incorporation of UDA attains absolute increases of 22.2% and 20.0% respectively in utterance-level weighted average recall and speaker-level accuracy.
arXiv Detail & Related papers (2021-06-18T13:34:36Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.