Dementia Detection using Multi-modal Methods on Audio Data
- URL: http://arxiv.org/abs/2501.00465v1
- Date: Tue, 31 Dec 2024 14:26:54 GMT
- Title: Dementia Detection using Multi-modal Methods on Audio Data
- Authors: Saugat Kannojia, Anirudh Praveen, Danish Vasdev, Saket Nandedkar, Divyansh Mittal, Sarthak Kalankar, Shaurya Johari, Vipul Arora,
- Abstract summary: This paper attempts to talk about a model developed to predict the onset of dementia using audio recordings from patients.
An ASR-based model was developed that generates transcripts from the audio files using Whisper model and then applies RoBERTa regression model to generate an MMSE score for the patient.
The model achieved an RMSE score of 2.6911 which is around 10 percent lower than the described baseline.
- Score: 1.5517328698216646
- License:
- Abstract: Dementia is a neurodegenerative disease that causes gradual cognitive impairment, which is very common in the world and undergoes a lot of research every year to prevent and cure it. It severely impacts the patient's ability to remember events and communicate clearly, where most variations of it have no known cure, but early detection can help alleviate symptoms before they become worse. One of the main symptoms of dementia is difficulty in expressing ideas through speech. This paper attempts to talk about a model developed to predict the onset of the disease using audio recordings from patients. An ASR-based model was developed that generates transcripts from the audio files using Whisper model and then applies RoBERTa regression model to generate an MMSE score for the patient. This score can be used to predict the extent to which the cognitive ability of a patient has been affected. We use the PROCESS_V1 dataset for this task, which is introduced through the PROCESS Grand Challenge 2025. The model achieved an RMSE score of 2.6911 which is around 10 percent lower than the described baseline.
Related papers
- Dementia Classification Using Acoustic Speech and Feature Selection [0.3749861135832073]
The study employs Ridge linear regression, Extreme Minimal Learning Machine, and Linear Support Vector Machine learning models to compute feature importance scores based on model outputs.
The study's results rank among the top compared to other studies using the same dataset and acoustic feature extraction for dementia diagnosis.
arXiv Detail & Related papers (2025-02-04T14:50:19Z) - Alzheimer's Magnetic Resonance Imaging Classification Using Deep and Meta-Learning Models [2.4561590439700076]
This study focuses on classifying Magnetic Resonance Imaging (MRI) data for Alzheimer's disease (AD) by leveraging deep learning techniques characterized by state-of-the-art CNNs.
Alzheimer's disease is the leading cause of dementia in the elderly, and it is an irreversible brain illness that causes gradual cognitive function disorder.
In future, this study can be extended to incorporate other types of medical data, including signals, images, and other data.
arXiv Detail & Related papers (2024-05-20T15:44:07Z) - A Comparative Analysis on Metaheuristic Algorithms Based Vision
Transformer Model for Early Detection of Alzheimer's Disease [0.7673339435080445]
A number of life threatening neuro-degenerative disorders had degraded the quality of life for the older generation in particular.
Dementia is one such symptom which may lead to a severe condition called Alzheimer's disease if not detected at an early stage.
In this paper, an innovative metaheuristic algorithms based ViT model has been proposed for the identification of dementia at different stage.
arXiv Detail & Related papers (2024-01-18T08:31:38Z) - DDxT: Deep Generative Transformer Models for Differential Diagnosis [51.25660111437394]
We show that a generative approach trained with simpler supervised and self-supervised learning signals can achieve superior results on the current benchmark.
The proposed Transformer-based generative network, named DDxT, autoregressively produces a set of possible pathologies, i.e., DDx, and predicts the actual pathology using a neural network.
arXiv Detail & Related papers (2023-12-02T22:57:25Z) - How Does Pruning Impact Long-Tailed Multi-Label Medical Image
Classifiers? [49.35105290167996]
Pruning has emerged as a powerful technique for compressing deep neural networks, reducing memory usage and inference time without significantly affecting overall performance.
This work represents a first step toward understanding the impact of pruning on model behavior in deep long-tailed, multi-label medical image classification.
arXiv Detail & Related papers (2023-08-17T20:40:30Z) - Automatic diagnosis of knee osteoarthritis severity using Swin
transformer [55.01037422579516]
Knee osteoarthritis (KOA) is a widespread condition that can cause chronic pain and stiffness in the knee joint.
We propose an automated approach that employs the Swin Transformer to predict the severity of KOA.
arXiv Detail & Related papers (2023-07-10T09:49:30Z) - Leveraging Pretrained Representations with Task-related Keywords for
Alzheimer's Disease Detection [69.53626024091076]
Alzheimer's disease (AD) is particularly prominent in older adults.
Recent advances in pre-trained models motivate AD detection modeling to shift from low-level features to high-level representations.
This paper presents several efficient methods to extract better AD-related cues from high-level acoustic and linguistic features.
arXiv Detail & Related papers (2023-03-14T16:03:28Z) - Acoustic-Linguistic Features for Modeling Neurological Task Score in
Alzheimer's [1.290382979353427]
Natural language processing and machine learning provide promising techniques for reliably detecting Alzheimer's disease.
We compare and contrast the performance of ten linear regression models for predicting Mini-Mental Status exam scores.
We find that, for the given task, handcrafted linguistic features are more significant than acoustic and learned features.
arXiv Detail & Related papers (2022-09-13T15:35:31Z) - Remote Medication Status Prediction for Individuals with Parkinson's
Disease using Time-series Data from Smartphones [75.23250968928578]
We present a method for predicting the medication status of Parkinson's disease patients using the public mPower dataset.
The proposed method shows promising results in predicting three medication statuses objectively.
arXiv Detail & Related papers (2022-07-26T02:08:08Z) - Comparing Natural Language Processing Techniques for Alzheimer's
Dementia Prediction in Spontaneous Speech [1.2805268849262246]
Alzheimer's Dementia (AD) is an incurable, debilitating, and progressive neurodegenerative condition that affects cognitive function.
The Alzheimer's Dementia Recognition through Spontaneous Speech task offers acoustically pre-processed and balanced datasets for the classification and prediction of AD.
arXiv Detail & Related papers (2020-06-12T17:51:16Z) - Detecting Parkinsonian Tremor from IMU Data Collected In-The-Wild using
Deep Multiple-Instance Learning [59.74684475991192]
Parkinson's Disease (PD) is a slowly evolving neuro-logical disease that affects about 1% of the population above 60 years old.
PD symptoms include tremor, rigidity and braykinesia.
We present a method for automatically identifying tremorous episodes related to PD, based on IMU signals captured via a smartphone device.
arXiv Detail & Related papers (2020-05-06T09:02:30Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.