Automatic Prediction of Amyotrophic Lateral Sclerosis Progression using Longitudinal Speech Transformer
- URL: http://arxiv.org/abs/2406.18625v1
- Date: Wed, 26 Jun 2024 13:28:24 GMT
- Title: Automatic Prediction of Amyotrophic Lateral Sclerosis Progression using Longitudinal Speech Transformer
- Authors: Liming Wang, Yuan Gong, Nauman Dawalatabad, Marco Vilela, Katerina Placek, Brian Tracey, Yishu Gong, Alan Premasiri, Fernando Vieira, James Glass,
- Abstract summary: We propose ALS longitudinal speech transformer (ALST), a neural network-based automatic predictor of ALS disease progression.
By taking advantage of high-quality pretrained speech features and longitudinal information in the recordings, our best model achieves 91.0% AUC.
ALST is capable of fine-grained and interpretable predictions of ALS progression, especially for distinguishing between rarer and more severe cases.
- Score: 56.17737749551133
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Automatic prediction of amyotrophic lateral sclerosis (ALS) disease progression provides a more efficient and objective alternative than manual approaches. We propose ALS longitudinal speech transformer (ALST), a neural network-based automatic predictor of ALS disease progression from longitudinal speech recordings of ALS patients. By taking advantage of high-quality pretrained speech features and longitudinal information in the recordings, our best model achieves 91.0\% AUC, improving upon the previous best model by 5.6\% relative on the ALS TDI dataset. Careful analysis reveals that ALST is capable of fine-grained and interpretable predictions of ALS progression, especially for distinguishing between rarer and more severe cases. Code is publicly available.
Related papers
- Towards Interpretable End-Stage Renal Disease (ESRD) Prediction: Utilizing Administrative Claims Data with Explainable AI Techniques [6.417777780911223]
This study explores the potential of utilizing administrative claims data, combined with advanced machine learning and deep learning techniques, to predict the progression of Chronic Kidney Disease (CKD) to End-Stage Renal Disease (ESRD)
We analyze a comprehensive, 10-year dataset provided by a major health insurance organization to develop prediction models for multiple observation windows using traditional machine learning methods such as Random Forest and XGBoost as well as deep learning approaches such as Long Short-Term Memory (LSTM) networks.
Our findings demonstrate that the LSTM model, particularly with a 24-month observation window, exhibits superior performance in predicting ESRD progression,
arXiv Detail & Related papers (2024-09-18T16:03:57Z) - Normative Diffusion Autoencoders: Application to Amyotrophic Lateral Sclerosis [1.7397451877951422]
Normative models present a solution as they increase statistical power by leveraging large healthy cohorts.
We combine the benefits of generative and normative modelling by introducing the normative diffusion autoencoder framework.
Our approach outperforms generative and non-generative normative modelling benchmarks in ALS prognostication.
arXiv Detail & Related papers (2024-07-19T10:39:24Z) - Self-supervised ASR Models and Features For Dysarthric and Elderly Speech Recognition [71.87998918300806]
This paper explores approaches to integrate domain fine-tuned SSL pre-trained models and their features into TDNN and Conformer ASR systems.
TDNN systems constructed by integrating domain-adapted HuBERT, wav2vec2-conformer or multi-lingual XLSR models consistently outperform standalone fine-tuned SSL pre-trained models.
Consistent improvements in Alzheimer's Disease detection accuracy are also obtained using the DementiaBank Pitt elderly speech recognition outputs.
arXiv Detail & Related papers (2024-07-03T08:33:39Z) - It's Never Too Late: Fusing Acoustic Information into Large Language
Models for Automatic Speech Recognition [70.77292069313154]
Large language models (LLMs) can be successfully used for generative error correction (GER) on top of the automatic speech recognition (ASR) output.
In this work, we aim to overcome such a limitation by infusing acoustic information before generating the predicted transcription through a novel late fusion solution termed Uncertainty-Aware Dynamic Fusion (UADF)
arXiv Detail & Related papers (2024-02-08T07:21:45Z) - Sequential Model for Predicting Patient Adherence in Subcutaneous Immunotherapy for Allergic Rhinitis [16.386676205583697]
Subcutaneous Immunotherapy (SCIT) is the long-lasting causal treatment of allergic rhinitis (AR)
This study aims to leverage novel machine learning models to precisely predict the risk of non-adherence of AR patients.
arXiv Detail & Related papers (2024-01-21T09:55:47Z) - Acoustic-to-articulatory inversion for dysarthric speech: Are
pre-trained self-supervised representations favorable? [3.43759997215733]
Acoustic-to-articulatory inversion (AAI) involves mapping from the acoustic to the articulatory space.
In this work, we perform AAI for dysarthric speech using representations from pre-trained self-supervised learning (SSL) models.
arXiv Detail & Related papers (2023-09-03T07:44:38Z) - Hyper-parameter Adaptation of Conformer ASR Systems for Elderly and
Dysarthric Speech Recognition [64.9816313630768]
Fine-tuning is often used to exploit the large quantities of non-aged and healthy speech pre-trained models.
This paper investigates hyper- parameter adaptation for Conformer ASR systems that are pre-trained on the Librispeech corpus.
arXiv Detail & Related papers (2023-06-27T07:49:35Z) - Large Language Models for Healthcare Data Augmentation: An Example on
Patient-Trial Matching [49.78442796596806]
We propose an innovative privacy-aware data augmentation approach for patient-trial matching (LLM-PTM)
Our experiments demonstrate a 7.32% average improvement in performance using the proposed LLM-PTM method, and the generalizability to new data is improved by 12.12%.
arXiv Detail & Related papers (2023-03-24T03:14:00Z) - Clinical Deterioration Prediction in Brazilian Hospitals Based on
Artificial Neural Networks and Tree Decision Models [56.93322937189087]
An extremely boosted neural network (XBNet) is used to predict clinical deterioration (CD)
The XGBoost model obtained the best results in predicting CD among Brazilian hospitals' data.
arXiv Detail & Related papers (2022-12-17T23:29:14Z) - Classification of ALS patients based on acoustic analysis of sustained
vowel phonations [0.7106986689736825]
Amyotrophic lateral sclerosis (ALS) is incurable neurological disorder with rapidly progressive course.
Common early symptoms of ALS are difficulty in swallowing and speech.
This study presents an approach to voice assessment for automatic system that separates healthy people from patients with ALS.
arXiv Detail & Related papers (2020-12-14T08:56:53Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.