AD-BERT: Using Pre-trained contextualized embeddings to Predict the
Progression from Mild Cognitive Impairment to Alzheimer's Disease
- URL: http://arxiv.org/abs/2212.06042v1
- Date: Mon, 7 Nov 2022 04:05:46 GMT
- Title: AD-BERT: Using Pre-trained contextualized embeddings to Predict the
Progression from Mild Cognitive Impairment to Alzheimer's Disease
- Authors: Chengsheng Mao, Jie Xu, Luke Rasmussen, Yikuan Li, Prakash
Adekkanattu, Jennifer Pacheco, Borna Bonakdarpour, Robert Vassar, Guoqian
Jiang, Fei Wang, Jyotishman Pathak, Yuan Luo
- Abstract summary: We develop a deep learning framework based on the pre-trained Bidirectional Representations from Transformers (BERT) model.
We predict the risk of disease progression from Mild Cognitive Impairment (MCI) to Alzheimer's Disease (AD) using unstructured clinical notes.
- Score: 14.59521645987661
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Objective: We develop a deep learning framework based on the pre-trained
Bidirectional Encoder Representations from Transformers (BERT) model using
unstructured clinical notes from electronic health records (EHRs) to predict
the risk of disease progression from Mild Cognitive Impairment (MCI) to
Alzheimer's Disease (AD). Materials and Methods: We identified 3657 patients
diagnosed with MCI together with their progress notes from Northwestern
Medicine Enterprise Data Warehouse (NMEDW) between 2000-2020. The progress
notes no later than the first MCI diagnosis were used for the prediction. We
first preprocessed the notes by deidentification, cleaning and splitting, and
then pretrained a BERT model for AD (AD-BERT) based on the publicly available
Bio+Clinical BERT on the preprocessed notes. The embeddings of all the sections
of a patient's notes processed by AD-BERT were combined by MaxPooling to
compute the probability of MCI-to-AD progression. For replication, we conducted
a similar set of experiments on 2563 MCI patients identified at Weill Cornell
Medicine (WCM) during the same timeframe. Results: Compared with the 7 baseline
models, the AD-BERT model achieved the best performance on both datasets, with
Area Under receiver operating characteristic Curve (AUC) of 0.8170 and F1 score
of 0.4178 on NMEDW dataset and AUC of 0.8830 and F1 score of 0.6836 on WCM
dataset. Conclusion: We developed a deep learning framework using BERT models
which provide an effective solution for prediction of MCI-to-AD progression
using clinical note analysis.
Related papers
- Enhancing End Stage Renal Disease Outcome Prediction: A Multi-Sourced Data-Driven Approach [7.212939068975618]
We utilized data about 10,326 CKD patients, combining their clinical and claims information from 2009 to 2018.
A 24-month observation window was identified as optimal for balancing early detection and prediction accuracy.
The 2021 eGFR equation improved prediction accuracy and reduced racial bias, notably for African American patients.
arXiv Detail & Related papers (2024-10-02T03:21:01Z) - Using Pre-training and Interaction Modeling for ancestry-specific disease prediction in UK Biobank [69.90493129893112]
Recent genome-wide association studies (GWAS) have uncovered the genetic basis of complex traits, but show an under-representation of non-European descent individuals.
Here, we assess whether we can improve disease prediction across diverse ancestries using multiomic data.
arXiv Detail & Related papers (2024-04-26T16:39:50Z) - HiMAL: A Multimodal Hierarchical Multi-task Auxiliary Learning framework for predicting and explaining Alzheimer disease progression [0.0]
HiMAL (Hierarchical, Multi-task Auxiliary Learning) framework was developed.
It predicts cognitive composite functions as auxiliary tasks that estimate the longitudinal risk of transition from Mild Cognitive Impairment to Alzheimer Disease.
arXiv Detail & Related papers (2024-04-04T05:30:03Z) - Learning to diagnose cirrhosis from radiological and histological labels
with joint self and weakly-supervised pretraining strategies [62.840338941861134]
We propose to leverage transfer learning from large datasets annotated by radiologists, to predict the histological score available on a small annex dataset.
We compare different pretraining methods, namely weakly-supervised and self-supervised ones, to improve the prediction of the cirrhosis.
This method outperforms the baseline classification of the METAVIR score, reaching an AUC of 0.84 and a balanced accuracy of 0.75.
arXiv Detail & Related papers (2023-02-16T17:06:23Z) - Clinical Deterioration Prediction in Brazilian Hospitals Based on
Artificial Neural Networks and Tree Decision Models [56.93322937189087]
An extremely boosted neural network (XBNet) is used to predict clinical deterioration (CD)
The XGBoost model obtained the best results in predicting CD among Brazilian hospitals' data.
arXiv Detail & Related papers (2022-12-17T23:29:14Z) - A multi-stage machine learning model on diagnosis of esophageal
manometry [50.591267188664666]
The framework includes deep-learning models at the swallow-level stage and feature-based machine learning models at the study-level stage.
This is the first artificial-intelligence-style model to automatically predict CC diagnosis of HRM study from raw multi-swallow data.
arXiv Detail & Related papers (2021-06-25T20:09:23Z) - Cross-Cohort Generalizability of Deep and Conventional Machine Learning
for MRI-based Diagnosis and Prediction of Alzheimer's Disease [3.651988141747362]
This work validates the generalizability of MRI-based classification of Alzheimer's disease (AD) patients and controls (CN) to an external data set and to prediction of conversion to AD in individuals with mild cognitive impairment (MCI)
We used a conventional support vector machine (SVM) and a deep convolutional neural network (CNN) approach based on structural MRI scans that underwent either minimal prevalidation or more extensive prevalidation.
arXiv Detail & Related papers (2020-12-16T07:09:30Z) - UNITE: Uncertainty-based Health Risk Prediction Leveraging Multi-sourced
Data [81.00385374948125]
We present UNcertaInTy-based hEalth risk prediction (UNITE) model.
UNITE provides accurate disease risk prediction and uncertainty estimation leveraging multi-sourced health data.
We evaluate UNITE on real-world disease risk prediction tasks: nonalcoholic fatty liver disease (NASH) and Alzheimer's disease (AD)
UNITE achieves up to 0.841 in F1 score for AD detection, up to 0.609 in PR-AUC for NASH detection, and outperforms various state-of-the-art baselines by up to $19%$ over the best baseline.
arXiv Detail & Related papers (2020-10-22T02:28:11Z) - All Data Inclusive, Deep Learning Models to Predict Critical Events in
the Medical Information Mart for Intensive Care III Database (MIMIC III) [0.0]
This study was performed using 42,818 hospital admissions involving 35,348 patients.
Over 75 million events across multiple data sources were processed, resulting in over 355 million tokens.
It is possible to predict in-hospital mortality with much better confidence and higher reliability from models built using all sources of data.
arXiv Detail & Related papers (2020-09-02T22:12:18Z) - Predicting Clinical Diagnosis from Patients Electronic Health Records
Using BERT-based Neural Networks [62.9447303059342]
We show the importance of this problem in medical community.
We present a modification of Bidirectional Representations from Transformers (BERT) model for classification sequence.
We use a large-scale Russian EHR dataset consisting of about 4 million unique patient visits.
arXiv Detail & Related papers (2020-07-15T09:22:55Z) - Interpretable Machine Learning Model for Early Prediction of Mortality
in Elderly Patients with Multiple Organ Dysfunction Syndrome (MODS): a
Multicenter Retrospective Study and Cross Validation [9.808639780672156]
Elderly patients with MODS have high risk of death and poor prognosis.
This study aims to develop an interpretable and generalizable model for early mortality prediction in elderly patients with MODS.
arXiv Detail & Related papers (2020-01-28T17:15:34Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.