Hi-BEHRT: Hierarchical Transformer-based model for accurate prediction
of clinical events using multimodal longitudinal electronic health records
- URL: http://arxiv.org/abs/2106.11360v1
- Date: Mon, 21 Jun 2021 18:47:08 GMT
- Title: Hi-BEHRT: Hierarchical Transformer-based model for accurate prediction
of clinical events using multimodal longitudinal electronic health records
- Authors: Yikuan Li, Mohammad Mamouei, Gholamreza Salimi-Khorshidi, Shishir Rao,
Abdelaali Hassaine, Dexter Canoy, Thomas Lukasiewicz, and Kazem Rahimi
- Abstract summary: We present Hi-BEHRT, a hierarchical Transformer-based model that can extract associations from much longer sequences.
Using a multimodal large-scale linked longitudinal electronic health records, the Hi-BEHRT exceeds the state-of-the-art BROCT 1% to 5% area.
We also provide an effective end-to-end pre-training strategy for Hi-BEHRT using EHR, improving its transferability on predicting clinical events.
- Score: 23.36011610637235
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Electronic health records represent a holistic overview of patients'
trajectories. Their increasing availability has fueled new hopes to leverage
them and develop accurate risk prediction models for a wide range of diseases.
Given the complex interrelationships of medical records and patient outcomes,
deep learning models have shown clear merits in achieving this goal. However, a
key limitation of these models remains their capacity in processing long
sequences. Capturing the whole history of medical encounters is expected to
lead to more accurate predictions, but the inclusion of records collected for
decades and from multiple resources can inevitably exceed the receptive field
of the existing deep learning architectures. This can result in missing
crucial, long-term dependencies. To address this gap, we present Hi-BEHRT, a
hierarchical Transformer-based model that can significantly expand the
receptive field of Transformers and extract associations from much longer
sequences. Using a multimodal large-scale linked longitudinal electronic health
records, the Hi-BEHRT exceeds the state-of-the-art BEHRT 1% to 5% for area
under the receiver operating characteristic (AUROC) curve and 3% to 6% for area
under the precision recall (AUPRC) curve on average, and 3% to 6% (AUROC) and
3% to 11% (AUPRC) for patients with long medical history for 5-year heart
failure, diabetes, chronic kidney disease, and stroke risk prediction.
Additionally, because pretraining for hierarchical Transformer is not
well-established, we provide an effective end-to-end contrastive pre-training
strategy for Hi-BEHRT using EHR, improving its transferability on predicting
clinical events with relatively small training dataset.
Related papers
- Prediction of Lung Metastasis from Hepatocellular Carcinoma using the SEER Database [0.9055332067000195]
Hepatocellular carcinoma (HCC) is a leading cause of cancer-related mortality.
predictive models for lung metastasis inHCC remain limited in scope and clinical applicability.
We develop and validate an end-to-end machine learning pipeline using data from the Surveillance, Epidemiology, and End Results (SEER) database.
arXiv Detail & Related papers (2025-01-20T20:06:31Z) - PT: A Plain Transformer is Good Hospital Readmission Predictor [1.2695170900407071]
High readmission rates often indicate inadequate treatment or post-discharge care.
We propose PT, a Transformer-based model that integrates Electronic Health Records (EHR), medical images, and clinical notes.
PT extracts features from raw data and uses specialized Transformer blocks tailored to the data's complexity.
arXiv Detail & Related papers (2024-12-17T13:37:11Z) - MedDiffusion: Boosting Health Risk Prediction via Diffusion-based Data
Augmentation [58.93221876843639]
This paper introduces a novel, end-to-end diffusion-based risk prediction model, named MedDiffusion.
It enhances risk prediction performance by creating synthetic patient data during training to enlarge sample space.
It discerns hidden relationships between patient visits using a step-wise attention mechanism, enabling the model to automatically retain the most vital information for generating high-quality data.
arXiv Detail & Related papers (2023-10-04T01:36:30Z) - Clinical Deterioration Prediction in Brazilian Hospitals Based on
Artificial Neural Networks and Tree Decision Models [56.93322937189087]
An extremely boosted neural network (XBNet) is used to predict clinical deterioration (CD)
The XGBoost model obtained the best results in predicting CD among Brazilian hospitals' data.
arXiv Detail & Related papers (2022-12-17T23:29:14Z) - Integrated Convolutional and Recurrent Neural Networks for Health Risk
Prediction using Patient Journey Data with Many Missing Values [9.418011774179794]
This paper proposes a novel end-to-end approach to modeling EHR patient journey data with Integrated Convolutional and Recurrent Neural Networks.
Our model can capture both long- and short-term temporal patterns within each patient journey and effectively handle the high degree of missingness in EHR data without any imputation data generation.
arXiv Detail & Related papers (2022-11-11T07:36:18Z) - Robust and Efficient Medical Imaging with Self-Supervision [80.62711706785834]
We present REMEDIS, a unified representation learning strategy to improve robustness and data-efficiency of medical imaging AI.
We study a diverse range of medical imaging tasks and simulate three realistic application scenarios using retrospective data.
arXiv Detail & Related papers (2022-05-19T17:34:18Z) - Multimodal spatiotemporal graph neural networks for improved prediction
of 30-day all-cause hospital readmission [4.609543591101764]
We propose a multimodal, modality-agnostic graph neural network (MM-STGNN) for prediction of 30-day all-cause hospital readmission.
MM-STGNN achieves AU of 0.79 on both primary and external datasets.
For subset populations of patients with heart and vascular disease, our model also outperforms baselines on predicting 30-day readmission.
arXiv Detail & Related papers (2022-04-14T05:50:07Z) - SANSformers: Self-Supervised Forecasting in Electronic Health Records
with Attention-Free Models [48.07469930813923]
This work aims to forecast the demand for healthcare services, by predicting the number of patient visits to healthcare facilities.
We introduce SANSformer, an attention-free sequential model designed with specific inductive biases to cater for the unique characteristics of EHR data.
Our results illuminate the promising potential of tailored attention-free models and self-supervised pretraining in refining healthcare utilization predictions across various patient demographics.
arXiv Detail & Related papers (2021-08-31T08:23:56Z) - Bootstrapping Your Own Positive Sample: Contrastive Learning With
Electronic Health Record Data [62.29031007761901]
This paper proposes a novel contrastive regularized clinical classification model.
We introduce two unique positive sampling strategies specifically tailored for EHR data.
Our framework yields highly competitive experimental results in predicting the mortality risk on real-world COVID-19 EHR data.
arXiv Detail & Related papers (2021-04-07T06:02:04Z) - Longitudinal modeling of MS patient trajectories improves predictions of
disability progression [2.117653457384462]
This work addresses the task of optimally extracting information from longitudinal patient data in the real-world setting.
We show that with machine learning methods suited for patient trajectories modeling, we can predict disability progression of patients in a two-year horizon.
Compared to the models available in the literature, this work uses the most complete patient history for MS disease progression prediction.
arXiv Detail & Related papers (2020-11-09T20:48:00Z) - UNITE: Uncertainty-based Health Risk Prediction Leveraging Multi-sourced
Data [81.00385374948125]
We present UNcertaInTy-based hEalth risk prediction (UNITE) model.
UNITE provides accurate disease risk prediction and uncertainty estimation leveraging multi-sourced health data.
We evaluate UNITE on real-world disease risk prediction tasks: nonalcoholic fatty liver disease (NASH) and Alzheimer's disease (AD)
UNITE achieves up to 0.841 in F1 score for AD detection, up to 0.609 in PR-AUC for NASH detection, and outperforms various state-of-the-art baselines by up to $19%$ over the best baseline.
arXiv Detail & Related papers (2020-10-22T02:28:11Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.