Harnessing the power of longitudinal medical imaging for eye disease prognosis using Transformer-based sequence modeling
- URL: http://arxiv.org/abs/2405.08780v2
- Date: Tue, 30 Jul 2024 03:42:00 GMT
- Title: Harnessing the power of longitudinal medical imaging for eye disease prognosis using Transformer-based sequence modeling
- Authors: Gregory Holste, Mingquan Lin, Ruiwen Zhou, Fei Wang, Lei Liu, Qi Yan, Sarah H. Van Tassel, Kyle Kovacs, Emily Y. Chew, Zhiyong Lu, Zhangyang Wang, Yifan Peng,
- Abstract summary: Our proposed Longitudinal Transformer for Survival Analysis (LTSA) enables dynamic disease prognosis from longitudinal medical imaging.
A temporal attention analysis also suggested that, while the most recent image is typically the most influential, prior imaging still provides additional prognostic value.
- Score: 49.52787013516891
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Deep learning has enabled breakthroughs in automated diagnosis from medical imaging, with many successful applications in ophthalmology. However, standard medical image classification approaches only assess disease presence at the time of acquisition, neglecting the common clinical setting of longitudinal imaging. For slow, progressive eye diseases like age-related macular degeneration (AMD) and primary open-angle glaucoma (POAG), patients undergo repeated imaging over time to track disease progression and forecasting the future risk of developing disease is critical to properly plan treatment. Our proposed Longitudinal Transformer for Survival Analysis (LTSA) enables dynamic disease prognosis from longitudinal medical imaging, modeling the time to disease from sequences of fundus photography images captured over long, irregular time periods. Using longitudinal imaging data from the Age-Related Eye Disease Study (AREDS) and Ocular Hypertension Treatment Study (OHTS), LTSA significantly outperformed a single-image baseline in 19/20 head-to-head comparisons on late AMD prognosis and 18/20 comparisons on POAG prognosis. A temporal attention analysis also suggested that, while the most recent image is typically the most influential, prior imaging still provides additional prognostic value.
Related papers
- Time-to-Event Pretraining for 3D Medical Imaging [44.46415168541444]
We introduce time-to-event pretraining, a pretraining framework for 3D medical imaging models.
We use a dataset of 18,945 CT scans (4.2 million 2D images) and time-to-event distributions across thousands of EHR-derived tasks.
Our method improves outcome prediction, achieving an average AUROC increase of 23.7% and a 29.4% gain in Harrell's C-index across 8 benchmark tasks.
arXiv Detail & Related papers (2024-11-14T11:08:54Z) - Extrapolating Prospective Glaucoma Fundus Images through Diffusion Model in Irregular Longitudinal Sequences [46.80977922491862]
The utilization of longitudinal datasets for glaucoma progression prediction offers a compelling approach to support early therapeutic interventions.
We propose a novel diffusion-based model to predict prospective images by extrapolating from existing longitudinal fundus images of patients.
arXiv Detail & Related papers (2024-10-28T15:31:47Z) - L-MAE: Longitudinal masked auto-encoder with time and severity-aware encoding for diabetic retinopathy progression prediction [2.663690023739801]
Pre-training strategies based on self-supervised learning (SSL) have proven to be effective pretext tasks for many downstream tasks in computer vision.
We developed a longitudinal masked auto-encoder (MAE) based on the well-known Transformer-based MAE.
Using OPHDIAT, a large follow-up screening dataset targeting diabetic retinopathy (DR), we evaluated the pre-trained weights on a longitudinal task.
arXiv Detail & Related papers (2024-03-24T19:34:33Z) - Multi-scale Spatio-temporal Transformer-based Imbalanced Longitudinal
Learning for Glaucoma Forecasting from Irregular Time Series Images [45.894671834869975]
Glaucoma is one of the major eye diseases that leads to progressive optic nerve fiber damage and irreversible blindness.
We introduce the Multi-scale Spatio-temporal Transformer Network (MST-former) based on the transformer architecture tailored for sequential image inputs.
Our method shows excellent generalization capability on the Alzheimer's Disease Neuroimaging Initiative (ADNI) MRI dataset, with an accuracy of 90.3% for mild cognitive impairment and Alzheimer's disease prediction.
arXiv Detail & Related papers (2024-02-21T02:16:59Z) - Strategy for Rapid Diabetic Retinopathy Exposure Based on Enhanced
Feature Extraction Processing [0.0]
This research aims to improve diabetic retinopathy diagnosis by developing an enhanced deep learning model for timely DR identification.
The proposed model will detect various lesions from retinal images in the early stages.
arXiv Detail & Related papers (2023-05-08T14:17:33Z) - A CNN-LSTM Combination Network for Cataract Detection using Eye Fundus
Images [0.0]
One of the leading causes of irreversible blindness in persons over the age of 50 is delayed cataract treatment.
We developed a CNN-LSTM-based model architecture with the goal of creating a low-cost diagnostic system.
The suggested architecture outperformed previous systems with a state-of-the-art 97.53% accuracy.
arXiv Detail & Related papers (2022-10-28T12:35:15Z) - RADNet: Ensemble Model for Robust Glaucoma Classification in Color
Fundus Images [0.0]
Glaucoma is one of the most severe eye diseases, characterized by rapid progression and leading to irreversible blindness.
Regular glaucoma screenings of the population shall improve early-stage detection, however the desirable frequency of etymological checkups is often not feasible.
In our work, we propose an advanced image pre-processing technique combined with an ensemble of deep classification networks.
arXiv Detail & Related papers (2022-05-25T16:48:00Z) - An Interpretable Multiple-Instance Approach for the Detection of
referable Diabetic Retinopathy from Fundus Images [72.94446225783697]
We propose a machine learning system for the detection of referable Diabetic Retinopathy in fundus images.
By extracting local information from image patches and combining it efficiently through an attention mechanism, our system is able to achieve high classification accuracy.
We evaluate our approach on publicly available retinal image datasets, in which it exhibits near state-of-the-art performance.
arXiv Detail & Related papers (2021-03-02T13:14:15Z) - Modeling and Enhancing Low-quality Retinal Fundus Images [167.02325845822276]
Low-quality fundus images increase uncertainty in clinical observation and lead to the risk of misdiagnosis.
We propose a clinically oriented fundus enhancement network (cofe-Net) to suppress global degradation factors.
Experiments on both synthetic and real images demonstrate that our algorithm effectively corrects low-quality fundus images without losing retinal details.
arXiv Detail & Related papers (2020-05-12T08:01:16Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.