Related papers: L-MAE: Longitudinal masked auto-encoder with time and severity-aware encoding for diabetic retinopathy progression prediction

L-MAE: Longitudinal masked auto-encoder with time and severity-aware encoding for diabetic retinopathy progression prediction

URL: http://arxiv.org/abs/2403.16272v1
Date: Sun, 24 Mar 2024 19:34:33 GMT
Title: L-MAE: Longitudinal masked auto-encoder with time and severity-aware encoding for diabetic retinopathy progression prediction
Authors: Rachid Zeghlache, Pierre-Henri Conze, Mostafa El Habib Daho, Yihao Li, Alireza Rezaei, Hugo Le Boité, Ramin Tadayoni, Pascal Massin, Béatrice Cochener, Ikram Brahim, Gwenolé Quellec, Mathieu Lamard,
Abstract summary: Pre-training strategies based on self-supervised learning (SSL) have proven to be effective pretext tasks for many downstream tasks in computer vision. We developed a longitudinal masked auto-encoder (MAE) based on the well-known Transformer-based MAE. Using OPHDIAT, a large follow-up screening dataset targeting diabetic retinopathy (DR), we evaluated the pre-trained weights on a longitudinal task.
Score: 2.663690023739801
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Pre-training strategies based on self-supervised learning (SSL) have proven to be effective pretext tasks for many downstream tasks in computer vision. Due to the significant disparity between medical and natural images, the application of typical SSL is not straightforward in medical imaging. Additionally, those pretext tasks often lack context, which is critical for computer-aided clinical decision support. In this paper, we developed a longitudinal masked auto-encoder (MAE) based on the well-known Transformer-based MAE. In particular, we explored the importance of time-aware position embedding as well as disease progression-aware masking. Taking into account the time between examinations instead of just scheduling them offers the benefit of capturing temporal changes and trends. The masking strategy, for its part, evolves during follow-up to better capture pathological changes, ensuring a more accurate assessment of disease progression. Using OPHDIAT, a large follow-up screening dataset targeting diabetic retinopathy (DR), we evaluated the pre-trained weights on a longitudinal task, which is to predict the severity label of the next visit within 3 years based on the past time series examinations. Our results demonstrated the relevancy of both time-aware position embedding and masking strategies based on disease progression knowledge. Compared to popular baseline models and standard longitudinal Transformers, these simple yet effective extensions significantly enhance the predictive ability of deep classification models.

Related papers

A CNN-Transformer for Classification of Longitudinal 3D MRI Images -- A Case Study on Hepatocellular Carcinoma Prediction [0.0]
HCCNet is a novel model architecture that integrates a 3D adaptation of the ConvNeXt CNN architecture with a Transformer encoder. Our results show that HCCNet significantly improves predictive accuracy and reliability over baseline models.
arXiv Detail & Related papers (2025-01-18T11:39:46Z)
Extrapolating Prospective Glaucoma Fundus Images through Diffusion Model in Irregular Longitudinal Sequences [46.80977922491862]
The utilization of longitudinal datasets for glaucoma progression prediction offers a compelling approach to support early therapeutic interventions. We propose a novel diffusion-based model to predict prospective images by extrapolating from existing longitudinal fundus images of patients.
arXiv Detail & Related papers (2024-10-28T15:31:47Z)
Early Prediction of Causes (not Effects) in Healthcare by Long-Term Clinical Time Series Forecasting [11.96384267146423]
We propose to directly predict the causes via time series forecasting (TSF) of clinical variables. Because model training does not rely on a particular label anymore, the forecasted data can be used to predict any consensus-based label.
arXiv Detail & Related papers (2024-08-07T14:52:06Z)
Deep Learning to Predict Glaucoma Progression using Structural Changes in the Eye [0.20718016474717196]
Glaucoma is a chronic eye disease characterized by optic neuropathy, leading to irreversible vision loss. Early detection is crucial to monitor atrophy and develop treatment strategies to prevent further vision impairment. In this study, we use deep learning models to identify complex disease traits and progression criteria.
arXiv Detail & Related papers (2024-06-09T01:12:41Z)
Harnessing the power of longitudinal medical imaging for eye disease prognosis using Transformer-based sequence modeling [49.52787013516891]
Our proposed Longitudinal Transformer for Survival Analysis (LTSA) enables dynamic disease prognosis from longitudinal medical imaging. A temporal attention analysis also suggested that, while the most recent image is typically the most influential, prior imaging still provides additional prognostic value.
arXiv Detail & Related papers (2024-05-14T17:15:28Z)
Bidirectional Generative Pre-training for Improving Healthcare Time-series Representation Learning [9.621781933666844]
We propose a novel architecture called BiTimely Generative Pre-trained Transformer (BiTimelyGPT) BiTimelyGPT pre-trains on biosignals and longitudinal clinical records by both next-token and previous-token prediction in alternating transformer layers. Using biosignals and longitudinal clinical records, BiTimelyGPT demonstrates superior performance in predicting neurological functionality, disease diagnosis, and physiological signs.
arXiv Detail & Related papers (2024-02-14T20:19:24Z)
LMT: Longitudinal Mixing Training, a Framework to Predict Disease Progression from a Single Image [1.805673949640389]
We introduce a new way to train time-aware models using $t_mix$, a weighted average time between two consecutive examinations. We predict whether an eye would develop a severe DR in the following visit using a single image, with an AUC of 0.798 compared to baseline results of 0.641.
arXiv Detail & Related papers (2023-10-16T14:01:20Z)
LATTE: Label-efficient Incident Phenotyping from Longitudinal Electronic Health Records [11.408950540503112]
We propose a LAbel-efficienT incidenT phEnotyping algorithm to accurately annotate the timing of clinical events from longitudinal EHR data. LATTE is evaluated on three analyses: the onset of type-2 diabetes, heart failure, and the onset and relapses of multiple sclerosis.
arXiv Detail & Related papers (2023-05-19T03:28:51Z)
Safe AI for health and beyond -- Monitoring to transform a health service [51.8524501805308]
We will assess the infrastructure required to monitor the outputs of a machine learning algorithm. We will present two scenarios with examples of monitoring and updates of models.
arXiv Detail & Related papers (2023-03-02T17:27:45Z)
On the Robustness of Pretraining and Self-Supervision for a Deep Learning-based Analysis of Diabetic Retinopathy [70.71457102672545]
We compare the impact of different training procedures for diabetic retinopathy grading. We investigate different aspects such as quantitative performance, statistics of the learned feature representations, interpretability and robustness to image distortions. Our results indicate that models from ImageNet pretraining report a significant increase in performance, generalization and robustness to image distortions.
arXiv Detail & Related papers (2021-06-25T08:32:45Z)
BiteNet: Bidirectional Temporal Encoder Network to Predict Medical Outcomes [53.163089893876645]
We propose a novel self-attention mechanism that captures the contextual dependency and temporal relationships within a patient's healthcare journey. An end-to-end bidirectional temporal encoder network (BiteNet) then learns representations of the patient's journeys. We have evaluated the effectiveness of our methods on two supervised prediction and two unsupervised clustering tasks with a real-world EHR dataset.
arXiv Detail & Related papers (2020-09-24T00:42:36Z)
Retinopathy of Prematurity Stage Diagnosis Using Object Segmentation and Convolutional Neural Networks [68.96150598294072]
Retinopathy of Prematurity (ROP) is an eye disorder primarily affecting premature infants with lower weights. It causes proliferation of vessels in the retina and could result in vision loss and, eventually, retinal detachment, leading to blindness. In recent years, there has been a significant effort to automate the diagnosis using deep learning. This paper builds upon the success of previous models and develops a novel architecture, which combines object segmentation and convolutional neural networks (CNN) Our proposed system first trains an object segmentation model to identify the demarcation line at a pixel level and adds the resulting mask as an additional "color" channel in
arXiv Detail & Related papers (2020-04-03T14:07:41Z)

This list is automatically generated from the titles and abstracts of the papers in this site.