Disease Progression and Subtype Modeling for Combined Discrete and Continuous Input Data
- URL: http://arxiv.org/abs/2602.22018v1
- Date: Wed, 25 Feb 2026 15:31:30 GMT
- Title: Disease Progression and Subtype Modeling for Combined Discrete and Continuous Input Data
- Authors: Sterre de Jonge, Elisabeth J. Vinke, Meike W. Vernooij, Daniel C. Alexander, Alexandra L. Young, Esther E. Bron,
- Abstract summary: We propose a novel disease progression model that handles both discrete and continuous data types.<n>We demonstrate the effectiveness of Mixed-SuStaIn through simulation experiments and real-world data from the Alzheimer's Disease Neuroimaging Initiative.
- Score: 37.93788591991097
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Disease progression modeling provides a robust framework to identify long-term disease trajectories from short-term biomarker data. It is a valuable tool to gain a deeper understanding of diseases with a long disease trajectory, such as Alzheimer's disease. A key limitation of most disease progression models is that they are specific to a single data type (e.g., continuous data), thereby limiting their applicability to heterogeneous, real-world datasets. To address this limitation, we propose the Mixed Events model, a novel disease progression model that handles both discrete and continuous data types. This model is implemented within the Subtype and Stage Inference (SuStaIn) framework, resulting in Mixed-SuStaIn, enabling subtype and progression modeling. We demonstrate the effectiveness of Mixed-SuStaIn through simulation experiments and real-world data from the Alzheimer's Disease Neuroimaging Initiative, showing that it performs well on mixed datasets. The code is available at: https://github.com/ucl-pond/pySuStaIn.
Related papers
- Bayesian Event-Based Model for Disease Subtype and Stage Inference [1.2318267573115809]
Subtype and Stage Inference Event-Based Model (SuStaIn) has been widely applied to uncover the subtypes of many diseases.<n>We develop a principled Bayesian subtype variant of the event-based model (BEBMS) and compare its performance to SuStaIn.<n>We find BEBMS has results that are more consistent with the scientific consensus of Alzheimer's disease progression than SuStaIn.
arXiv Detail & Related papers (2025-12-03T05:45:16Z) - MeDi: Metadata-Guided Diffusion Models for Mitigating Biases in Tumor Classification [13.350688594462214]
We propose a novel approach explicitly modeling such metadata into a generative Diffusion model framework (MeDi)<n>MeDi allows for a targeted augmentation of underrepresented subpopulations with synthetic data.<n>We experimentally show that MeDi generates high-quality histopathology images for unseen subpopulations in TCGA.
arXiv Detail & Related papers (2025-06-20T16:41:25Z) - Latent Drifting in Diffusion Models for Counterfactual Medical Image Synthesis [55.959002385347645]
Latent Drifting enables diffusion models to be conditioned for medical images fitted for the complex task of counterfactual image generation.<n>We evaluate our method on three public longitudinal benchmark datasets of brain MRI and chest X-rays for counterfactual image generation.
arXiv Detail & Related papers (2024-12-30T01:59:34Z) - Is Model Collapse Inevitable? Breaking the Curse of Recursion by Accumulating Real and Synthetic Data [49.73114504515852]
We show that replacing the original real data by each generation's synthetic data does indeed tend towards model collapse.
We demonstrate that accumulating the successive generations of synthetic data alongside the original real data avoids model collapse.
arXiv Detail & Related papers (2024-04-01T18:31:24Z) - Time-dependent Probabilistic Generative Models for Disease Progression [0.0]
We propose a Markovian generative model of treatments to model the irregular time intervals between medical events.
We use the Expectation-Maximization algorithm to learn the model, which is efficiently solved with a dynamic programming-based method.
arXiv Detail & Related papers (2023-11-15T21:00:00Z) - The effect of data augmentation and 3D-CNN depth on Alzheimer's Disease
detection [51.697248252191265]
This work summarizes and strictly observes best practices regarding data handling, experimental design, and model evaluation.
We focus on Alzheimer's Disease (AD) detection, which serves as a paradigmatic example of challenging problem in healthcare.
Within this framework, we train predictive 15 models, considering three different data augmentation strategies and five distinct 3D CNN architectures.
arXiv Detail & Related papers (2023-09-13T10:40:41Z) - Benchmarking Continuous Time Models for Predicting Multiple Sclerosis
Progression [46.394865849252696]
Multiple sclerosis is a disease that affects the brain and spinal cord, it can lead to severe disability and has no known cure.
In a recent paper it was shown that disease progression can be predicted effectively using performance outcome measures and demographic data.
We benchmark four continuous time models using a publicly available multiple sclerosis dataset.
We find that the best continuous model is often able to outperform the best benchmarked discrete time model.
arXiv Detail & Related papers (2023-02-15T18:45:32Z) - Temporal Clustering with External Memory Network for Disease Progression
Modeling [8.015263440307631]
Disease progression modeling (DPM) involves using mathematical frameworks to quantitatively measure the severity of how certain disease progresses.
DPM is useful in many ways such as predicting health state, categorizing disease stages, and assessing patients disease trajectory etc.
arXiv Detail & Related papers (2021-09-29T02:32:06Z) - Contrastive Model Inversion for Data-Free Knowledge Distillation [60.08025054715192]
We propose Contrastive Model Inversion, where the data diversity is explicitly modeled as an optimizable objective.
Our main observation is that, under the constraint of the same amount of data, higher data diversity usually indicates stronger instance discrimination.
Experiments on CIFAR-10, CIFAR-100, and Tiny-ImageNet demonstrate that CMI achieves significantly superior performance when the generated data are used for knowledge distillation.
arXiv Detail & Related papers (2021-05-18T15:13:00Z) - Learning transition times in event sequences: the Event-Based Hidden
Markov Model of disease progression [4.12857285066818]
We connect ideas from event-based and hidden Markov modelling to derive a new generative model of disease progression.
Our model can infer the most likely group-level sequence and timing of events from limited datasets.
We use clinical, imaging and biofluid data from the Alzheimer's Disease Neuroimaging Initiative to demonstrate the validity and utility of our model.
arXiv Detail & Related papers (2020-11-02T15:13:03Z) - Select-ProtoNet: Learning to Select for Few-Shot Disease Subtype
Prediction [55.94378672172967]
We focus on few-shot disease subtype prediction problem, identifying subgroups of similar patients.
We introduce meta learning techniques to develop a new model, which can extract the common experience or knowledge from interrelated clinical tasks.
Our new model is built upon a carefully designed meta-learner, called Prototypical Network, that is a simple yet effective meta learning machine for few-shot image classification.
arXiv Detail & Related papers (2020-09-02T02:50:30Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.