Pre-training Epidemic Time Series Forecasters with Compartmental Prototypes
- URL: http://arxiv.org/abs/2502.03393v4
- Date: Fri, 26 Sep 2025 20:37:12 GMT
- Title: Pre-training Epidemic Time Series Forecasters with Compartmental Prototypes
- Authors: Zewen Liu, Juntong Ni, Max S. Y. Lau, Wei Jin
- Abstract summary: We propose CAPE, the first open-source pre-trained model for epidemic forecasting. Unlike existing time series foundation models that overlook epidemiological challenges, CAPE models epidemic dynamics as mixtures of latent population states. On a comprehensive benchmark spanning 17 diseases and 50+ regions, CAPE significantly outperforms strong baselines in zero-shot, few-shot, and full-shot forecasting.
- Score: 8.716953628182742
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Accurate epidemic forecasting is crucial for outbreak preparedness, but existing data-driven models are often brittle. Typically trained on a single pathogen, they struggle with data scarcity during new outbreaks and fail under distribution shifts caused by viral evolution or interventions. However, decades of surveillance data from diverse diseases offer an untapped source of transferable knowledge. To leverage the collective lessons from history, we propose CAPE, the first open-source pre-trained model for epidemic forecasting. Unlike existing time series foundation models that overlook epidemiological challenges, CAPE models epidemic dynamics as mixtures of latent population states, termed compartmental prototypes. It discovers a flexible dictionary of compartment prototypes directly from surveillance data, enabling each outbreak to be expressed as a time-varying mixture that links observed infections to latent population states. To promote robust generalization, CAPE combines self-supervised pre-training objectives with lightweight epidemic-aware regularizers that align the learned prototypes with epidemiological semantics. On a comprehensive benchmark spanning 17 diseases and 50+ regions, CAPE significantly outperforms strong baselines in zero-shot, few-shot, and full-shot forecasting. This work represents a principled step toward pre-trained epidemic models that are both transferable and epidemiologically grounded.
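The abstract describes each outbreak as a time-varying mixture over a dictionary of compartmental prototypes. The following is a minimal sketch of that general idea only, not the CAPE architecture: the function name `prototype_mixture`, the dot-product scoring, the softmax weighting, and all shapes are assumptions made for illustration, and the inputs are random placeholders rather than surveillance data.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax along the given axis.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def prototype_mixture(h, prototypes):
    """Hypothetical helper (not from the paper).

    h:          (T, d) per-time-step encodings of an observed series
    prototypes: (K, d) dictionary of prototype vectors
    Returns mixture weights (T, K) and mixed latent states (T, d).
    """
    d = h.shape[-1]
    # Assumed scoring rule: scaled dot product between encodings and prototypes.
    scores = h @ prototypes.T / np.sqrt(d)
    # Each time step gets its own weights, so the mixture varies over time.
    weights = softmax(scores, axis=-1)
    # Latent state at each step is a convex combination of the prototypes.
    states = weights @ prototypes
    return weights, states

# Random placeholder inputs, used only to exercise the shapes.
rng = np.random.default_rng(0)
T, K, d = 12, 4, 8
h = rng.normal(size=(T, d))
prototypes = rng.normal(size=(K, d))
weights, states = prototype_mixture(h, prototypes)
```

In this sketch every row of `weights` is non-negative and sums to one, so each time step is expressed as a convex combination of the `K` prototypes. How CAPE actually learns and regularizes its prototypes is described in the paper itself.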
Related papers
- Deep learning framework for predicting stochastic take-off and die-out of early spreading [3.3147247892604708]
Large-scale outbreaks pose significant threats to human society. The question of whether an emerging outbreak will escalate into a major epidemic or naturally die out remains largely unaddressed. Here, we introduce the first systematic framework for forecasting whether initial transmission events will amplify into major outbreaks.
arXiv Detail & Related papers (2025-10-06T08:18:47Z)
- Predictive Causal Inference via Spatio-Temporal Modeling and Penalized Empirical Likelihood [0.0]
This study introduces an integrated framework for predictive causal inference designed to overcome limitations in conventional single model approaches. Specifically, we combine a Hidden Markov Model for spatial health state estimation with a Multi Task and Multi Graph Convolutional Network (MTGCN) for capturing temporal outcome trajectories. To demonstrate its utility, we focus on clinical domains such as cancer, dementia, Parkinson disease, where treatment effects are challenging to observe directly.
arXiv Detail & Related papers (2025-07-11T03:11:15Z)
- Adaptable Cardiovascular Disease Risk Prediction from Heterogeneous Data using Large Language Models [70.64969663547703]
AdaCVD is an adaptable CVD risk prediction framework built on large language models extensively fine-tuned on over half a million participants from the UK Biobank. It addresses key clinical challenges across three dimensions: it flexibly incorporates comprehensive yet variable patient information; it seamlessly integrates both structured data and unstructured text; and it rapidly adapts to new patient populations using minimal additional data.
arXiv Detail & Related papers (2025-05-30T14:42:02Z)
- Epidemic-guided deep learning for spatiotemporal forecasting of Tuberculosis outbreak [0.0]
Tuberculosis (TB) remains a formidable global health challenge, driven by complex transmission dynamics and influenced by factors such as population mobility and behavioral changes.
We propose an Epidemic-Guided Deep Learning approach that fuses mechanistic epidemiological principles with advanced deep learning techniques.
arXiv Detail & Related papers (2025-02-15T12:39:42Z)
- Towards the efficacy of federated prediction for epidemics on networks [5.719600613434056]
We develop a general privacy-level framework for node-level prediction on epidemic networks based on federated learning (FL). This study offers practical insights into the efficacy of the FL scenario in epidemic management.
arXiv Detail & Related papers (2024-12-03T04:37:28Z)
- A Multilateral Attention-enhanced Deep Neural Network for Disease Outbreak Forecasting: A Case Study on COVID-19 [0.6874745415692134]
We propose a novel approach to address the challenges of infectious disease forecasting.
We introduce a Multilateral Attention-enhanced GRU model that leverages information from multiple sources.
By incorporating attention mechanisms within a GRU framework, our model can effectively capture complex relationships and temporal dependencies in the data.
arXiv Detail & Related papers (2024-08-26T06:31:53Z)
- Deep State-Space Generative Model For Correlated Time-to-Event Predictions [54.3637600983898]
We propose a deep latent state-space generative model to capture the interactions among different types of correlated clinical events.
Our method also uncovers meaningful insights about the latent correlations among mortality and different types of organ failures.
arXiv Detail & Related papers (2024-07-28T02:42:36Z)
- Modeling, Inference, and Prediction in Mobility-Based Compartmental Models for Epidemiology [5.079807662054658]
We introduce individual mobility as a key factor in disease transmission and control.
We characterize disease dynamics using mobility distribution functions for each compartment.
We infer mobility distributions from the time series of the infected population.
arXiv Detail & Related papers (2024-06-17T18:13:57Z)
- PEMS: Pre-trained Epidemic Time-series Models [23.897701882327972]
We introduce Pre-trained Epidemic Time-Series Models (PEMS).
PEMS learn from diverse time-series datasets of a variety of diseases by formulating pre-training as a set of self-supervised learning (SSL) tasks.
The resultant PEM outperforms previous state-of-the-art methods in various downstream time-series tasks across datasets of varying seasonal patterns, geography, and mechanism of contagion, including the novel Covid-19 pandemic unseen in pre-training data, and does so with better efficiency using a smaller fraction of the datasets.
arXiv Detail & Related papers (2023-11-14T01:40:21Z)
- Forecast reconciliation for vaccine supply chain optimization [61.13962963550403]
Vaccine supply chain optimization can benefit from hierarchical time series forecasting.
Forecasts of different hierarchy levels become incoherent when higher levels do not match the sum of the lower levels forecasts.
We tackle the vaccine sale forecasting problem by modeling sales data from GSK between 2010 and 2021 as a hierarchical time series.
arXiv Detail & Related papers (2023-05-02T14:34:34Z)
- SPADE4: Sparsity and Delay Embedding based Forecasting of Epidemics [2.578242050187029]
We propose Sparsity and Delay Embedding based Forecasting (SPADE4) for predicting epidemics.
We show that our approach outperforms compartmental models when applied to both simulated and real data.
arXiv Detail & Related papers (2022-11-11T23:39:48Z)
- Data-Centric Epidemic Forecasting: A Survey [56.99209141838794]
This survey delves into various data-driven methodological and practical advancements.
We enumerate the large number of epidemiological datasets and novel data streams that are relevant to epidemic forecasting.
We also discuss experiences and challenges that arise in real-world deployment of these forecasting systems.
arXiv Detail & Related papers (2022-07-19T16:15:11Z)
- Epicasting: An Ensemble Wavelet Neural Network (EWNet) for Forecasting Epidemics [2.705025060422369]
Infectious diseases remain among the top contributors to human illness and death worldwide.
Forecasts of epidemics can assist stakeholders in tailoring countermeasures to the situation at hand.
arXiv Detail & Related papers (2022-06-21T19:31:25Z)
- STELAR: Spatio-temporal Tensor Factorization with Latent Epidemiological Regularization [76.57716281104938]
We develop a tensor method to predict the evolution of epidemic trends for many regions simultaneously.
STELAR enables long-term prediction by incorporating latent temporal regularization through a system of discrete-time difference equations.
We conduct experiments using both county- and state-level COVID-19 data and show that our model can identify interesting latent patterns of the epidemic.
arXiv Detail & Related papers (2020-12-08T21:21:47Z)
- An Optimal Control Approach to Learning in SIDARTHE Epidemic model [67.22168759751541]
We propose a general approach for learning time-variant parameters of dynamic compartmental models from epidemic data.
We forecast the epidemic evolution in Italy and France.
arXiv Detail & Related papers (2020-10-28T10:58:59Z)
- UNITE: Uncertainty-based Health Risk Prediction Leveraging Multi-sourced Data [81.00385374948125]
We present the UNcertaInTy-based hEalth risk prediction (UNITE) model.
UNITE provides accurate disease risk prediction and uncertainty estimation leveraging multi-sourced health data.
We evaluate UNITE on real-world disease risk prediction tasks: nonalcoholic fatty liver disease (NASH) and Alzheimer's disease (AD).
UNITE achieves up to 0.841 in F1 score for AD detection, up to 0.609 in PR-AUC for NASH detection, and outperforms various state-of-the-art baselines by up to 19% over the best baseline.
arXiv Detail & Related papers (2020-10-22T02:28:11Z)
- Steering a Historical Disease Forecasting Model Under a Pandemic: Case of Flu and COVID-19 [75.99038202534628]
We propose CALI-Net, a neural transfer learning architecture which allows us to 'steer' a historical disease forecasting model to new scenarios where flu and COVID co-exist.
Our experiments demonstrate that our approach is successful in adapting a historical forecasting model to the current pandemic.
arXiv Detail & Related papers (2020-09-23T22:35:43Z)
- Individual Treatment Prescription Effect Estimation in a Low Compliance Setting [11.672067762133299]
Individual Treatment Effect estimation is an extensively researched problem, with applications in various domains.
We propose a new approach for the estimation of the IPE that takes advantage of observed compliance information to prevent signal fading.
We conduct experiments on both synthetic and real-world datasets that highlight the benefit of the approach, which consistently improves state-of-the-art in low compliance settings.
arXiv Detail & Related papers (2020-08-07T15:53:00Z)
- From predictions to prescriptions: A data-driven response to COVID-19 [42.57407485467993]
We propose a comprehensive data-driven approach to understand the clinical characteristics of COVID-19.
We build personalized calculators to predict the risk of infection and mortality.
We propose an optimization model to re-allocate ventilators and alleviate shortages.
arXiv Detail & Related papers (2020-06-30T03:34:00Z)
- A General Framework for Survival Analysis and Multi-State Modelling [70.31153478610229]
We use neural ordinary differential equations as a flexible and general method for estimating multi-state survival models.
We show that our model exhibits state-of-the-art performance on popular survival data sets and demonstrate its efficacy in a multi-state setting.
arXiv Detail & Related papers (2020-06-08T19:24:54Z)
- Predictive Modeling of ICU Healthcare-Associated Infections from Imbalanced Data. Using Ensembles and a Clustering-Based Undersampling Approach [55.41644538483948]
This work is focused on both the identification of risk factors and the prediction of healthcare-associated infections in intensive-care units.
The aim is to support decision making addressed at reducing the incidence rate of infections.
arXiv Detail & Related papers (2020-05-07T16:13:12Z)
This list is automatically generated from the titles and abstracts of the papers on this site.