Cross-Country Learning for National Infectious Disease Forecasting Using European Data
- URL: http://arxiv.org/abs/2601.20771v1
- Date: Wed, 28 Jan 2026 16:57:51 GMT
- Title: Cross-Country Learning for National Infectious Disease Forecasting Using European Data
- Authors: Zacharias Komodromos, Kleanthis Malialis, Artemis Kontou, Panayiotis Kolios,
- Abstract summary: We investigate a cross-country learning approach for infectious disease forecasting, in which a single model is trained on time series data from multiple countries.<n>We examine this approach through a case study on COVID-19 case forecasting in Cyprus, using surveillance data from European countries.<n>Our results show that incorporating data from other countries can lead to consistent improvements over models trained solely on national data.
- Score: 2.22825081715888
- License: http://creativecommons.org/licenses/by-sa/4.0/
- Abstract: Accurate forecasting of infectious disease incidence is critical for public health planning and timely intervention. While most data-driven forecasting approaches rely primarily on historical data from a single country, such data are often limited in length and variability, restricting the performance of machine learning (ML) models. In this work, we investigate a cross-country learning approach for infectious disease forecasting, in which a single model is trained on time series data from multiple countries and evaluated on a country of interest. This setting enables the model to exploit shared epidemic dynamics across countries and to benefit from an enlarged training set. We examine this approach through a case study on COVID-19 case forecasting in Cyprus, using surveillance data from European countries. We evaluate multiple ML models and analyse the impact of the lookback window length and cross-country `data augmentation' on multi-step forecasting performance. Our results show that incorporating data from other countries can lead to consistent improvements over models trained solely on national data. Although the empirical focus is on Cyprus and COVID-19, the proposed framework and findings are applicable to infectious disease forecasting more broadly, particularly in settings with limited national historical data.
Related papers
- A Multilateral Attention-enhanced Deep Neural Network for Disease Outbreak Forecasting: A Case Study on COVID-19 [0.6874745415692134]
We propose a novel approach to address the challenges of infectious disease forecasting.
We introduce a Multilateral Attention-enhanced GRU model that leverages information from multiple sources.
By incorporating attention mechanisms within a GRU framework, our model can effectively capture complex relationships and temporal dependencies in the data.
arXiv Detail & Related papers (2024-08-26T06:31:53Z) - Deep State-Space Generative Model For Correlated Time-to-Event Predictions [54.3637600983898]
We propose a deep latent state-space generative model to capture the interactions among different types of correlated clinical events.
Our method also uncovers meaningful insights about the latent correlations among mortality and different types of organ failures.
arXiv Detail & Related papers (2024-07-28T02:42:36Z) - Using Pre-training and Interaction Modeling for ancestry-specific disease prediction in UK Biobank [69.90493129893112]
Recent genome-wide association studies (GWAS) have uncovered the genetic basis of complex traits, but show an under-representation of non-European descent individuals.
Here, we assess whether we can improve disease prediction across diverse ancestries using multiomic data.
arXiv Detail & Related papers (2024-04-26T16:39:50Z) - MedDiffusion: Boosting Health Risk Prediction via Diffusion-based Data
Augmentation [58.93221876843639]
This paper introduces a novel, end-to-end diffusion-based risk prediction model, named MedDiffusion.
It enhances risk prediction performance by creating synthetic patient data during training to enlarge sample space.
It discerns hidden relationships between patient visits using a step-wise attention mechanism, enabling the model to automatically retain the most vital information for generating high-quality data.
arXiv Detail & Related papers (2023-10-04T01:36:30Z) - Measuring Causal Effects of Data Statistics on Language Model's
`Factual' Predictions [59.284907093349425]
Large amounts of training data are one of the major reasons for the high performance of state-of-the-art NLP models.
We provide a language for describing how training data influences predictions, through a causal framework.
Our framework bypasses the need to retrain expensive models and allows us to estimate causal effects based on observational data alone.
arXiv Detail & Related papers (2022-07-28T17:36:24Z) - Strict baselines for Covid-19 forecasting and ML perspective for USA and
Russia [105.54048699217668]
Covid-19 allows researchers to gather datasets accumulated over 2 years and to use them in predictive analysis.
We present the results of a consistent comparative study of different types of methods for predicting the dynamics of the spread of Covid-19 based on regional data for two countries: the United States and Russia.
arXiv Detail & Related papers (2022-07-15T18:21:36Z) - A spatiotemporal machine learning approach to forecasting COVID-19
incidence at the county level in the United States [2.9822184411723645]
We present COVID-LSTM, a data-driven model based on a Long Short-term memory architecture for forecasting COVID-19 incidence at the county-level in the US.
We use the weekly number of new cases as temporal input, and hand-engineered spatial features from Facebook to capture the spread of the disease in time and space.
Over the 4-week forecast horizon, our model is on average 50 cases per county more accurate than the COVIDhub-ensemble.
arXiv Detail & Related papers (2021-09-24T17:40:08Z) - Deep learning via LSTM models for COVID-19 infection forecasting in
India [13.163271874039191]
Prominent computational and mathematical models have been unreliable due to the complexity of the spread of infections.
Deep learning models such as recurrent neural networks are well suited for modelling temporal sequences.
We select states with COVID-19 hotpots in terms of the rate of infections and compare with states where infections have been contained or reached their peak.
Our results show that long-term forecasts are promising which motivates the application of the method in other countries or areas.
arXiv Detail & Related papers (2021-01-28T09:19:10Z) - When and How to Lift the Lockdown? Global COVID-19 Scenario Analysis and
Policy Assessment using Compartmental Gaussian Processes [111.69190108272133]
coronavirus disease 2019 (COVID-19) global pandemic has led many countries to impose unprecedented lockdown measures.
Data-driven models that predict COVID-19 fatalities under different lockdown policy scenarios are essential.
This paper develops a Bayesian model for predicting the effects of COVID-19 lockdown policies in a global context.
arXiv Detail & Related papers (2020-05-13T18:21:50Z) - An Epidemiological Modelling Approach for Covid19 via Data Assimilation [18.837659009007705]
The global pandemic of the 2019-nCov requires the evaluation of policy interventions to mitigate future social and economic costs of quarantine measures worldwide.
We propose an epidemiological model for forecasting and policy evaluation which incorporates new data in real-time through variational data assimilation.
arXiv Detail & Related papers (2020-04-25T12:46:36Z) - Learning to Forecast and Forecasting to Learn from the COVID-19 Pandemic [10.796851110372593]
We propose a heterogeneous infection rate model with human mobility for epidemic modeling.
By linearizing the model and using weighted least squares, our model is able to quickly adapt to changing trends.
We show that during the earlier part of the epidemic, using travel data increases the predictions.
arXiv Detail & Related papers (2020-04-23T07:25:46Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.