CrossLag: Predicting Major Dengue Outbreaks with a Domain Knowledge Informed Transformer
- URL: http://arxiv.org/abs/2510.03566v1
- Date: Fri, 03 Oct 2025 23:30:31 GMT
- Title: CrossLag: Predicting Major Dengue Outbreaks with a Domain Knowledge Informed Transformer
- Authors: Ashwin Prabu, Nhat Thanh Tran, Guofa Zhou, Jack Xin,
- Abstract summary: We introduce CrossLag, an environmentally informed attention that allows for the incorporation of lagging endogenous signals behind the significant events in the data.<n>Our proposed model outperforms TimeXer by a considerable margin in detecting and predicting major outbreaks in Singapore dengue data over a 24-week prediction window.
- Score: 1.9132624817489867
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: A variety of models have been developed to forecast dengue cases to date. However, it remains a challenge to predict major dengue outbreaks that need timely public warnings the most. In this paper, we introduce CrossLag, an environmentally informed attention that allows for the incorporation of lagging endogenous signals behind the significant events in the exogenous data into the architecture of the transformer at low parameter counts. Outbreaks typically lag behind major changes in climate and oceanic anomalies. We use TimeXer, a recent general-purpose transformer distinguishing exogenous-endogenous inputs, as the baseline for this study. Our proposed model outperforms TimeXer by a considerable margin in detecting and predicting major outbreaks in Singapore dengue data over a 24-week prediction window.
Related papers
- Transformer Model for Alzheimer's Disease Progression Prediction Using Longitudinal Visit Sequences [0.032771631221674334]
Alzheimer's disease (AD) is a neurodegenerative disorder with no known cure that affects tens of millions of people worldwide.<n>We propose a Transformer model for predicting the stage of AD progression at a subject's next clinical visit using features from a sequence of visits extracted from the subject's visit history.
arXiv Detail & Related papers (2025-07-05T04:35:04Z) - Generating Fine-Grained Causality in Climate Time Series Data for Forecasting and Anomaly Detection [67.40407388422514]
We design a conceptual fine-grained causal model named TBN Granger Causality.
Second, we propose an end-to-end deep generative model called TacSas, which discovers TBN Granger Causality in a generative manner.
We test TacSas on climate benchmark ERA5 for climate forecasting and the extreme weather benchmark of NOAA for extreme weather alerts.
arXiv Detail & Related papers (2024-08-08T06:47:21Z) - Early detection of disease outbreaks and non-outbreaks using incidence data [9.155744274374506]
We develop a general model, with no real-world training data, that accurately forecasts outbreaks and non-outbreaks.
We show that there are statistical features that distinguish outbreak and non-outbreak sequences long before outbreaks occur.
We could detect these differences in synthetic and real-world data sets, well before potential outbreaks occur.
arXiv Detail & Related papers (2024-04-13T03:57:14Z) - FWin transformer for dengue prediction under climate and ocean influence [1.6114012813668932]
Dengue fever is one of the most deadly mosquito-born tropical infectious diseases.
In this study, we examine methods used to forecast dengue cases for long range predictions.
arXiv Detail & Related papers (2024-03-10T19:20:55Z) - TimeXer: Empowering Transformers for Time Series Forecasting with Exogenous Variables [75.83318701911274]
TimeXer ingests external information to enhance the forecasting of endogenous variables.
TimeXer achieves consistent state-of-the-art performance on twelve real-world forecasting benchmarks.
arXiv Detail & Related papers (2024-02-29T11:54:35Z) - Performative Time-Series Forecasting [64.03865043422597]
We formalize performative time-series forecasting (PeTS) from a machine-learning perspective.<n>We propose a novel approach, Feature Performative-Shifting (FPS), which leverages the concept of delayed response to anticipate distribution shifts.<n>We conduct comprehensive experiments using multiple time-series models on COVID-19 and traffic forecasting tasks.
arXiv Detail & Related papers (2023-10-09T18:34:29Z) - Long-term drought prediction using deep neural networks based on geospatial weather data [75.38539438000072]
High-quality drought forecasting up to a year in advance is critical for agriculture planning and insurance.
We tackle drought data by introducing an end-to-end approach that adopts a systematic end-to-end approach.
Key findings are the exceptional performance of a Transformer model, EarthFormer, in making accurate short-term (up to six months) forecasts.
arXiv Detail & Related papers (2023-09-12T13:28:06Z) - A spatiotemporal machine learning approach to forecasting COVID-19
incidence at the county level in the United States [2.9822184411723645]
We present COVID-LSTM, a data-driven model based on a Long Short-term memory architecture for forecasting COVID-19 incidence at the county-level in the US.
We use the weekly number of new cases as temporal input, and hand-engineered spatial features from Facebook to capture the spread of the disease in time and space.
Over the 4-week forecast horizon, our model is on average 50 cases per county more accurate than the COVIDhub-ensemble.
arXiv Detail & Related papers (2021-09-24T17:40:08Z) - Early Detection of COVID-19 Hotspots Using Spatio-Temporal Data [66.70036251870988]
The Centers for Disease Control and Prevention (CDC) has worked with other federal agencies to identify counties with increasing coronavirus 2019 (CO-19) incidence (hotspots)
This paper presents a sparse model for early detection of COVID-19 hotspots (at the county level) in the United States.
Deep neural networks are introduced to enhance the model's representative power while still enjoying the interpretability of the kernel.
arXiv Detail & Related papers (2021-05-31T19:28:17Z) - Gaussian Process Nowcasting: Application to COVID-19 Mortality Reporting [2.8712862578745018]
Updating observations of a signal due to the delays in the measurement process is a common problem in signal processing.
We present a flexible approach using a latent Gaussian process that is capable of describing the changing auto-correlation structure present in the reporting time-delay surface.
This approach also yields robust estimates of uncertainty for the estimated nowcasted numbers of deaths.
arXiv Detail & Related papers (2021-02-22T18:32:44Z) - STELAR: Spatio-temporal Tensor Factorization with Latent Epidemiological
Regularization [76.57716281104938]
We develop a tensor method to predict the evolution of epidemic trends for many regions simultaneously.
STELAR enables long-term prediction by incorporating latent temporal regularization through a system of discrete-time difference equations.
We conduct experiments using both county- and state-level COVID-19 data and show that our model can identify interesting latent patterns of the epidemic.
arXiv Detail & Related papers (2020-12-08T21:21:47Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.