Early detection of disease outbreaks and non-outbreaks using incidence data
- URL: http://arxiv.org/abs/2404.08893v1
- Date: Sat, 13 Apr 2024 03:57:14 GMT
- Title: Early detection of disease outbreaks and non-outbreaks using incidence data
- Authors: Shan Gao, Amit K. Chakraborty, Russell Greiner, Mark A. Lewis, Hao Wang,
- Abstract summary: We develop a general model, with no real-world training data, that accurately forecasts outbreaks and non-outbreaks.
We show that there are statistical features that distinguish outbreak and non-outbreak sequences long before outbreaks occur.
We could detect these differences in synthetic and real-world data sets, well before potential outbreaks occur.
- Score: 9.155744274374506
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Forecasting the occurrence and absence of novel disease outbreaks is essential for disease management. Here, we develop a general model, with no real-world training data, that accurately forecasts outbreaks and non-outbreaks. We propose a novel framework, using a feature-based time series classification method to forecast outbreaks and non-outbreaks. We tested our methods on synthetic data from a Susceptible-Infected-Recovered model for slowly changing, noisy disease dynamics. Outbreak sequences give a transcritical bifurcation within a specified future time window, whereas non-outbreak (null bifurcation) sequences do not. We identified incipient differences in time series of infectives leading to future outbreaks and non-outbreaks. These differences are reflected in 22 statistical features and 5 early warning signal indicators. Classifier performance, given by the area under the receiver-operating curve, ranged from 0.99 for large expanding windows of training data to 0.7 for small rolling windows. Real-world performances of classifiers were tested on two empirical datasets, COVID-19 data from Singapore and SARS data from Hong Kong, with two classifiers exhibiting high accuracy. In summary, we showed that there are statistical features that distinguish outbreak and non-outbreak sequences long before outbreaks occur. We could detect these differences in synthetic and real-world data sets, well before potential outbreaks occur.
Related papers
- Conditional Score-Based Diffusion Model for Cortical Thickness
Trajectory Prediction [29.415616701032604]
Alzheimer's Disease (AD) is a neurodegenerative condition characterized by diverse progression rates among individuals.
We propose a conditional score-based diffusion model to generate CTh trajectories with the given baseline information.
Our model has a near-zero bias with narrow confidential 95% interval compared to the ground-truth CTh in 6-36 months.
arXiv Detail & Related papers (2024-03-11T17:26:18Z) - Harmful algal bloom forecasting. A comparison between stream and batch
learning [0.7067443325368975]
Harmful Algal Blooms (HABs) pose risks to public health and the shellfish industry.
This study develops a machine learning workflow for predicting the number of cells of a toxic dinoflagellate.
The model DoME emerged as the most effective and interpretable predictor, outperforming the other algorithms.
arXiv Detail & Related papers (2024-02-20T15:01:11Z) - CenTime: Event-Conditional Modelling of Censoring in Survival Analysis [49.44664144472712]
We introduce CenTime, a novel approach to survival analysis that directly estimates the time to event.
Our method features an innovative event-conditional censoring mechanism that performs robustly even when uncensored data is scarce.
Our results indicate that CenTime offers state-of-the-art performance in predicting time-to-death while maintaining comparable ranking performance.
arXiv Detail & Related papers (2023-09-07T17:07:33Z) - Nonparametric Identifiability of Causal Representations from Unknown
Interventions [63.1354734978244]
We study causal representation learning, the task of inferring latent causal variables and their causal relations from mixtures of the variables.
Our goal is to identify both the ground truth latents and their causal graph up to a set of ambiguities which we show to be irresolvable from interventional data.
arXiv Detail & Related papers (2023-06-01T10:51:58Z) - Tracking the risk of a deployed model and detecting harmful distribution
shifts [105.27463615756733]
In practice, it may make sense to ignore benign shifts, under which the performance of a deployed model does not degrade substantially.
We argue that a sensible method for firing off a warning has to both (a) detect harmful shifts while ignoring benign ones, and (b) allow continuous monitoring of model performance without increasing the false alarm rate.
arXiv Detail & Related papers (2021-10-12T17:21:41Z) - Modeling COVID-19 uncertainties evolving over time and density-dependent
social reinforcement and asymptomatic infections [35.57038204847526]
coronavirus disease 2019 (COVID-19) presents unique and unknown problem complexities and modeling challenges.
We introduce a novel hybrid approach to characterizing Undocumented (U) and Documented (D) infections commonly seen during COVID-19 incubation periods and asymptomatic infections.
arXiv Detail & Related papers (2021-08-23T09:38:54Z) - Clustering Left-Censored Multivariate Time-Series [2.4493299476776778]
We focus on mitigating the interference of left-censorship in the task of clustering.
We develop a deep generative, continuous-time model of time-series data that clusters while correcting for censorship time.
We demonstrate accurate, stable, and interpretable results on synthetic data.
arXiv Detail & Related papers (2021-02-13T21:22:40Z) - Forecasting Irreversible Disease via Progression Learning [18.13106500847306]
Parapapillary atrophy (PPA) is a symptom related to most irreversible eye diseases.
We propose a novel framework, namely textbfDisease textbfForecast via textbfProgression textbfLearning (textbfDFPL)
arXiv Detail & Related papers (2020-12-21T03:54:35Z) - STELAR: Spatio-temporal Tensor Factorization with Latent Epidemiological
Regularization [76.57716281104938]
We develop a tensor method to predict the evolution of epidemic trends for many regions simultaneously.
STELAR enables long-term prediction by incorporating latent temporal regularization through a system of discrete-time difference equations.
We conduct experiments using both county- and state-level COVID-19 data and show that our model can identify interesting latent patterns of the epidemic.
arXiv Detail & Related papers (2020-12-08T21:21:47Z) - The Hidden Uncertainty in a Neural Networks Activations [105.4223982696279]
The distribution of a neural network's latent representations has been successfully used to detect out-of-distribution (OOD) data.
This work investigates whether this distribution correlates with a model's epistemic uncertainty, thus indicating its ability to generalise to novel inputs.
arXiv Detail & Related papers (2020-12-05T17:30:35Z) - Tracking disease outbreaks from sparse data with Bayesian inference [55.82986443159948]
The COVID-19 pandemic provides new motivation for estimating the empirical rate of transmission during an outbreak.
Standard methods struggle to accommodate the partial observability and sparse data common at finer scales.
We propose a Bayesian framework which accommodates partial observability in a principled manner.
arXiv Detail & Related papers (2020-09-12T20:37:33Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.