Data-driven Identification of Number of Unreported Cases for COVID-19:
Bounds and Limitations
- URL: http://arxiv.org/abs/2006.02127v5
- Date: Thu, 9 Jul 2020 04:17:27 GMT
- Title: Data-driven Identification of Number of Unreported Cases for COVID-19:
Bounds and Limitations
- Authors: Ajitesh Srivastava and Viktor K. Prasanna
- Abstract summary: A critical factor that can hinder accurate long-term forecasts, is the number of unreported/asymptomatic cases.
We show that we can identify lower bounds on this ratio or upper bound on actual cases as a factor of reported cases.
We prove that the number of unreported cases can be reliably estimated only from a certain time period.
- Score: 10.796851110372593
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Accurate forecasts for COVID-19 are necessary for better preparedness and
resource management. Specifically, deciding the response over months or several
months requires accurate long-term forecasts which is particularly challenging
as the model errors accumulate with time. A critical factor that can hinder
accurate long-term forecasts, is the number of unreported/asymptomatic cases.
While there have been early serology tests to estimate this number, more tests
need to be conducted for more reliable results. To identify the number of
unreported/asymptomatic cases, we take an epidemiology data-driven approach. We
show that we can identify lower bounds on this ratio or upper bound on actual
cases as a factor of reported cases. To do so, we propose an extension of our
prior heterogeneous infection rate model, incorporating unreported/asymptomatic
cases. We prove that the number of unreported cases can be reliably estimated
only from a certain time period of the epidemic data. In doing so, we construct
an algorithm called Fixed Infection Rate method, which identifies a reliable
bound on the learned ratio. We also propose two heuristics to learn this ratio
and show their effectiveness on simulated data. We use our approaches to
identify the upper bounds on the ratio of actual to reported cases for New York
City and several US states. Our results demonstrate with high confidence that
the actual number of cases cannot be more than 35 times in New York, 40 times
in Illinois, 38 times in Massachusetts and 29 times in New Jersey, than the
reported cases.
Related papers
- CenTime: Event-Conditional Modelling of Censoring in Survival Analysis [49.44664144472712]
We introduce CenTime, a novel approach to survival analysis that directly estimates the time to event.
Our method features an innovative event-conditional censoring mechanism that performs robustly even when uncensored data is scarce.
Our results indicate that CenTime offers state-of-the-art performance in predicting time-to-death while maintaining comparable ranking performance.
arXiv Detail & Related papers (2023-09-07T17:07:33Z) - Statistical and Computational Phase Transitions in Group Testing [73.55361918807883]
We study the group testing problem where the goal is to identify a set of k infected individuals carrying a rare disease.
We consider two different simple random procedures for assigning individuals tests.
arXiv Detail & Related papers (2022-06-15T16:38:50Z) - Can Self Reported Symptoms Predict Daily COVID-19 Cases? [12.029443053416399]
We develop machine learning models to estimate the prevalence of COVID-19 using self-reported symptoms.
Our results indicate a lower error on the local models as opposed to the global model.
This work demonstrates that the models developed on crowd-sourced data, curated via online platforms, can complement the existing epidemiological surveillance infrastructure.
arXiv Detail & Related papers (2021-05-18T07:26:09Z) - Gaussian Process Nowcasting: Application to COVID-19 Mortality Reporting [2.8712862578745018]
Updating observations of a signal due to the delays in the measurement process is a common problem in signal processing.
We present a flexible approach using a latent Gaussian process that is capable of describing the changing auto-correlation structure present in the reporting time-delay surface.
This approach also yields robust estimates of uncertainty for the estimated nowcasted numbers of deaths.
arXiv Detail & Related papers (2021-02-22T18:32:44Z) - Comparative Analysis of Machine Learning Approaches to Analyze and
Predict the Covid-19 Outbreak [10.307715136465056]
We present a comparative analysis of various machine learning (ML) approaches in predicting the COVID-19 outbreak in the epidemiological domain.
The results reveal the advantages of ML algorithms for supporting decision making of evolving short term policies.
arXiv Detail & Related papers (2021-02-11T11:57:33Z) - Increasing the efficiency of randomized trial estimates via linear
adjustment for a prognostic score [59.75318183140857]
Estimating causal effects from randomized experiments is central to clinical research.
Most methods for historical borrowing achieve reductions in variance by sacrificing strict type-I error rate control.
arXiv Detail & Related papers (2020-12-17T21:10:10Z) - UNITE: Uncertainty-based Health Risk Prediction Leveraging Multi-sourced
Data [81.00385374948125]
We present UNcertaInTy-based hEalth risk prediction (UNITE) model.
UNITE provides accurate disease risk prediction and uncertainty estimation leveraging multi-sourced health data.
We evaluate UNITE on real-world disease risk prediction tasks: nonalcoholic fatty liver disease (NASH) and Alzheimer's disease (AD)
UNITE achieves up to 0.841 in F1 score for AD detection, up to 0.609 in PR-AUC for NASH detection, and outperforms various state-of-the-art baselines by up to $19%$ over the best baseline.
arXiv Detail & Related papers (2020-10-22T02:28:11Z) - Estimating COVID-19 cases and outbreaks on-stream through phone-calls [0.0]
We present an algorithm to estimate on-stream the number of COVID-19 cases using the data from telephone calls to a COVID-line.
We show how to use the algorithm to track on-stream the epidemic, and present the Early Outbreak Alarm to detect outbreaks in advance to lab results.
arXiv Detail & Related papers (2020-10-10T15:44:05Z) - Tracking disease outbreaks from sparse data with Bayesian inference [55.82986443159948]
The COVID-19 pandemic provides new motivation for estimating the empirical rate of transmission during an outbreak.
Standard methods struggle to accommodate the partial observability and sparse data common at finer scales.
We propose a Bayesian framework which accommodates partial observability in a principled manner.
arXiv Detail & Related papers (2020-09-12T20:37:33Z) - A General Framework for Survival Analysis and Multi-State Modelling [70.31153478610229]
We use neural ordinary differential equations as a flexible and general method for estimating multi-state survival models.
We show that our model exhibits state-of-the-art performance on popular survival data sets and demonstrate its efficacy in a multi-state setting.
arXiv Detail & Related papers (2020-06-08T19:24:54Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.