Strict baselines for Covid-19 forecasting and ML perspective for USA and
Russia
- URL: http://arxiv.org/abs/2207.07689v1
- Date: Fri, 15 Jul 2022 18:21:36 GMT
- Title: Strict baselines for Covid-19 forecasting and ML perspective for USA and
Russia
- Authors: Alexander G. Sboev, Nikolay A. Kudryshov, Ivan A. Moloshnikov, Saveliy
V. Zavertyaev, Aleksandr V. Naumov and Roman B. Rybka
- Abstract summary: Covid-19 allows researchers to gather datasets accumulated over 2 years and to use them in predictive analysis.
We present the results of a consistent comparative study of different types of methods for predicting the dynamics of the spread of Covid-19 based on regional data for two countries: the United States and Russia.
- Score: 105.54048699217668
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Currently, the evolution of Covid-19 allows researchers to gather the
datasets accumulated over 2 years and to use them in predictive analysis. In
turn, this makes it possible to assess the efficiency potential of more complex
predictive models, including neural networks with different forecast horizons.
In this paper, we present the results of a consistent comparative study of
different types of methods for predicting the dynamics of the spread of
Covid-19 based on regional data for two countries: the United States and
Russia. We used well-known statistical methods (e.g., Exponential Smoothing), a
"tomorrow-as-today" approach, as well as a set of classic machine learning
models trained on data from individual regions. Along with them, a neural
network model based on Long short-term memory (LSTM) layers was considered, the
training samples of which aggregate data from all regions of two countries: the
United States and Russia. Efficiency evaluation was carried out using
cross-validation according to the MAPE metric. It is shown that for complicated
periods characterized by a large increase in the number of confirmed daily
cases, the best results are shown by the LSTM model trained on all regions of
both countries, showing an average Mean Absolute Percentage Error (MAPE) of
18%, 30%, 37% for Russia and 31%, 41%, 50% for US for predictions at forecast
horizons of 14, 28, and 42 days, respectively.
Related papers
- A Meta-Learning Approach to Predicting Performance and Data Requirements [163.4412093478316]
We propose an approach to estimate the number of samples required for a model to reach a target performance.
We find that the power law, the de facto principle to estimate model performance, leads to large error when using a small dataset.
We introduce a novel piecewise power law (PPL) that handles the two data differently.
arXiv Detail & Related papers (2023-03-02T21:48:22Z) - A spatiotemporal machine learning approach to forecasting COVID-19
incidence at the county level in the United States [2.9822184411723645]
We present COVID-LSTM, a data-driven model based on a Long Short-term memory architecture for forecasting COVID-19 incidence at the county-level in the US.
We use the weekly number of new cases as temporal input, and hand-engineered spatial features from Facebook to capture the spread of the disease in time and space.
Over the 4-week forecast horizon, our model is on average 50 cases per county more accurate than the COVIDhub-ensemble.
arXiv Detail & Related papers (2021-09-24T17:40:08Z) - Improving COVID-19 Forecasting using eXogenous Variables [7.245000255986182]
We study the pandemic course in the United States by considering national and state levels data.
We propose and compare multiple time-series prediction techniques which incorporate auxiliary variables.
arXiv Detail & Related papers (2021-07-20T03:26:18Z) - Modeling the geospatial evolution of COVID-19 using spatio-temporal
convolutional sequence-to-sequence neural networks [48.7576911714538]
Portugal was the country in the world with the largest incidence rate, with 14-days incidence rates per 100,000 inhabitants in excess of 1000.
Despite its importance, accurate prediction of the geospatial evolution of COVID-19 remains a challenge.
arXiv Detail & Related papers (2021-05-06T15:24:00Z) - Deep learning-based COVID-19 pneumonia classification using chest CT
images: model generalizability [54.86482395312936]
Deep learning (DL) classification models were trained to identify COVID-19-positive patients on 3D computed tomography (CT) datasets from different countries.
We trained nine identical DL-based classification models by using combinations of the datasets with a 72% train, 8% validation, and 20% test data split.
The models trained on multiple datasets and evaluated on a test set from one of the datasets used for training performed better.
arXiv Detail & Related papers (2021-02-18T21:14:52Z) - STELAR: Spatio-temporal Tensor Factorization with Latent Epidemiological
Regularization [76.57716281104938]
We develop a tensor method to predict the evolution of epidemic trends for many regions simultaneously.
STELAR enables long-term prediction by incorporating latent temporal regularization through a system of discrete-time difference equations.
We conduct experiments using both county- and state-level COVID-19 data and show that our model can identify interesting latent patterns of the epidemic.
arXiv Detail & Related papers (2020-12-08T21:21:47Z) - Examining Deep Learning Models with Multiple Data Sources for COVID-19
Forecasting [10.052302234274256]
We design and analysis of deep learning-based models for COVID-19 forecasting.
We consider multiple sources such as COVID-19 confirmed and death case count data and testing data for better predictions.
We propose clustering-based training for high-temporal forecasting.
arXiv Detail & Related papers (2020-10-27T17:52:02Z) - Adaptive County Level COVID-19 Forecast Models: Analysis and Improvement [1.8899300124593645]
We adapt the state and county level model, TDEFSI-LONLY, to national and county level COVID-19 data.
We show that this model poorly forecasts the current pandemic.
We propose an alternate forecast model, it County Level Epidemiological Inference Recurrent Network (alg) that trains an LSTM backbone on national confirmed cases to learn a low dimensional time pattern.
arXiv Detail & Related papers (2020-06-16T17:20:54Z) - A machine learning methodology for real-time forecasting of the
2019-2020 COVID-19 outbreak using Internet searches, news alerts, and
estimates from mechanistic models [53.900779250589814]
Our method is able to produce stable and accurate forecasts 2 days ahead of current time.
Our model's predictive power outperforms a collection of baseline models in 27 out of the 32 Chinese provinces.
arXiv Detail & Related papers (2020-04-08T14:39:32Z) - COVID-19 forecasting based on an improved interior search algorithm and
multi-layer feed forward neural network [0.0]
A new forecasting model is presented to analyze and forecast the confirmed cases of COVID-19 for the coming days.
The ISACL-MFNN model integrates an improved interior search algorithm (ISA) based on chaotic learning (CL) strategy into a multi-layer feed-forward neural network (MFNN)
The proposed model is investigated in the most affected countries (i.e., USA, Italy, and Spain)
arXiv Detail & Related papers (2020-04-06T12:08:10Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.