Deep COVID-19 Forecasting for Multiple States with Data Augmentation
- URL: http://arxiv.org/abs/2302.01155v1
- Date: Thu, 2 Feb 2023 15:16:13 GMT
- Title: Deep COVID-19 Forecasting for Multiple States with Data Augmentation
- Authors: Chung Yan Fong and Dit-Yan Yeung
- Abstract summary: We propose a deep learning approach to forecasting state-level COVID-19 trends of weekly cumulative death in the United States (US) and incident cases in Germany.
This approach includes a transformer model, an ensemble method, and a data augmentation technique for time series.
Our model has achieved some of the best state-level results on the COVID-19 Forecast Hub for the US and for Germany.
- Score: 10.197800697048903
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: In this work, we propose a deep learning approach to forecasting state-level
COVID-19 trends of weekly cumulative death in the United States (US) and
incident cases in Germany. This approach includes a transformer model, an
ensemble method, and a data augmentation technique for time series. We arrange
the inputs of the transformer in such a way that predictions for different
states can attend to the trends of the others. To overcome the issue of
scarcity of training data for this COVID-19 pandemic, we have developed a novel
data augmentation technique to generate useful data for training. More
importantly, the generated data can also be used for model validation. As such,
it has a two-fold advantage: 1) more actual observations can be used for
training, and 2) the model can be validated on data which has distribution
closer to the expected situation. Our model has achieved some of the best
state-level results on the COVID-19 Forecast Hub for the US and for Germany.
Related papers
- Learning Augmentation Policies from A Model Zoo for Time Series Forecasting [58.66211334969299]
We introduce AutoTSAug, a learnable data augmentation method based on reinforcement learning.
By augmenting the marginal samples with a learnable policy, AutoTSAug substantially improves forecasting performance.
arXiv Detail & Related papers (2024-09-10T07:34:19Z) - Ask Your Distribution Shift if Pre-Training is Right for You [74.18516460467019]
In practice, fine-tuning a pre-trained model improves robustness significantly in some cases but not at all in others.
We focus on two possible failure modes of models under distribution shift: poor extrapolation and biases in the training data.
Our study suggests that, as a rule of thumb, pre-training can help mitigate poor extrapolation but not dataset biases.
arXiv Detail & Related papers (2024-02-29T23:46:28Z) - Pre-training on Synthetic Driving Data for Trajectory Prediction [61.520225216107306]
We propose a pipeline-level solution to mitigate the issue of data scarcity in trajectory forecasting.
We adopt HD map augmentation and trajectory synthesis for generating driving data, and then we learn representations by pre-training on them.
We conduct extensive experiments to demonstrate the effectiveness of our data expansion and pre-training strategies.
arXiv Detail & Related papers (2023-09-18T19:49:22Z) - Strict baselines for Covid-19 forecasting and ML perspective for USA and
Russia [105.54048699217668]
Covid-19 allows researchers to gather datasets accumulated over 2 years and to use them in predictive analysis.
We present the results of a consistent comparative study of different types of methods for predicting the dynamics of the spread of Covid-19 based on regional data for two countries: the United States and Russia.
arXiv Detail & Related papers (2022-07-15T18:21:36Z) - How to Learn when Data Gradually Reacts to Your Model [10.074466859579571]
We propose a new algorithm, Stateful Performative Gradient Descent (Stateful PerfGD), for minimizing the performative loss even in the presence of these effects.
Our experiments confirm that Stateful PerfGD substantially outperforms previous state-of-the-art methods.
arXiv Detail & Related papers (2021-12-13T22:05:26Z) - Improving COVID-19 Forecasting using eXogenous Variables [7.245000255986182]
We study the pandemic course in the United States by considering national and state levels data.
We propose and compare multiple time-series prediction techniques which incorporate auxiliary variables.
arXiv Detail & Related papers (2021-07-20T03:26:18Z) - Back2Future: Leveraging Backfill Dynamics for Improving Real-time
Predictions in Future [73.03458424369657]
In real-time forecasting in public health, data collection is a non-trivial and demanding task.
'Backfill' phenomenon and its effect on model performance has been barely studied in the prior literature.
We formulate a novel problem and neural framework Back2Future that aims to refine a given model's predictions in real-time.
arXiv Detail & Related papers (2021-06-08T14:48:20Z) - Contrastive Model Inversion for Data-Free Knowledge Distillation [60.08025054715192]
We propose Contrastive Model Inversion, where the data diversity is explicitly modeled as an optimizable objective.
Our main observation is that, under the constraint of the same amount of data, higher data diversity usually indicates stronger instance discrimination.
Experiments on CIFAR-10, CIFAR-100, and Tiny-ImageNet demonstrate that CMI achieves significantly superior performance when the generated data are used for knowledge distillation.
arXiv Detail & Related papers (2021-05-18T15:13:00Z) - Examining Deep Learning Models with Multiple Data Sources for COVID-19
Forecasting [10.052302234274256]
We design and analysis of deep learning-based models for COVID-19 forecasting.
We consider multiple sources such as COVID-19 confirmed and death case count data and testing data for better predictions.
We propose clustering-based training for high-temporal forecasting.
arXiv Detail & Related papers (2020-10-27T17:52:02Z) - Adaptive County Level COVID-19 Forecast Models: Analysis and Improvement [1.8899300124593645]
We adapt the state and county level model, TDEFSI-LONLY, to national and county level COVID-19 data.
We show that this model poorly forecasts the current pandemic.
We propose an alternate forecast model, it County Level Epidemiological Inference Recurrent Network (alg) that trains an LSTM backbone on national confirmed cases to learn a low dimensional time pattern.
arXiv Detail & Related papers (2020-06-16T17:20:54Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.