An Interpretable Hybrid Predictive Model of COVID-19 Cases using
Autoregressive Model and LSTM
- URL: http://arxiv.org/abs/2211.17014v3
- Date: Sun, 30 Apr 2023 02:31:10 GMT
- Title: An Interpretable Hybrid Predictive Model of COVID-19 Cases using
Autoregressive Model and LSTM
- Authors: Yangyi Zhang, Sui Tang, and Guo Yu
- Abstract summary: We propose a novel hybrid model in which the interpretability of the Autoregressive model (AR) and the predictive power of the long short-term memory neural networks (LSTM) join forces.
Our study provides a new and promising direction for building effective and interpretable data-driven models.
- Score: 3.7555792840171787
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: The Coronavirus Disease 2019 (COVID-19) has a profound impact on global
health and economy, making it crucial to build accurate and interpretable
data-driven predictive models for COVID-19 cases to improve policy making. The
extremely large scale of the pandemic and the intrinsically changing
transmission characteristics pose great challenges for effective COVID-19 case
prediction. To address this challenge, we propose a novel hybrid model in which
the interpretability of the Autoregressive model (AR) and the predictive power
of the long short-term memory neural networks (LSTM) join forces. The proposed
hybrid model is formalized as a neural network with an architecture that
connects two composing model blocks, of which the relative contribution is
decided data-adaptively in the training procedure. We demonstrate the favorable
performance of the hybrid model over its two component models as well as other
popular predictive models through comprehensive numerical studies on two data
sources under multiple evaluation metrics. Specifically, in county-level data
of 8 California counties, our hybrid model achieves 4.173% MAPE on average,
outperforming the composing AR (5.629%) and LSTM (4.934%). In country-level
datasets, our hybrid model outperforms the widely-used predictive models - AR,
LSTM, SVM, Gradient Boosting, and Random Forest - in predicting COVID-19 cases
in 8 countries around the world. In addition, we illustrate the
interpretability of our proposed hybrid model, a key feature not shared by most
black-box predictive models for COVID-19 cases. Our study provides a new and
promising direction for building effective and interpretable data-driven
models, which could have significant implications for public health policy
making and control of the current and potential future pandemics.
Related papers
- MedDiffusion: Boosting Health Risk Prediction via Diffusion-based Data
Augmentation [58.93221876843639]
This paper introduces a novel, end-to-end diffusion-based risk prediction model, named MedDiffusion.
It enhances risk prediction performance by creating synthetic patient data during training to enlarge sample space.
It discerns hidden relationships between patient visits using a step-wise attention mechanism, enabling the model to automatically retain the most vital information for generating high-quality data.
arXiv Detail & Related papers (2023-10-04T01:36:30Z) - Temporal Deep Learning Architecture for Prediction of COVID-19 Cases in
India [1.7969777786551424]
Recently the new machine learning approaches are being used to understand the dynamic trend of COVID-19 spread.
We designed the recurrent and convolutional neural network models: vanilla LSTM, stacked LSTM, ED-LSTM, Bi-LSTM, CNN, and hybrid CNN+LSTM model.
The results show that the stacked LSTM and hybrid CNN+LSTM models perform best relative to other models.
arXiv Detail & Related papers (2021-08-31T13:28:51Z) - Back2Future: Leveraging Backfill Dynamics for Improving Real-time
Predictions in Future [73.03458424369657]
In real-time forecasting in public health, data collection is a non-trivial and demanding task.
'Backfill' phenomenon and its effect on model performance has been barely studied in the prior literature.
We formulate a novel problem and neural framework Back2Future that aims to refine a given model's predictions in real-time.
arXiv Detail & Related papers (2021-06-08T14:48:20Z) - A Twin Neural Model for Uplift [59.38563723706796]
Uplift is a particular case of conditional treatment effect modeling.
We propose a new loss function defined by leveraging a connection with the Bayesian interpretation of the relative risk.
We show our proposed method is competitive with the state-of-the-art in simulation setting and on real data from large scale randomized experiments.
arXiv Detail & Related papers (2021-05-11T16:02:39Z) - End-2-End COVID-19 Detection from Breath & Cough Audio [68.41471917650571]
We demonstrate the first attempt to diagnose COVID-19 using end-to-end deep learning from a crowd-sourced dataset of audio samples.
We introduce a novel modelling strategy using a custom deep neural network to diagnose COVID-19 from a joint breath and cough representation.
arXiv Detail & Related papers (2021-01-07T01:13:00Z) - A Spatial-Temporal Graph Based Hybrid Infectious Disease Model with
Application to COVID-19 [3.785123406103385]
As the COVID-19 pandemic evolves, reliable prediction plays an important role for policy making.
The data-driven machine learning models such as RNN can suffer in case of limited time series data such as COVID-19.
We combine SEIR and RNN on a graph structure to develop a hybrid-temporal model to achieve both accuracy and efficiency in training and forecasting.
arXiv Detail & Related papers (2020-10-18T19:34:54Z) - Robust Finite Mixture Regression for Heterogeneous Targets [70.19798470463378]
We propose an FMR model that finds sample clusters and jointly models multiple incomplete mixed-type targets simultaneously.
We provide non-asymptotic oracle performance bounds for our model under a high-dimensional learning framework.
The results show that our model can achieve state-of-the-art performance.
arXiv Detail & Related papers (2020-10-12T03:27:07Z) - Two-step penalised logistic regression for multi-omic data with an
application to cardiometabolic syndrome [62.997667081978825]
We implement a two-step approach to multi-omic logistic regression in which variable selection is performed on each layer separately.
Our approach should be preferred if the goal is to select as many relevant predictors as possible.
Our proposed approach allows us to identify features that characterise cardiometabolic syndrome at the molecular level.
arXiv Detail & Related papers (2020-08-01T10:36:27Z) - DeepCOVIDNet: An Interpretable Deep Learning Model for Predictive
Surveillance of COVID-19 Using Heterogeneous Features and their Interactions [2.30238915794052]
We propose a deep learning model to forecast the range of increase in COVID-19 infected cases in future days.
Using data collected from various sources, we estimate the range of increase in infected cases seven days into the future for all U.S. counties.
arXiv Detail & Related papers (2020-07-31T23:37:38Z) - Adaptive County Level COVID-19 Forecast Models: Analysis and Improvement [1.8899300124593645]
We adapt the state and county level model, TDEFSI-LONLY, to national and county level COVID-19 data.
We show that this model poorly forecasts the current pandemic.
We propose an alternate forecast model, it County Level Epidemiological Inference Recurrent Network (alg) that trains an LSTM backbone on national confirmed cases to learn a low dimensional time pattern.
arXiv Detail & Related papers (2020-06-16T17:20:54Z) - Forecasting the Spread of Covid-19 Under Control Scenarios Using LSTM
and Dynamic Behavioral Models [2.11622808613962]
This study proposes a novel hybrid model which combines a Long short-term memory (LSTM) artificial recurrent neural network with dynamic behavioral models.
The proposed model considers the effect of multiple factors to enhance the accuracy in predicting the number of cases and deaths across the top ten most-affected countries and Australia.
arXiv Detail & Related papers (2020-05-24T10:43:55Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.