Optimal Policies for a Pandemic: A Stochastic Game Approach and a Deep
Learning Algorithm
- URL: http://arxiv.org/abs/2012.06745v2
- Date: Mon, 8 Mar 2021 20:15:55 GMT
- Title: Optimal Policies for a Pandemic: A Stochastic Game Approach and a Deep
Learning Algorithm
- Authors: Yao Xuan, Robert Balkin, Jiequn Han, Ruimeng Hu, Hector D. Ceniceros
- Abstract summary: Game theory has been an effective tool in the control of disease spread and in suggesting optimal policies at both individual and area levels.
We propose a multi-region SEIR model based on differential game theory, aiming to formulate optimal regional policies for infectious diseases.
We apply the proposed model and algorithm to study the COVID-19 pandemic in three states: New York, New Jersey, and Pennsylvania.
- Score: 1.124958340749622
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Game theory has been an effective tool in the control of disease spread and
in suggesting optimal policies at both individual and area levels. In this
paper, we propose a multi-region SEIR model based on stochastic differential
game theory, aiming to formulate optimal regional policies for infectious
diseases. Specifically, we enhance the standard epidemic SEIR model by taking
into account the social and health policies issued by multiple region planners.
This enhancement makes the model more realistic and powerful. However, it also
introduces a formidable computational challenge due to the high dimensionality
of the solution space brought by the presence of multiple regions. This
significant numerical difficulty of the model structure motivates us to
generalize the deep fictitious algorithm introduced in [Han and Hu, MSML2020,
pp.221--245, PMLR, 2020] and develop an improved algorithm to overcome the
curse of dimensionality. We apply the proposed model and algorithm to study the
COVID-19 pandemic in three states: New York, New Jersey, and Pennsylvania. The
model parameters are estimated from real data posted by the Centers for Disease
Control and Prevention (CDC). We are able to show the effects of the
lockdown/travel ban policy on the spread of COVID-19 for each state and how
their policies affect each other.
Related papers
- Deep Dynamic Epidemiological Modelling for COVID-19 Forecasting in
Multi-level Districts [23.725821734863487]
Epidemiological equations based on the SEIR model simulate disease development.
Traditional parameter estimation method to solve SEIR equations could not precisely fit real-world data.
Deep dynamic epidemiological (DDE) method combines epidemiological equations and deep-learning advantages.
arXiv Detail & Related papers (2023-06-21T06:30:02Z) - Epidemic Control on a Large-Scale-Agent-Based Epidemiology Model using
Deep Deterministic Policy Gradient [0.7244731714427565]
lockdowns, rapid vaccination programs, school closures, and economic stimulus can have positive or unintended negative consequences.
Current research to model and determine an optimal intervention automatically through round-tripping is limited by the simulation objectives, scale (a few thousand individuals), model types that are not suited for intervention studies, and the number of intervention strategies they can explore (discrete vs continuous).
We address these challenges using a Deep Deterministic Policy Gradient (DDPG) based policy optimization framework on a large-scale (100,000 individual) epidemiological agent-based simulation.
arXiv Detail & Related papers (2023-04-10T09:26:07Z) - Evaluating COVID-19 vaccine allocation policies using Bayesian $m$-top
exploration [53.122045119395594]
We present a novel technique for evaluating vaccine allocation strategies using a multi-armed bandit framework.
$m$-top exploration allows the algorithm to learn $m$ policies for which it expects the highest utility.
We consider the Belgian COVID-19 epidemic using the individual-based model STRIDE, where we learn a set of vaccination policies.
arXiv Detail & Related papers (2023-01-30T12:22:30Z) - Multi-Objective Policy Gradients with Topological Constraints [108.10241442630289]
We present a new algorithm for a policy gradient in TMDPs by a simple extension of the proximal policy optimization (PPO) algorithm.
We demonstrate this on a real-world multiple-objective navigation problem with an arbitrary ordering of objectives both in simulation and on a real robot.
arXiv Detail & Related papers (2022-09-15T07:22:58Z) - Compartmental Models for COVID-19 and Control via Policy Interventions [0.0]
We demonstrate an approach to replicate and forecast the spread of the SARS-CoV-2 pandemic using the toolkit of probabilistic programming languages (PPLs)
Our goal is to study the impact of various modeling assumptions and motivate policy interventions enacted to limit the spread of infectious diseases.
We are not epidemiologists; the sole aim of this study is to serve as an exposition of methods, not to directly infer the real-world impact of policy-making for COVID-19.
arXiv Detail & Related papers (2022-03-06T02:50:54Z) - Modeling the geospatial evolution of COVID-19 using spatio-temporal
convolutional sequence-to-sequence neural networks [48.7576911714538]
Portugal was the country in the world with the largest incidence rate, with 14-days incidence rates per 100,000 inhabitants in excess of 1000.
Despite its importance, accurate prediction of the geospatial evolution of COVID-19 remains a challenge.
arXiv Detail & Related papers (2021-05-06T15:24:00Z) - An Optimal Control Approach to Learning in SIDARTHE Epidemic model [67.22168759751541]
We propose a general approach for learning time-variant parameters of dynamic compartmental models from epidemic data.
We forecast the epidemic evolution in Italy and France.
arXiv Detail & Related papers (2020-10-28T10:58:59Z) - Multi-Objective Model-based Reinforcement Learning for Infectious
Disease Control [19.022696762983017]
Severe infectious diseases such as the novel coronavirus (COVID-19) pose a huge threat to public health.
Stringent control measures, such as school closures and stay-at-home orders, while having significant effects, also bring huge economic losses.
We propose a Multi-Objective Model-based Reinforcement Learning framework to facilitate data-driven decision-making and minimize the overall long-term cost.
arXiv Detail & Related papers (2020-09-09T23:55:27Z) - When and How to Lift the Lockdown? Global COVID-19 Scenario Analysis and
Policy Assessment using Compartmental Gaussian Processes [111.69190108272133]
coronavirus disease 2019 (COVID-19) global pandemic has led many countries to impose unprecedented lockdown measures.
Data-driven models that predict COVID-19 fatalities under different lockdown policy scenarios are essential.
This paper develops a Bayesian model for predicting the effects of COVID-19 lockdown policies in a global context.
arXiv Detail & Related papers (2020-05-13T18:21:50Z) - Decentralized MCTS via Learned Teammate Models [89.24858306636816]
We present a trainable online decentralized planning algorithm based on decentralized Monte Carlo Tree Search.
We show that deep learning and convolutional neural networks can be employed to produce accurate policy approximators.
arXiv Detail & Related papers (2020-03-19T13:10:20Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.