Reinforcement Learning for Optimization of COVID-19 Mitigation policies
- URL: http://arxiv.org/abs/2010.10560v1
- Date: Tue, 20 Oct 2020 18:40:15 GMT
- Title: Reinforcement Learning for Optimization of COVID-19 Mitigation policies
- Authors: Varun Kompella, Roberto Capobianco, Stacy Jong, Jonathan Browne,
Spencer Fox, Lauren Meyers, Peter Wurman, Peter Stone
- Abstract summary: The year 2020 has seen the COVID-19 virus lead to one of the worst global pandemics in history.
Governments around the world are faced with the challenge of protecting public health, while keeping the economy running to the greatest extent possible.
Epidemiological models provide insight into the spread of these types of diseases and predict the effects of possible intervention policies.
- Score: 29.4529156655747
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The year 2020 has seen the COVID-19 virus lead to one of the worst global
pandemics in history. As a result, governments around the world are faced with
the challenge of protecting public health, while keeping the economy running to
the greatest extent possible. Epidemiological models provide insight into the
spread of these types of diseases and predict the effects of possible
intervention policies. However, to date,the even the most data-driven
intervention policies rely on heuristics. In this paper, we study how
reinforcement learning (RL) can be used to optimize mitigation policies that
minimize the economic impact without overwhelming the hospital capacity. Our
main contributions are (1) a novel agent-based pandemic simulator which, unlike
traditional models, is able to model fine-grained interactions among people at
specific locations in a community; and (2) an RL-based methodology for
optimizing fine-grained mitigation policies within this simulator. Our results
validate both the overall simulator behavior and the learned policies under
realistic conditions.
Related papers
- Reduced-Rank Multi-objective Policy Learning and Optimization [57.978477569678844]
In practice, causal researchers do not have a single outcome in mind a priori.
In government-assisted social benefit programs, policymakers collect many outcomes to understand the multidimensional nature of poverty.
We present a data-driven dimensionality-reduction methodology for multiple outcomes in the context of optimal policy learning.
arXiv Detail & Related papers (2024-04-29T08:16:30Z) - Game and Reference: Policy Combination Synthesis for Epidemic Prevention and Control [4.635793210136456]
We present a novel Policy Combination Synthesis (PCS) model for epidemic policy-making.
To prevent extreme decisions, we introduce adversarial learning between the model-made policies and the real policies.
We also employ contrastive learning to let the model draw on experience from the best historical policies under similar scenarios.
arXiv Detail & Related papers (2024-03-16T00:26:59Z) - Epidemic Control on a Large-Scale-Agent-Based Epidemiology Model using
Deep Deterministic Policy Gradient [0.7244731714427565]
lockdowns, rapid vaccination programs, school closures, and economic stimulus can have positive or unintended negative consequences.
Current research to model and determine an optimal intervention automatically through round-tripping is limited by the simulation objectives, scale (a few thousand individuals), model types that are not suited for intervention studies, and the number of intervention strategies they can explore (discrete vs continuous).
We address these challenges using a Deep Deterministic Policy Gradient (DDPG) based policy optimization framework on a large-scale (100,000 individual) epidemiological agent-based simulation.
arXiv Detail & Related papers (2023-04-10T09:26:07Z) - Evaluating COVID-19 vaccine allocation policies using Bayesian $m$-top
exploration [53.122045119395594]
We present a novel technique for evaluating vaccine allocation strategies using a multi-armed bandit framework.
$m$-top exploration allows the algorithm to learn $m$ policies for which it expects the highest utility.
We consider the Belgian COVID-19 epidemic using the individual-based model STRIDE, where we learn a set of vaccination policies.
arXiv Detail & Related papers (2023-01-30T12:22:30Z) - Discrete Stochastic Optimization for Public Health Interventions with
Constraints [1.8275108630751844]
This paper addresses aspects of the 2009 H1N1 and the COVID-19 pandemics with the spread of disease modeled by the open source Monte Carlo simulations.
The objective of the optimization is to determine the best combination of intervention strategies so as to result in minimal economic loss to society.
arXiv Detail & Related papers (2022-06-27T21:21:25Z) - Building a Foundation for Data-Driven, Interpretable, and Robust Policy
Design using the AI Economist [67.08543240320756]
We show that the AI Economist framework enables effective, flexible, and interpretable policy design using two-level reinforcement learning and data-driven simulations.
We find that log-linear policies trained using RL significantly improve social welfare, based on both public health and economic outcomes, compared to past outcomes.
arXiv Detail & Related papers (2021-08-06T01:30:41Z) - Impact of Interventional Policies Including Vaccine on Covid-19
Propagation and Socio-Economic Factors [0.7874708385247353]
This study aims to provide a predictive analytics framework to model, predict and simulate COVID-19 propagation and socio-economic impact.
We have leveraged a recently launched open-source COVID-19 big data platform and used published research to find potentially relevant variables.
An advanced machine learning pipeline has been developed armed with a self-evolving model, deployed on a modern machine learning architecture.
arXiv Detail & Related papers (2021-01-11T15:08:07Z) - Data-driven Simulation and Optimization for Covid-19 Exit Strategies [16.31545249131776]
The rapid spread of the Coronavirus SARS-2 is a major challenge that led almost all governments worldwide to take drastic measures to respond to the tragedy.
We have built a pandemic simulation and forecasting toolkit that combines a deep learning estimation of the epidemiological parameters of the disease.
arXiv Detail & Related papers (2020-06-12T11:18:25Z) - COVI White Paper [67.04578448931741]
Contact tracing is an essential tool to change the course of the Covid-19 pandemic.
We present an overview of the rationale, design, ethical considerations and privacy strategy of COVI,' a Covid-19 public peer-to-peer contact tracing and risk awareness mobile application developed in Canada.
arXiv Detail & Related papers (2020-05-18T07:40:49Z) - A Deep Q-learning/genetic Algorithms Based Novel Methodology For
Optimizing Covid-19 Pandemic Government Actions [63.669642197519934]
We use the SEIR epidemiological model to represent the evolution of the virus COVID-19 over time in the population.
The sequences of actions (confinement, self-isolation, two-meter distance or not taking restrictions) are evaluated according to a reward system.
We prove that our methodology is a valid tool to discover actions governments can take to reduce the negative effects of a pandemic in both senses.
arXiv Detail & Related papers (2020-05-15T17:17:45Z) - When and How to Lift the Lockdown? Global COVID-19 Scenario Analysis and
Policy Assessment using Compartmental Gaussian Processes [111.69190108272133]
coronavirus disease 2019 (COVID-19) global pandemic has led many countries to impose unprecedented lockdown measures.
Data-driven models that predict COVID-19 fatalities under different lockdown policy scenarios are essential.
This paper develops a Bayesian model for predicting the effects of COVID-19 lockdown policies in a global context.
arXiv Detail & Related papers (2020-05-13T18:21:50Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.