Multi-Objective Model-based Reinforcement Learning for Infectious
Disease Control
- URL: http://arxiv.org/abs/2009.04607v3
- Date: Sat, 26 Feb 2022 21:00:20 GMT
- Title: Multi-Objective Model-based Reinforcement Learning for Infectious
Disease Control
- Authors: Runzhe Wan, Xinyu Zhang, Rui Song
- Abstract summary: Severe infectious diseases such as the novel coronavirus (COVID-19) pose a huge threat to public health.
Stringent control measures, such as school closures and stay-at-home orders, while having significant effects, also bring huge economic losses.
We propose a Multi-Objective Model-based Reinforcement Learning framework to facilitate data-driven decision-making and minimize the overall long-term cost.
- Score: 19.022696762983017
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Severe infectious diseases such as the novel coronavirus (COVID-19) pose a
huge threat to public health. Stringent control measures, such as school
closures and stay-at-home orders, while having significant effects, also bring
huge economic losses. In the face of an emerging infectious disease, a crucial
question for policymakers is how to make the trade-off and implement the
appropriate interventions timely given the huge uncertainty. In this work, we
propose a Multi-Objective Model-based Reinforcement Learning framework to
facilitate data-driven decision-making and minimize the overall long-term cost.
Specifically, at each decision point, a Bayesian epidemiological model is first
learned as the environment model, and then the proposed model-based
multi-objective planning algorithm is applied to find a set of Pareto-optimal
policies. This framework, combined with the prediction bands for each policy,
provides a real-time decision support tool for policymakers. The application is
demonstrated with the spread of COVID-19 in China.
Related papers
- Epidemic Control on a Large-Scale-Agent-Based Epidemiology Model using
Deep Deterministic Policy Gradient [0.7244731714427565]
lockdowns, rapid vaccination programs, school closures, and economic stimulus can have positive or unintended negative consequences.
Current research to model and determine an optimal intervention automatically through round-tripping is limited by the simulation objectives, scale (a few thousand individuals), model types that are not suited for intervention studies, and the number of intervention strategies they can explore (discrete vs continuous).
We address these challenges using a Deep Deterministic Policy Gradient (DDPG) based policy optimization framework on a large-scale (100,000 individual) epidemiological agent-based simulation.
arXiv Detail & Related papers (2023-04-10T09:26:07Z) - Evaluating COVID-19 vaccine allocation policies using Bayesian $m$-top
exploration [53.122045119395594]
We present a novel technique for evaluating vaccine allocation strategies using a multi-armed bandit framework.
$m$-top exploration allows the algorithm to learn $m$ policies for which it expects the highest utility.
We consider the Belgian COVID-19 epidemic using the individual-based model STRIDE, where we learn a set of vaccination policies.
arXiv Detail & Related papers (2023-01-30T12:22:30Z) - Exploring the Pareto front of multi-objective COVID-19 mitigation
policies using reinforcement learning [1.7056617973440933]
Infectious disease outbreaks can have a disruptive impact on public health and societal processes.
Current research focuses on optimizing policies with a single objective, such as the pathogen's attack rate.
We apply deep multi-objective reinforcement learning and build upon a state-of-the-art algorithm to learn a set of solutions.
arXiv Detail & Related papers (2022-04-11T11:55:06Z) - Compartmental Models for COVID-19 and Control via Policy Interventions [0.0]
We demonstrate an approach to replicate and forecast the spread of the SARS-CoV-2 pandemic using the toolkit of probabilistic programming languages (PPLs)
Our goal is to study the impact of various modeling assumptions and motivate policy interventions enacted to limit the spread of infectious diseases.
We are not epidemiologists; the sole aim of this study is to serve as an exposition of methods, not to directly infer the real-world impact of policy-making for COVID-19.
arXiv Detail & Related papers (2022-03-06T02:50:54Z) - Reinforcement Learning with Heterogeneous Data: Estimation and Inference [84.72174994749305]
We introduce the K-Heterogeneous Markov Decision Process (K-Hetero MDP) to address sequential decision problems with population heterogeneity.
We propose the Auto-Clustered Policy Evaluation (ACPE) for estimating the value of a given policy, and the Auto-Clustered Policy Iteration (ACPI) for estimating the optimal policy in a given policy class.
We present simulations to support our theoretical findings, and we conduct an empirical study on the standard MIMIC-III dataset.
arXiv Detail & Related papers (2022-01-31T20:58:47Z) - Adversarial Sample Enhanced Domain Adaptation: A Case Study on
Predictive Modeling with Electronic Health Records [57.75125067744978]
We propose a data augmentation method to facilitate domain adaptation.
adversarially generated samples are used during domain adaptation.
Results confirm the effectiveness of our method and the generality on different tasks.
arXiv Detail & Related papers (2021-01-13T03:20:20Z) - Optimal Policies for a Pandemic: A Stochastic Game Approach and a Deep
Learning Algorithm [1.124958340749622]
Game theory has been an effective tool in the control of disease spread and in suggesting optimal policies at both individual and area levels.
We propose a multi-region SEIR model based on differential game theory, aiming to formulate optimal regional policies for infectious diseases.
We apply the proposed model and algorithm to study the COVID-19 pandemic in three states: New York, New Jersey, and Pennsylvania.
arXiv Detail & Related papers (2020-12-12T07:10:46Z) - An Optimal Control Approach to Learning in SIDARTHE Epidemic model [67.22168759751541]
We propose a general approach for learning time-variant parameters of dynamic compartmental models from epidemic data.
We forecast the epidemic evolution in Italy and France.
arXiv Detail & Related papers (2020-10-28T10:58:59Z) - Machine Learning-Powered Mitigation Policy Optimization in
Epidemiological Models [33.88734751290751]
We propose a new approach for obtaining optimal policy recommendations based on epidemiological models.
We find that such a look-ahead strategy infers non-trivial policies that adhere well to the constraints specified.
arXiv Detail & Related papers (2020-10-16T16:27:17Z) - A Deep Q-learning/genetic Algorithms Based Novel Methodology For
Optimizing Covid-19 Pandemic Government Actions [63.669642197519934]
We use the SEIR epidemiological model to represent the evolution of the virus COVID-19 over time in the population.
The sequences of actions (confinement, self-isolation, two-meter distance or not taking restrictions) are evaluated according to a reward system.
We prove that our methodology is a valid tool to discover actions governments can take to reduce the negative effects of a pandemic in both senses.
arXiv Detail & Related papers (2020-05-15T17:17:45Z) - When and How to Lift the Lockdown? Global COVID-19 Scenario Analysis and
Policy Assessment using Compartmental Gaussian Processes [111.69190108272133]
coronavirus disease 2019 (COVID-19) global pandemic has led many countries to impose unprecedented lockdown measures.
Data-driven models that predict COVID-19 fatalities under different lockdown policy scenarios are essential.
This paper develops a Bayesian model for predicting the effects of COVID-19 lockdown policies in a global context.
arXiv Detail & Related papers (2020-05-13T18:21:50Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.