Related papers: Multi-Objective Model-based Reinforcement Learning for Infectious Disease Control

URL: http://arxiv.org/abs/2009.04607v3
Date: Sat, 26 Feb 2022 21:00:20 GMT
Title: Multi-Objective Model-based Reinforcement Learning for Infectious Disease Control
Authors: Runzhe Wan, Xinyu Zhang, Rui Song
Abstract summary: Severe infectious diseases such as the novel coronavirus (COVID-19) pose a huge threat to public health. Stringent control measures, such as school closures and stay-at-home orders, while having significant effects, also bring huge economic losses. We propose a Multi-Objective Model-based Reinforcement Learning framework to facilitate data-driven decision-making and minimize the overall long-term cost.
Score: 19.022696762983017
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Severe infectious diseases such as the novel coronavirus (COVID-19) pose a huge threat to public health. Stringent control measures, such as school closures and stay-at-home orders, while having significant effects, also bring huge economic losses. In the face of an emerging infectious disease, a crucial question for policymakers is how to make the trade-off and implement the appropriate interventions timely given the huge uncertainty. In this work, we propose a Multi-Objective Model-based Reinforcement Learning framework to facilitate data-driven decision-making and minimize the overall long-term cost. Specifically, at each decision point, a Bayesian epidemiological model is first learned as the environment model, and then the proposed model-based multi-objective planning algorithm is applied to find a set of Pareto-optimal policies. This framework, combined with the prediction bands for each policy, provides a real-time decision support tool for policymakers. The application is demonstrated with the spread of COVID-19 in China.
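The idea of a set of Pareto-optimal policies mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' planning algorithm: the policy names, the two cost dimensions, and all numbers below are hypothetical, chosen only to show how dominated policies are filtered out when each policy is scored by an epidemiological cost and an economic cost (lower is better for both).

```python
def pareto_front(costs):
    """Return indices of cost vectors that no other vector dominates.

    A vector c_j dominates c_i if it is no worse in every objective
    and strictly better in at least one (lower cost is better).
    """
    front = []
    for i, ci in enumerate(costs):
        dominated = any(
            all(a <= b for a, b in zip(cj, ci))
            and any(a < b for a, b in zip(cj, ci))
            for j, cj in enumerate(costs)
            if j != i
        )
        if not dominated:
            front.append(i)
    return front


# Hypothetical (epidemiological cost, economic cost) per candidate policy.
policy_costs = {
    "no_intervention": (9.0, 0.0),
    "school_closure": (5.0, 3.0),
    "stay_at_home": (2.0, 8.0),
    "late_lockdown": (6.0, 8.5),
}

names = list(policy_costs)
front = [names[i] for i in pareto_front(list(policy_costs.values()))]
print(front)  # ['no_intervention', 'school_closure', 'stay_at_home']
```

In this toy example "late_lockdown" is dropped because "school_closure" is cheaper on both objectives; the three remaining policies represent different trade-offs that a decision-maker would still have to choose among.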

Related papers

Epidemic Control on a Large-Scale-Agent-Based Epidemiology Model using Deep Deterministic Policy Gradient [0.7244731714427565]
Lockdowns, rapid vaccination programs, school closures, and economic stimulus can have positive or unintended negative consequences. Current research to model and determine an optimal intervention automatically through round-tripping is limited by the simulation objectives, scale (a few thousand individuals), model types that are not suited for intervention studies, and the number of intervention strategies they can explore (discrete vs continuous). We address these challenges using a Deep Deterministic Policy Gradient (DDPG) based policy optimization framework on a large-scale (100,000 individual) epidemiological agent-based simulation.
arXiv Detail & Related papers (2023-04-10T09:26:07Z)
Evaluating COVID-19 vaccine allocation policies using Bayesian $m$-top exploration [53.122045119395594]
We present a novel technique for evaluating vaccine allocation strategies using a multi-armed bandit framework. $m$-top exploration allows the algorithm to learn $m$ policies for which it expects the highest utility. We consider the Belgian COVID-19 epidemic using the individual-based model STRIDE, where we learn a set of vaccination policies.
arXiv Detail & Related papers (2023-01-30T12:22:30Z)
Exploring the Pareto front of multi-objective COVID-19 mitigation policies using reinforcement learning [1.7056617973440933]
Infectious disease outbreaks can have a disruptive impact on public health and societal processes. Current research focuses on optimizing policies with a single objective, such as the pathogen's attack rate. We apply deep multi-objective reinforcement learning and build upon a state-of-the-art algorithm to learn a set of solutions.
arXiv Detail & Related papers (2022-04-11T11:55:06Z)
Compartmental Models for COVID-19 and Control via Policy Interventions [0.0]
We demonstrate an approach to replicate and forecast the spread of the SARS-CoV-2 pandemic using the toolkit of probabilistic programming languages (PPLs). Our goal is to study the impact of various modeling assumptions and motivate policy interventions enacted to limit the spread of infectious diseases. We are not epidemiologists; the sole aim of this study is to serve as an exposition of methods, not to directly infer the real-world impact of policy-making for COVID-19.
arXiv Detail & Related papers (2022-03-06T02:50:54Z)
Reinforcement Learning with Heterogeneous Data: Estimation and Inference [84.72174994749305]
We introduce the K-Heterogeneous Markov Decision Process (K-Hetero MDP) to address sequential decision problems with population heterogeneity. We propose the Auto-Clustered Policy Evaluation (ACPE) for estimating the value of a given policy, and the Auto-Clustered Policy Iteration (ACPI) for estimating the optimal policy in a given policy class. We present simulations to support our theoretical findings, and we conduct an empirical study on the standard MIMIC-III dataset.
arXiv Detail & Related papers (2022-01-31T20:58:47Z)
Evaluating model-based planning and planner amortization for continuous control [79.49319308600228]
We take a hybrid approach, combining model predictive control (MPC) with a learned model and model-free policy learning. We find that well-tuned model-free agents are strong baselines even for high DoF control problems. We show that it is possible to distil a model-based planner into a policy that amortizes the planning without any loss of performance.
arXiv Detail & Related papers (2021-10-07T12:00:40Z)
Adversarial Sample Enhanced Domain Adaptation: A Case Study on Predictive Modeling with Electronic Health Records [57.75125067744978]
We propose a data augmentation method to facilitate domain adaptation. Adversarially generated samples are used during domain adaptation. Results confirm the effectiveness of our method and its generality across different tasks.
arXiv Detail & Related papers (2021-01-13T03:20:20Z)
Optimal Policies for a Pandemic: A Stochastic Game Approach and a Deep Learning Algorithm [1.124958340749622]
Game theory has been an effective tool in the control of disease spread and in suggesting optimal policies at both individual and area levels. We propose a multi-region SEIR model based on differential game theory, aiming to formulate optimal regional policies for infectious diseases. We apply the proposed model and algorithm to study the COVID-19 pandemic in three states: New York, New Jersey, and Pennsylvania.
arXiv Detail & Related papers (2020-12-12T07:10:46Z)
An Optimal Control Approach to Learning in SIDARTHE Epidemic model [67.22168759751541]
We propose a general approach for learning time-variant parameters of dynamic compartmental models from epidemic data. We forecast the epidemic evolution in Italy and France.
arXiv Detail & Related papers (2020-10-28T10:58:59Z)
Machine Learning-Powered Mitigation Policy Optimization in Epidemiological Models [33.88734751290751]
We propose a new approach for obtaining optimal policy recommendations based on epidemiological models. We find that such a look-ahead strategy infers non-trivial policies that adhere well to the constraints specified.
arXiv Detail & Related papers (2020-10-16T16:27:17Z)
A Deep Q-learning/genetic Algorithms Based Novel Methodology For Optimizing Covid-19 Pandemic Government Actions [63.669642197519934]
We use the SEIR epidemiological model to represent the evolution of the virus COVID-19 over time in the population. The sequences of actions (confinement, self-isolation, two-meter distance or not taking restrictions) are evaluated according to a reward system. We prove that our methodology is a valid tool to discover actions governments can take to reduce the negative effects of a pandemic in both senses.
arXiv Detail & Related papers (2020-05-15T17:17:45Z)
When and How to Lift the Lockdown? Global COVID-19 Scenario Analysis and Policy Assessment using Compartmental Gaussian Processes [111.69190108272133]
The coronavirus disease 2019 (COVID-19) global pandemic has led many countries to impose unprecedented lockdown measures. Data-driven models that predict COVID-19 fatalities under different lockdown policy scenarios are essential. This paper develops a Bayesian model for predicting the effects of COVID-19 lockdown policies in a global context.
arXiv Detail & Related papers (2020-05-13T18:21:50Z)

This list is automatically generated from the titles and abstracts of the papers on this site.