Exploring the Pareto front of multi-objective COVID-19 mitigation
policies using reinforcement learning
- URL: http://arxiv.org/abs/2204.05027v1
- Date: Mon, 11 Apr 2022 11:55:06 GMT
- Title: Exploring the Pareto front of multi-objective COVID-19 mitigation
policies using reinforcement learning
- Authors: Mathieu Reymond, Conor F. Hayes, Lander Willem, Roxana R\u{a}dulescu,
Steven Abrams, Diederik M. Roijers, Enda Howley, Patrick Mannion, Niel Hens,
Ann Now\'e, Pieter Libin
- Abstract summary: Infectious disease outbreaks can have a disruptive impact on public health and societal processes.
Current research focuses on optimizing policies with a single objective, such as the pathogen's attack rate.
We apply deep multi-objective reinforcement learning and build upon a state-of-the-art algorithm to learn a set of solutions.
- Score: 1.7056617973440933
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Infectious disease outbreaks can have a disruptive impact on public health
and societal processes. As decision making in the context of epidemic
mitigation is hard, reinforcement learning provides a methodology to
automatically learn prevention strategies in combination with complex epidemic
models. Current research focuses on optimizing policies w.r.t. a single
objective, such as the pathogen's attack rate. However, as the mitigation of
epidemics involves distinct, and possibly conflicting criteria (i.a.,
prevalence, mortality, morbidity, cost), a multi-objective approach is
warranted to learn balanced policies. To lift this decision-making process to
real-world epidemic models, we apply deep multi-objective reinforcement
learning and build upon a state-of-the-art algorithm, Pareto Conditioned
Networks (PCN), to learn a set of solutions that approximates the Pareto front
of the decision problem. We consider the first wave of the Belgian COVID-19
epidemic, which was mitigated by a lockdown, and study different deconfinement
strategies, aiming to minimize both COVID-19 cases (i.e., infections and
hospitalizations) and the societal burden that is induced by the applied
mitigation measures. We contribute a multi-objective Markov decision process
that encapsulates the stochastic compartment model that was used to inform
policy makers during the COVID-19 epidemic. As these social mitigation measures
are implemented in a continuous action space that modulates the contact matrix
of the age-structured epidemic model, we extend PCN to this setting. We
evaluate the solution returned by PCN, and observe that it correctly learns to
reduce the social burden whenever the hospitalization rates are sufficiently
low. In this work, we thus show that multi-objective reinforcement learning is
attainable in complex epidemiological models and provides essential insights to
balance complex mitigation policies.
Related papers
- Reinforcement Learning for Safe Occupancy Strategies in Educational
Spaces during an Epidemic [9.68145635795782]
This research focuses on reinforcement learning (RL) to develop strategies that balance minimizing infections with maximizing in-person interactions in educational settings.
We introduce SafeCampus, a novel tool that simulates infection spread and facilitates the exploration of various RL algorithms.
arXiv Detail & Related papers (2023-12-23T04:51:23Z) - Agent-Based Model: Simulating a Virus Expansion Based on the Acceptance
of Containment Measures [65.62256987706128]
Compartmental epidemiological models categorize individuals based on their disease status.
We propose an ABM architecture that combines an adapted SEIRD model with a decision-making model for citizens.
We illustrate the designed model by examining the progression of SARS-CoV-2 infections in A Coruna, Spain.
arXiv Detail & Related papers (2023-07-28T08:01:05Z) - Epidemic Control on a Large-Scale-Agent-Based Epidemiology Model using
Deep Deterministic Policy Gradient [0.7244731714427565]
lockdowns, rapid vaccination programs, school closures, and economic stimulus can have positive or unintended negative consequences.
Current research to model and determine an optimal intervention automatically through round-tripping is limited by the simulation objectives, scale (a few thousand individuals), model types that are not suited for intervention studies, and the number of intervention strategies they can explore (discrete vs continuous).
We address these challenges using a Deep Deterministic Policy Gradient (DDPG) based policy optimization framework on a large-scale (100,000 individual) epidemiological agent-based simulation.
arXiv Detail & Related papers (2023-04-10T09:26:07Z) - Evaluating COVID-19 vaccine allocation policies using Bayesian $m$-top
exploration [53.122045119395594]
We present a novel technique for evaluating vaccine allocation strategies using a multi-armed bandit framework.
$m$-top exploration allows the algorithm to learn $m$ policies for which it expects the highest utility.
We consider the Belgian COVID-19 epidemic using the individual-based model STRIDE, where we learn a set of vaccination policies.
arXiv Detail & Related papers (2023-01-30T12:22:30Z) - Reinforcement Learning with Heterogeneous Data: Estimation and Inference [84.72174994749305]
We introduce the K-Heterogeneous Markov Decision Process (K-Hetero MDP) to address sequential decision problems with population heterogeneity.
We propose the Auto-Clustered Policy Evaluation (ACPE) for estimating the value of a given policy, and the Auto-Clustered Policy Iteration (ACPI) for estimating the optimal policy in a given policy class.
We present simulations to support our theoretical findings, and we conduct an empirical study on the standard MIMIC-III dataset.
arXiv Detail & Related papers (2022-01-31T20:58:47Z) - An Optimal Control Approach to Learning in SIDARTHE Epidemic model [67.22168759751541]
We propose a general approach for learning time-variant parameters of dynamic compartmental models from epidemic data.
We forecast the epidemic evolution in Italy and France.
arXiv Detail & Related papers (2020-10-28T10:58:59Z) - Semi-supervised Neural Networks solve an inverse problem for modeling
Covid-19 spread [61.9008166652035]
We study the spread of COVID-19 using a semi-supervised neural network.
We assume a passive part of the population remains isolated from the virus dynamics.
arXiv Detail & Related papers (2020-10-10T19:33:53Z) - Multi-Objective Model-based Reinforcement Learning for Infectious
Disease Control [19.022696762983017]
Severe infectious diseases such as the novel coronavirus (COVID-19) pose a huge threat to public health.
Stringent control measures, such as school closures and stay-at-home orders, while having significant effects, also bring huge economic losses.
We propose a Multi-Objective Model-based Reinforcement Learning framework to facilitate data-driven decision-making and minimize the overall long-term cost.
arXiv Detail & Related papers (2020-09-09T23:55:27Z) - Data-driven Simulation and Optimization for Covid-19 Exit Strategies [16.31545249131776]
The rapid spread of the Coronavirus SARS-2 is a major challenge that led almost all governments worldwide to take drastic measures to respond to the tragedy.
We have built a pandemic simulation and forecasting toolkit that combines a deep learning estimation of the epidemiological parameters of the disease.
arXiv Detail & Related papers (2020-06-12T11:18:25Z) - A Deep Q-learning/genetic Algorithms Based Novel Methodology For
Optimizing Covid-19 Pandemic Government Actions [63.669642197519934]
We use the SEIR epidemiological model to represent the evolution of the virus COVID-19 over time in the population.
The sequences of actions (confinement, self-isolation, two-meter distance or not taking restrictions) are evaluated according to a reward system.
We prove that our methodology is a valid tool to discover actions governments can take to reduce the negative effects of a pandemic in both senses.
arXiv Detail & Related papers (2020-05-15T17:17:45Z) - Deep reinforcement learning for large-scale epidemic control [0.3694429692322631]
We investigate a deep reinforcement learning approach to automatically learn prevention strategies in the context of pandemic influenza.
Our model balances complexity and computational efficiency such that the use of reinforcement learning techniques becomes attainable.
This experiment shows that deep reinforcement learning can be used to learn mitigation policies in complex epidemiological models with a large state space.
arXiv Detail & Related papers (2020-03-30T17:57:09Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.