A Deep Q-learning/genetic Algorithms Based Novel Methodology For
Optimizing Covid-19 Pandemic Government Actions
- URL: http://arxiv.org/abs/2005.07656v1
- Date: Fri, 15 May 2020 17:17:45 GMT
- Title: A Deep Q-learning/genetic Algorithms Based Novel Methodology For
Optimizing Covid-19 Pandemic Government Actions
- Authors: Luis Miralles-Pechu\'an, Fernando Jim\'enez, Hiram Ponce, Lourdes
Mart\'inez-Villase\~nor
- Abstract summary: We use the SEIR epidemiological model to represent the evolution of the virus COVID-19 over time in the population.
The sequences of actions (confinement, self-isolation, two-meter distance or not taking restrictions) are evaluated according to a reward system.
We prove that our methodology is a valid tool to discover actions governments can take to reduce the negative effects of a pandemic in both senses.
- Score: 63.669642197519934
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Whenever countries are threatened by a pandemic, as is the case with the
COVID-19 virus, governments should take the right actions to safeguard public
health as well as to mitigate the negative effects on the economy. In this
regard, there are two completely different approaches governments can take: a
restrictive one, in which drastic measures such as self-isolation can seriously
damage the economy, and a more liberal one, where more relaxed restrictions may
put at risk a high percentage of the population. The optimal approach could be
somewhere in between, and, in order to make the right decisions, it is
necessary to accurately estimate the future effects of taking one or other
measures. In this paper, we use the SEIR epidemiological model (Susceptible -
Exposed - Infected - Recovered) for infectious diseases to represent the
evolution of the virus COVID-19 over time in the population. To optimize the
best sequences of actions governments can take, we propose a methodology with
two approaches, one based on Deep Q-Learning and another one based on Genetic
Algorithms. The sequences of actions (confinement, self-isolation, two-meter
distance or not taking restrictions) are evaluated according to a reward system
focused on meeting two objectives: firstly, getting few people infected so that
hospitals are not overwhelmed with critical patients, and secondly, avoiding
taking drastic measures for too long which can potentially cause serious damage
to the economy. The conducted experiments prove that our methodology is a valid
tool to discover actions governments can take to reduce the negative effects of
a pandemic in both senses. We also prove that the approach based on Deep
Q-Learning overcomes the one based on Genetic Algorithms for optimizing the
sequences of actions.
Related papers
- Modeling and Optimization of Epidemiological Control Policies Through
Reinforcement Learning [0.0]
The impact of a pandemic can be minimized by enforcing certain restrictions on a community.
While minimizing infection and death rates, these restrictions can also lead to economic crises.
We train a reinforcement learning agent to enforce the optimal restriction on a pandemic simulation based on a reward function.
arXiv Detail & Related papers (2024-01-25T22:39:39Z) - Agent-Based Model: Simulating a Virus Expansion Based on the Acceptance
of Containment Measures [65.62256987706128]
Compartmental epidemiological models categorize individuals based on their disease status.
We propose an ABM architecture that combines an adapted SEIRD model with a decision-making model for citizens.
We illustrate the designed model by examining the progression of SARS-CoV-2 infections in A Coruna, Spain.
arXiv Detail & Related papers (2023-07-28T08:01:05Z) - Epidemic Control on a Large-Scale-Agent-Based Epidemiology Model using
Deep Deterministic Policy Gradient [0.7244731714427565]
lockdowns, rapid vaccination programs, school closures, and economic stimulus can have positive or unintended negative consequences.
Current research to model and determine an optimal intervention automatically through round-tripping is limited by the simulation objectives, scale (a few thousand individuals), model types that are not suited for intervention studies, and the number of intervention strategies they can explore (discrete vs continuous).
We address these challenges using a Deep Deterministic Policy Gradient (DDPG) based policy optimization framework on a large-scale (100,000 individual) epidemiological agent-based simulation.
arXiv Detail & Related papers (2023-04-10T09:26:07Z) - Evaluating COVID-19 vaccine allocation policies using Bayesian $m$-top
exploration [53.122045119395594]
We present a novel technique for evaluating vaccine allocation strategies using a multi-armed bandit framework.
$m$-top exploration allows the algorithm to learn $m$ policies for which it expects the highest utility.
We consider the Belgian COVID-19 epidemic using the individual-based model STRIDE, where we learn a set of vaccination policies.
arXiv Detail & Related papers (2023-01-30T12:22:30Z) - Discrete Stochastic Optimization for Public Health Interventions with
Constraints [1.8275108630751844]
This paper addresses aspects of the 2009 H1N1 and the COVID-19 pandemics with the spread of disease modeled by the open source Monte Carlo simulations.
The objective of the optimization is to determine the best combination of intervention strategies so as to result in minimal economic loss to society.
arXiv Detail & Related papers (2022-06-27T21:21:25Z) - Incorporating Expert Guidance in Epidemic Forecasting [79.91855362871496]
We propose a new approach leveraging the Seldonian optimization framework from AI safety.
We study two types of guidance: smoothness and regional consistency of errors.
We show that by its successful incorporation, we are able to not only bound the probability of undesirable behavior to happen, but also to reduce RMSE on test data by up to 17%.
arXiv Detail & Related papers (2020-12-24T06:21:53Z) - Epidemic mitigation by statistical inference from contact tracing data [61.04165571425021]
We develop Bayesian inference methods to estimate the risk that an individual is infected.
We propose to use probabilistic risk estimation in order to optimize testing and quarantining strategies for the control of an epidemic.
Our approaches translate into fully distributed algorithms that only require communication between individuals who have recently been in contact.
arXiv Detail & Related papers (2020-09-20T12:24:45Z) - Data-driven Simulation and Optimization for Covid-19 Exit Strategies [16.31545249131776]
The rapid spread of the Coronavirus SARS-2 is a major challenge that led almost all governments worldwide to take drastic measures to respond to the tragedy.
We have built a pandemic simulation and forecasting toolkit that combines a deep learning estimation of the epidemiological parameters of the disease.
arXiv Detail & Related papers (2020-06-12T11:18:25Z) - From Prediction to Prescription: Evolutionary Optimization of
Non-Pharmaceutical Interventions in the COVID-19 Pandemic [19.477459618274025]
Several models have been developed to predict how the COVID-19 pandemic spreads.
This paper demonstrates how evolutionary AI could be used to facilitate the next step, i.e. determining most effective intervention strategies.
arXiv Detail & Related papers (2020-05-28T03:43:31Z) - COVI White Paper [67.04578448931741]
Contact tracing is an essential tool to change the course of the Covid-19 pandemic.
We present an overview of the rationale, design, ethical considerations and privacy strategy of COVI,' a Covid-19 public peer-to-peer contact tracing and risk awareness mobile application developed in Canada.
arXiv Detail & Related papers (2020-05-18T07:40:49Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.