Epidemiologically and Socio-economically Optimal Policies via Bayesian
Optimization
- URL: http://arxiv.org/abs/2005.11257v2
- Date: Mon, 15 Jun 2020 03:44:27 GMT
- Title: Epidemiologically and Socio-economically Optimal Policies via Bayesian
Optimization
- Authors: Amit Chandak and Debojyoti Dey and Bhaskar Mukhoty and Purushottam Kar
- Abstract summary: Mass public quarantining, colloquially known as a lock-down, is a non-pharmaceutical intervention to check spread of disease.
This paper presents ESOP, a novel application of active machine learning techniques using Bayesian optimization, that interacts with an epidemiological model to arrive at lock-down schedules that optimally balance public health benefits and socio-economic downsides of reduced economic activity during lock-down periods.
- Score: 4.910276496877174
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Mass public quarantining, colloquially known as a lock-down, is a
non-pharmaceutical intervention to check spread of disease. This paper presents
ESOP (Epidemiologically and Socio-economically Optimal Policies), a novel
application of active machine learning techniques using Bayesian optimization,
that interacts with an epidemiological model to arrive at lock-down schedules
that optimally balance public health benefits and socio-economic downsides of
reduced economic activity during lock-down periods. The utility of ESOP is
demonstrated using case studies with VIPER
(Virus-Individual-Policy-EnviRonment), a stochastic agent-based simulator that
this paper also proposes. However, ESOP is flexible enough to interact with
arbitrary epidemiological simulators in a black-box manner, and produce
schedules that involve multiple phases of lock-downs.
Related papers
- SRAP-Agent: Simulating and Optimizing Scarce Resource Allocation Policy with LLM-based Agent [45.41401816514924]
We propose an innovative framework, SRAP-Agent, which integrates Large Language Models (LLMs) into economic simulations.
We conduct extensive policy simulation experiments to verify the feasibility and effectiveness of the SRAP-Agent.
arXiv Detail & Related papers (2024-10-18T03:43:42Z) - Generalization Bounds of Surrogate Policies for Combinatorial Optimization Problems [61.580419063416734]
A recent stream of structured learning approaches has improved the practical state of the art for a range of optimization problems.
The key idea is to exploit the statistical distribution over instances instead of dealing with instances separately.
In this article, we investigate methods that smooth the risk by perturbing the policy, which eases optimization and improves the generalization error.
arXiv Detail & Related papers (2024-07-24T12:00:30Z) - Provably Mitigating Overoptimization in RLHF: Your SFT Loss is Implicitly an Adversarial Regularizer [52.09480867526656]
We identify the source of misalignment as a form of distributional shift and uncertainty in learning human preferences.
To mitigate overoptimization, we first propose a theoretical algorithm that chooses the best policy for an adversarially chosen reward model.
Using the equivalence between reward models and the corresponding optimal policy, the algorithm features a simple objective that combines a preference optimization loss and a supervised learning loss.
arXiv Detail & Related papers (2024-05-26T05:38:50Z) - Model-based Causal Bayesian Optimization [74.78486244786083]
We introduce the first algorithm for Causal Bayesian Optimization with Multiplicative Weights (CBO-MW)
We derive regret bounds for CBO-MW that naturally depend on graph-related quantities.
Our experiments include a realistic demonstration of how CBO-MW can be used to learn users' demand patterns in a shared mobility system.
arXiv Detail & Related papers (2023-07-31T13:02:36Z) - Epidemic Control on a Large-Scale-Agent-Based Epidemiology Model using
Deep Deterministic Policy Gradient [0.7244731714427565]
lockdowns, rapid vaccination programs, school closures, and economic stimulus can have positive or unintended negative consequences.
Current research to model and determine an optimal intervention automatically through round-tripping is limited by the simulation objectives, scale (a few thousand individuals), model types that are not suited for intervention studies, and the number of intervention strategies they can explore (discrete vs continuous).
We address these challenges using a Deep Deterministic Policy Gradient (DDPG) based policy optimization framework on a large-scale (100,000 individual) epidemiological agent-based simulation.
arXiv Detail & Related papers (2023-04-10T09:26:07Z) - Evaluating COVID-19 vaccine allocation policies using Bayesian $m$-top
exploration [53.122045119395594]
We present a novel technique for evaluating vaccine allocation strategies using a multi-armed bandit framework.
$m$-top exploration allows the algorithm to learn $m$ policies for which it expects the highest utility.
We consider the Belgian COVID-19 epidemic using the individual-based model STRIDE, where we learn a set of vaccination policies.
arXiv Detail & Related papers (2023-01-30T12:22:30Z) - Discrete Stochastic Optimization for Public Health Interventions with
Constraints [1.8275108630751844]
This paper addresses aspects of the 2009 H1N1 and the COVID-19 pandemics with the spread of disease modeled by the open source Monte Carlo simulations.
The objective of the optimization is to determine the best combination of intervention strategies so as to result in minimal economic loss to society.
arXiv Detail & Related papers (2022-06-27T21:21:25Z) - Building a Foundation for Data-Driven, Interpretable, and Robust Policy
Design using the AI Economist [67.08543240320756]
We show that the AI Economist framework enables effective, flexible, and interpretable policy design using two-level reinforcement learning and data-driven simulations.
We find that log-linear policies trained using RL significantly improve social welfare, based on both public health and economic outcomes, compared to past outcomes.
arXiv Detail & Related papers (2021-08-06T01:30:41Z) - Optimal Control Policies to Address the Pandemic Health-Economy Dilemma [2.5649050169091376]
Non-pharmaceutical interventions (NPIs) are effective measures to contain a pandemic.
Yet, such control measures commonly have a negative effect on the economy.
We propose a macro-level approach to support resolving this Health-Economy Dilemma (HED)
arXiv Detail & Related papers (2021-02-24T13:39:07Z) - Machine Learning-Powered Mitigation Policy Optimization in
Epidemiological Models [33.88734751290751]
We propose a new approach for obtaining optimal policy recommendations based on epidemiological models.
We find that such a look-ahead strategy infers non-trivial policies that adhere well to the constraints specified.
arXiv Detail & Related papers (2020-10-16T16:27:17Z) - EpidemiOptim: A Toolbox for the Optimization of Control Policies in
Epidemiological Models [12.748861129923348]
EpidemiOptim is a Python toolbox that facilitates collaborations between researchers in epidemiology and optimization.
It turns epidemiological models and cost functions into optimization problems via a standard interface commonly used by optimization practitioners.
We illustrate the use of EpidemiOptim to find optimal policies for dynamical on-off lock-down control under the optimization of death toll and economic recess.
arXiv Detail & Related papers (2020-10-09T09:25:41Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.