Evaluating COVID-19 vaccine allocation policies using Bayesian $m$-top
exploration
- URL: http://arxiv.org/abs/2301.12822v1
- Date: Mon, 30 Jan 2023 12:22:30 GMT
- Title: Evaluating COVID-19 vaccine allocation policies using Bayesian $m$-top
exploration
- Authors: Alexandra Cimpean, Timothy Verstraeten, Lander Willem, Niel Hens, Ann
Now\'e, Pieter Libin
- Abstract summary: We present a novel technique for evaluating vaccine allocation strategies using a multi-armed bandit framework.
$m$-top exploration allows the algorithm to learn $m$ policies for which it expects the highest utility.
We consider the Belgian COVID-19 epidemic using the individual-based model STRIDE, where we learn a set of vaccination policies.
- Score: 53.122045119395594
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Individual-based epidemiological models support the study of fine-grained
preventive measures, such as tailored vaccine allocation policies, in silico.
As individual-based models are computationally intensive, it is pivotal to
identify optimal strategies within a reasonable computational budget. Moreover,
due to the high societal impact associated with the implementation of
preventive strategies, uncertainty regarding decisions should be communicated
to policy makers, which is naturally embedded in a Bayesian approach.
We present a novel technique for evaluating vaccine allocation strategies
using a multi-armed bandit framework in combination with a Bayesian anytime
$m$-top exploration algorithm. $m$-top exploration allows the algorithm to
learn $m$ policies for which it expects the highest utility, enabling experts
to inspect this small set of alternative strategies, along with their
quantified uncertainty. The anytime component provides policy advisors with
flexibility regarding the computation time and the desired confidence, which is
important as it is difficult to make this trade-off beforehand.
We consider the Belgian COVID-19 epidemic using the individual-based model
STRIDE, where we learn a set of vaccination policies that minimize the number
of infections and hospitalisations. Through experiments we show that our method
can efficiently identify the $m$-top policies, which is validated in a scenario
where the ground truth is available. Finally, we explore how vaccination
policies can best be organised under different contact reduction schemes.
Through these experiments, we show that the top policies follow a clear trend
regarding the prioritised age groups and assigned vaccine type, which provides
insights for future vaccination campaigns.
Related papers
- Epidemic Control on a Large-Scale-Agent-Based Epidemiology Model using
Deep Deterministic Policy Gradient [0.7244731714427565]
lockdowns, rapid vaccination programs, school closures, and economic stimulus can have positive or unintended negative consequences.
Current research to model and determine an optimal intervention automatically through round-tripping is limited by the simulation objectives, scale (a few thousand individuals), model types that are not suited for intervention studies, and the number of intervention strategies they can explore (discrete vs continuous).
We address these challenges using a Deep Deterministic Policy Gradient (DDPG) based policy optimization framework on a large-scale (100,000 individual) epidemiological agent-based simulation.
arXiv Detail & Related papers (2023-04-10T09:26:07Z) - Improved Policy Evaluation for Randomized Trials of Algorithmic Resource
Allocation [54.72195809248172]
We present a new estimator leveraging our proposed novel concept, that involves retrospective reshuffling of participants across experimental arms at the end of an RCT.
We prove theoretically that such an estimator is more accurate than common estimators based on sample means.
arXiv Detail & Related papers (2023-02-06T05:17:22Z) - Evaluating vaccine allocation strategies using simulation-assisted
causal modelling [7.9656669215132005]
Early on during a pandemic, vaccine availability is limited, requiring prioritisation of different population groups.
We develop a model to retrospectively evaluate age-dependent counterfactual vaccine allocation strategies against the COVID-19 pandemic.
We compare Israel's implemented vaccine allocation strategy in 2021 to counterfactual strategies such as no prioritisation, prioritisation of younger age groups or a strict risk-ranked approach.
arXiv Detail & Related papers (2022-12-14T14:24:17Z) - Nearly Optimal Latent State Decoding in Block MDPs [74.51224067640717]
In episodic Block MDPs, the decision maker has access to rich observations or contexts generated from a small number of latent states.
We are first interested in estimating the latent state decoding function based on data generated under a fixed behavior policy.
We then study the problem of learning near-optimal policies in the reward-free framework.
arXiv Detail & Related papers (2022-08-17T18:49:53Z) - Identification of Subgroups With Similar Benefits in Off-Policy Policy
Evaluation [60.71312668265873]
We develop a method to balance the need for personalization with confident predictions.
We show that our method can be used to form accurate predictions of heterogeneous treatment effects.
arXiv Detail & Related papers (2021-11-28T23:19:12Z) - Minimax Off-Policy Evaluation for Multi-Armed Bandits [58.7013651350436]
We study the problem of off-policy evaluation in the multi-armed bandit model with bounded rewards.
We develop minimax rate-optimal procedures under three settings.
arXiv Detail & Related papers (2021-01-19T18:55:29Z) - Stochastic Optimization for Vaccine and Testing Kit Allocation for the
COVID-19 Pandemic [0.0]
SARS-CoV-2 virus has exposed many flaws in the decision-making strategies used to distribute resources to combat global health crises.
In this paper, we leverage reinforcement learning and optimization to improve upon the allocation strategies for various resources.
arXiv Detail & Related papers (2021-01-04T19:08:32Z) - Machine Learning-Powered Mitigation Policy Optimization in
Epidemiological Models [33.88734751290751]
We propose a new approach for obtaining optimal policy recommendations based on epidemiological models.
We find that such a look-ahead strategy infers non-trivial policies that adhere well to the constraints specified.
arXiv Detail & Related papers (2020-10-16T16:27:17Z) - Multi-Objective Model-based Reinforcement Learning for Infectious
Disease Control [19.022696762983017]
Severe infectious diseases such as the novel coronavirus (COVID-19) pose a huge threat to public health.
Stringent control measures, such as school closures and stay-at-home orders, while having significant effects, also bring huge economic losses.
We propose a Multi-Objective Model-based Reinforcement Learning framework to facilitate data-driven decision-making and minimize the overall long-term cost.
arXiv Detail & Related papers (2020-09-09T23:55:27Z) - A Deep Q-learning/genetic Algorithms Based Novel Methodology For
Optimizing Covid-19 Pandemic Government Actions [63.669642197519934]
We use the SEIR epidemiological model to represent the evolution of the virus COVID-19 over time in the population.
The sequences of actions (confinement, self-isolation, two-meter distance or not taking restrictions) are evaluated according to a reward system.
We prove that our methodology is a valid tool to discover actions governments can take to reduce the negative effects of a pandemic in both senses.
arXiv Detail & Related papers (2020-05-15T17:17:45Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.