Learning Fair Policies for Infectious Diseases Mitigation using Path Integral Control
- URL: http://arxiv.org/abs/2502.09831v1
- Date: Fri, 14 Feb 2025 00:08:06 GMT
- Title: Learning Fair Policies for Infectious Diseases Mitigation using Path Integral Control
- Authors: Zhuangzhuang Jia, Hyuk Park, Gökçe Dayanıklı, Grani A. Hanasusanto,
- Abstract summary: Infectious diseases pose major public health challenges to society.
We propose a framework for sequential decision-making under uncertainty to design fairness-aware disease mitigation policies.
- Score: 0.4583163610461423
- License:
- Abstract: Infectious diseases pose major public health challenges to society, highlighting the importance of designing effective policies to reduce economic loss and mortality. In this paper, we propose a framework for sequential decision-making under uncertainty to design fairness-aware disease mitigation policies that incorporate various measures of unfairness. Specifically, our approach learns equitable vaccination and lockdown strategies based on a stochastic multi-group SIR model. To address the challenges of solving the resulting sequential decision-making problem, we adopt the path integral control algorithm as an efficient solution scheme. Through a case study, we demonstrate that our approach effectively improves fairness compared to conventional methods and provides valuable insights for policymakers.
Related papers
- Including frameworks of public health ethics in computational modelling of infectious disease interventions [36.437757915645385]
Many values recognised as important for ethical decision-making are missing from computational models.
We demonstrate a proof-of-concept approach to incorporate multiple public health values into the evaluation of a simple computational model for vaccination against a pathogen such as SARS-CoV-2.
arXiv Detail & Related papers (2025-01-31T04:22:25Z) - Evaluating COVID-19 vaccine allocation policies using Bayesian $m$-top
exploration [53.122045119395594]
We present a novel technique for evaluating vaccine allocation strategies using a multi-armed bandit framework.
$m$-top exploration allows the algorithm to learn $m$ policies for which it expects the highest utility.
We consider the Belgian COVID-19 epidemic using the individual-based model STRIDE, where we learn a set of vaccination policies.
arXiv Detail & Related papers (2023-01-30T12:22:30Z) - Stochastic Methods for AUC Optimization subject to AUC-based Fairness
Constraints [51.12047280149546]
A direct approach for obtaining a fair predictive model is to train the model through optimizing its prediction performance subject to fairness constraints.
We formulate the training problem of a fairness-aware machine learning model as an AUC optimization problem subject to a class of AUC-based fairness constraints.
We demonstrate the effectiveness of our approach on real-world data under different fairness metrics.
arXiv Detail & Related papers (2022-12-23T22:29:08Z) - Reinforcement Learning with Stepwise Fairness Constraints [50.538878453547966]
We introduce the study of reinforcement learning with stepwise fairness constraints.
We provide learning algorithms with strong theoretical guarantees in regard to policy optimality and fairness violation.
arXiv Detail & Related papers (2022-11-08T04:06:23Z) - Policy Optimization with Advantage Regularization for Long-Term Fairness
in Decision Systems [14.095401339355677]
Long-term fairness is an important factor of consideration in designing and deploying learning-based decision systems.
Recent work has proposed the use of Markov Decision Processes (MDPs) to formulate decision-making with long-term fairness requirements.
We show that policy optimization methods from deep reinforcement learning can be used to find strictly better decision policies.
arXiv Detail & Related papers (2022-10-22T20:41:36Z) - Exploring the Pareto front of multi-objective COVID-19 mitigation
policies using reinforcement learning [1.7056617973440933]
Infectious disease outbreaks can have a disruptive impact on public health and societal processes.
Current research focuses on optimizing policies with a single objective, such as the pathogen's attack rate.
We apply deep multi-objective reinforcement learning and build upon a state-of-the-art algorithm to learn a set of solutions.
arXiv Detail & Related papers (2022-04-11T11:55:06Z) - Off-Policy Imitation Learning from Observations [78.30794935265425]
Learning from Observations (LfO) is a practical reinforcement learning scenario from which many applications can benefit.
We propose a sample-efficient LfO approach that enables off-policy optimization in a principled manner.
Our approach is comparable with state-of-the-art locomotion in terms of both sample-efficiency and performance.
arXiv Detail & Related papers (2021-02-25T21:33:47Z) - Reliable Off-policy Evaluation for Reinforcement Learning [53.486680020852724]
In a sequential decision-making problem, off-policy evaluation estimates the expected cumulative reward of a target policy.
We propose a novel framework that provides robust and optimistic cumulative reward estimates using one or multiple logged data.
arXiv Detail & Related papers (2020-11-08T23:16:19Z) - Machine Learning-Powered Mitigation Policy Optimization in
Epidemiological Models [33.88734751290751]
We propose a new approach for obtaining optimal policy recommendations based on epidemiological models.
We find that such a look-ahead strategy infers non-trivial policies that adhere well to the constraints specified.
arXiv Detail & Related papers (2020-10-16T16:27:17Z) - Multi-Objective Model-based Reinforcement Learning for Infectious
Disease Control [19.022696762983017]
Severe infectious diseases such as the novel coronavirus (COVID-19) pose a huge threat to public health.
Stringent control measures, such as school closures and stay-at-home orders, while having significant effects, also bring huge economic losses.
We propose a Multi-Objective Model-based Reinforcement Learning framework to facilitate data-driven decision-making and minimize the overall long-term cost.
arXiv Detail & Related papers (2020-09-09T23:55:27Z) - Adaptive Estimator Selection for Off-Policy Evaluation [48.66170976187225]
We develop a generic data-driven method for estimator selection in off-policy policy evaluation settings.
We establish a strong performance guarantee for the method, showing that it is competitive with the oracle estimator, up to a constant factor.
arXiv Detail & Related papers (2020-02-18T16:57:42Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.