Learning Fair Policies for Infectious Diseases Mitigation using Path Integral Control
- URL: http://arxiv.org/abs/2502.09831v1
- Date: Fri, 14 Feb 2025 00:08:06 GMT
- Title: Learning Fair Policies for Infectious Diseases Mitigation using Path Integral Control
- Authors: Zhuangzhuang Jia, Hyuk Park, Gökçe Dayanıklı, Grani A. Hanasusanto
- Abstract summary: Infectious diseases pose major public health challenges to society. We propose a framework for sequential decision-making under uncertainty to design fairness-aware disease mitigation policies.
- Score: 0.4583163610461423
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract:   Infectious diseases pose major public health challenges to society, highlighting the importance of designing effective policies to reduce economic loss and mortality. In this paper, we propose a framework for sequential decision-making under uncertainty to design fairness-aware disease mitigation policies that incorporate various measures of unfairness. Specifically, our approach learns equitable vaccination and lockdown strategies based on a stochastic multi-group SIR model. To address the challenges of solving the resulting sequential decision-making problem, we adopt the path integral control algorithm as an efficient solution scheme. Through a case study, we demonstrate that our approach effectively improves fairness compared to conventional methods and provides valuable insights for policymakers. 
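As a rough illustration of the kind of model the abstract refers to, the following is a minimal sketch of a discrete-time multi-group SIR simulation with a fixed vaccination rate per group and one simple unfairness measure (the spread of cumulative infections across groups). It is not the paper's model or its path integral control algorithm: the function names, parameter values, noise form, and fairness measure are all assumptions made for this example.

```python
import random


def simulate_multigroup_sir(beta, gamma, s0, i0, vacc_rate,
                            steps=500, dt=0.1, noise=0.0, seed=0):
    """Simulate a toy multi-group SIR model in discrete time.

    All arguments are hypothetical illustration values, not the paper's:
      beta[g][h]   transmission rate from infectious group h to susceptible group g
      gamma        recovery rate (assumes gamma * dt <= 1)
      s0, i0       initial susceptible / infectious fractions per group
      vacc_rate[g] constant vaccination rate applied to susceptibles in group g
      noise        scale of a crude multiplicative shock on the force of infection
    Returns final (s, i, r) fractions and cumulative infections per group.
    """
    rng = random.Random(seed)
    n = len(s0)
    s, i = list(s0), list(i0)
    r = [1.0 - s[g] - i[g] for g in range(n)]
    cumulative = [0.0] * n
    for _ in range(steps):
        # Force of infection per group, computed from the state at the start of the step.
        force = [sum(beta[g][h] * i[h] for h in range(n)) for g in range(n)]
        for g in range(n):
            shock = max(0.0, 1.0 + noise * rng.gauss(0.0, 1.0))
            infections = min(s[g], force[g] * shock * s[g] * dt)
            vaccinations = min(s[g] - infections, vacc_rate[g] * s[g] * dt)
            recoveries = gamma * i[g] * dt
            s[g] -= infections + vaccinations
            i[g] += infections - recoveries
            r[g] += recoveries + vaccinations
            cumulative[g] += infections
    return s, i, r, cumulative


def infection_gap(cumulative):
    """One simple unfairness measure: spread of cumulative infections across groups."""
    return max(cumulative) - min(cumulative)


beta = [[0.6, 0.1], [0.1, 0.3]]  # group 0 has more contacts (assumed values)
start_s, start_i = [0.99, 0.99], [0.01, 0.01]
for label, policy in [("no vaccination", [0.0, 0.0]),
                      ("prioritise group 0", [0.05, 0.0])]:
    _, _, _, cum = simulate_multigroup_sir(beta, 0.2, start_s, start_i, policy)
    print(label, [round(c, 3) for c in cum], "gap:", round(infection_gap(cum), 3))
```

A sequential decision-making method such as the one the abstract describes would choose the vaccination (and lockdown) controls over time rather than fixing them in advance; the constant `vacc_rate` here only shows how a policy enters the dynamics and how a group-level disparity could be scored.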
 
      
Related papers
- Optimization of Infectious Disease Intervention Measures Based on Reinforcement Learning - Empirical analysis based on UK COVID-19 epidemic data [1.2637032027754087]
 We establish a decision-making framework based on an individual agent-based transmission model.
Covasim, a detailed and widely used agent-based disease transmission model, was modified to support reinforcement learning research.
 arXiv  Detail & Related papers  (2025-05-07T06:23:26Z)
- Including frameworks of public health ethics in computational modelling of infectious disease interventions [36.437757915645385]
 Many values recognised as important for ethical decision-making are missing from computational models.
We demonstrate a proof-of-concept approach to incorporate multiple public health values into the evaluation of a simple computational model for vaccination against a pathogen such as SARS-CoV-2.
 arXiv  Detail & Related papers  (2025-01-31T04:22:25Z)
- Optimal and Fair Encouragement Policy Evaluation and Learning [11.712023983596914]
 We study causal identification and robust estimation of optimal treatment rules, including under potential violations of positivity.
We develop a two-stage algorithm for solving over parametrized policy classes under general constraints to obtain variance-sensitive regret bounds.
We illustrate the methods in three case studies based on data from reminders of SNAP benefits, randomized encouragement to enroll in insurance, and from pretrial supervised release with electronic monitoring.
 arXiv  Detail & Related papers  (2023-09-12T20:45:30Z)
- Evaluating COVID-19 vaccine allocation policies using Bayesian $m$-top exploration [53.122045119395594]
 We present a novel technique for evaluating vaccine allocation strategies using a multi-armed bandit framework.
$m$-top exploration allows the algorithm to learn $m$ policies for which it expects the highest utility.
We consider the Belgian COVID-19 epidemic using the individual-based model STRIDE, where we learn a set of vaccination policies.
 arXiv  Detail & Related papers  (2023-01-30T12:22:30Z)
- Stochastic Methods for AUC Optimization subject to AUC-based Fairness Constraints [51.12047280149546]
 A direct approach for obtaining a fair predictive model is to train the model through optimizing its prediction performance subject to fairness constraints.
We formulate the training problem of a fairness-aware machine learning model as an AUC optimization problem subject to a class of AUC-based fairness constraints.
We demonstrate the effectiveness of our approach on real-world data under different fairness metrics.
 arXiv  Detail & Related papers  (2022-12-23T22:29:08Z)
- Reinforcement Learning with Stepwise Fairness Constraints [50.538878453547966]
 We introduce the study of reinforcement learning with stepwise fairness constraints.
We provide learning algorithms with strong theoretical guarantees in regard to policy optimality and fairness violation.
 arXiv  Detail & Related papers  (2022-11-08T04:06:23Z)
- Policy Optimization with Advantage Regularization for Long-Term Fairness in Decision Systems [14.095401339355677]
 Long-term fairness is an important factor of consideration in designing and deploying learning-based decision systems.
Recent work has proposed the use of Markov Decision Processes (MDPs) to formulate decision-making with long-term fairness requirements.
We show that policy optimization methods from deep reinforcement learning can be used to find strictly better decision policies.
 arXiv  Detail & Related papers  (2022-10-22T20:41:36Z)
- Exploring the Pareto front of multi-objective COVID-19 mitigation policies using reinforcement learning [1.7056617973440933]
 Infectious disease outbreaks can have a disruptive impact on public health and societal processes.
Current research focuses on optimizing policies with a single objective, such as the pathogen's attack rate.
We apply deep multi-objective reinforcement learning and build upon a state-of-the-art algorithm to learn a set of solutions.
 arXiv  Detail & Related papers  (2022-04-11T11:55:06Z)
- Constrained Policy Optimization via Bayesian World Models [79.0077602277004]
 LAMBDA is a model-based approach for policy optimization in safety-critical tasks modeled via constrained Markov decision processes.
We demonstrate LAMBDA's state-of-the-art performance on the Safety-Gym benchmark suite in terms of sample efficiency and constraint violation.
 arXiv  Detail & Related papers  (2022-01-24T17:02:22Z)
- Off-Policy Imitation Learning from Observations [78.30794935265425]
 Learning from Observations (LfO) is a practical reinforcement learning scenario from which many applications can benefit.
We propose a sample-efficient LfO approach that enables off-policy optimization in a principled manner.
Our approach is comparable with the state of the art in locomotion in terms of both sample efficiency and performance.
 arXiv  Detail & Related papers  (2021-02-25T21:33:47Z)
- Reliable Off-policy Evaluation for Reinforcement Learning [53.486680020852724]
 In a sequential decision-making problem, off-policy evaluation estimates the expected cumulative reward of a target policy.
We propose a novel framework that provides robust and optimistic cumulative reward estimates using one or more sets of logged data.
 arXiv  Detail & Related papers  (2020-11-08T23:16:19Z)
- Machine Learning-Powered Mitigation Policy Optimization in Epidemiological Models [33.88734751290751]
 We propose a new approach for obtaining optimal policy recommendations based on epidemiological models.
We find that such a look-ahead strategy infers non-trivial policies that adhere well to the constraints specified.
 arXiv  Detail & Related papers  (2020-10-16T16:27:17Z)
- Multi-Objective Model-based Reinforcement Learning for Infectious Disease Control [19.022696762983017]
 Severe infectious diseases such as the novel coronavirus (COVID-19) pose a huge threat to public health.
Stringent control measures, such as school closures and stay-at-home orders, while having significant effects, also bring huge economic losses.
We propose a Multi-Objective Model-based Reinforcement Learning framework to facilitate data-driven decision-making and minimize the overall long-term cost.
 arXiv  Detail & Related papers  (2020-09-09T23:55:27Z)
- Adaptive Estimator Selection for Off-Policy Evaluation [48.66170976187225]
 We develop a generic data-driven method for estimator selection in off-policy policy evaluation settings.
We establish a strong performance guarantee for the method, showing that it is competitive with the oracle estimator, up to a constant factor.
 arXiv  Detail & Related papers  (2020-02-18T16:57:42Z)
This list is automatically generated from the titles and abstracts of the papers on this site.
       
     