Learning Pareto-Optimal Pandemic Intervention Policies with MORL
- URL: http://arxiv.org/abs/2510.03340v1
- Date: Thu, 02 Oct 2025 12:06:29 GMT
- Title: Learning Pareto-Optimal Pandemic Intervention Policies with MORL
- Authors: Marian Chen, Miri Zilka
- Abstract summary: We develop a framework for modeling and evaluating disease-spread prevention strategies. Our simulator reproduces national-scale pandemic dynamics with orders of magnitude higher fidelity than other models. This work supports transparent, evidence-based policymaking for mitigating public health crises.
- Score: 1.160208922584163
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: The COVID-19 pandemic underscored a critical need for intervention strategies that balance disease containment with socioeconomic stability. We approach this challenge by designing a framework for modeling and evaluating disease-spread prevention strategies. Our framework leverages multi-objective reinforcement learning (MORL) - a formulation necessitated by competing objectives - combined with a new stochastic differential equation (SDE) pandemic simulator, calibrated and validated against global COVID-19 data. Our simulator reproduces national-scale pandemic dynamics with orders of magnitude higher fidelity than other models commonly used in reinforcement learning (RL) approaches to pandemic intervention. Training a Pareto-Conditioned Network (PCN) agent on this simulator, we illustrate the direct policy trade-offs between epidemiological control and economic stability for COVID-19. Furthermore, we demonstrate the framework's generality by extending it to pathogens with different epidemiological profiles, such as polio and influenza, and show how these profiles lead the agent to discover fundamentally different intervention policies. To ground our work in contemporary policymaking challenges, we apply the model to measles outbreaks, quantifying how a modest 5% drop in vaccination coverage necessitates significantly more stringent and costly interventions to curb disease spread. This work provides a robust and adaptable framework to support transparent, evidence-based policymaking for mitigating public health crises.
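The abstract combines two ideas that a small example can make concrete: a stochastic differential equation (SDE) epidemic simulator and a Pareto front over competing objectives. The sketch below is not the paper's simulator or its PCN agent. It is a toy stochastic SIR model integrated with Euler-Maruyama, followed by a brute-force Pareto filter over fixed-stringency policies. The function names, the parameter values (`beta`, `gamma`, `noise`), and the linear cost proxy are all assumptions made for illustration.

```python
import math
import random


def simulate_sir_sde(stringency, beta=0.3, gamma=0.1, noise=0.05,
                     days=200, dt=0.1, i0=0.001, seed=0):
    """Euler-Maruyama integration of a toy stochastic SIR model.

    State is tracked as population fractions. `stringency` in [0, 1]
    scales transmission down. Returns (attack_rate, economic_cost),
    both to be minimised. All defaults are illustrative assumptions.
    """
    rng = random.Random(seed)
    s, i = 1.0 - i0, i0
    beta_eff = beta * (1.0 - stringency)
    for _ in range(int(round(days / dt))):
        dw = rng.gauss(0.0, math.sqrt(dt))          # Brownian increment
        drift = beta_eff * s * i                    # deterministic infection rate
        infections = drift * dt + noise * drift * dw
        infections = min(max(infections, 0.0), s)   # keep fractions valid
        recoveries = min(gamma * i * dt, i)
        s -= infections
        i += infections - recoveries
    attack_rate = 1.0 - s        # fraction ever infected, including the seed
    economic_cost = stringency   # hypothetical linear cost proxy
    return attack_rate, economic_cost


def dominates(a, b):
    """True if `a` is no worse than `b` everywhere and strictly better once."""
    return (all(x <= y for x, y in zip(a, b))
            and any(x < y for x, y in zip(a, b)))


def pareto_front(points):
    """Return the non-dominated points (all objectives minimised)."""
    return [p for p in points if not any(dominates(q, p) for q in points)]


# Sweep a few constant-stringency policies and keep the non-dominated ones.
outcomes = [simulate_sir_sde(u) for u in (0.0, 0.25, 0.5, 0.75, 1.0)]
for attack_rate, cost in pareto_front(outcomes):
    print(f"cost={cost:.2f}  attack_rate={attack_rate:.3f}")
```

Each printed row is one trade-off point: no other policy in the sweep achieves both a lower attack rate and a lower cost. A learned agent would replace the fixed-stringency sweep with state-dependent policies, but the dominance test is the same.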
Related papers
- Coordinated Pandemic Control with Large Language Model Agents as Policymaking Assistants [51.26321657927398]
We propose a large language model (LLM) multi-agent policymaking framework that supports coordinated and proactive pandemic control across regions. By integrating real-world data, a pandemic evolution simulator, and structured inter-agent communication, our framework enables agents to jointly explore counterfactual intervention scenarios. Compared with real-world pandemic outcomes, our approach reduces cumulative infections and deaths by up to 63.7% and 40.1%, respectively, at the individual state level.
arXiv Detail & Related papers (2026-01-14T07:59:44Z)
- Integrating Genomics into Multimodal EHR Foundation Models [56.31910745104141]
This paper introduces an innovative EHR foundation model that integrates Polygenic Risk Scores (PRS) as a foundational data modality. The framework aims to learn complex relationships between clinical data and genetic predispositions. This approach is pivotal for unlocking new insights into disease prediction, proactive health management, risk stratification, and personalized treatment strategies.
arXiv Detail & Related papers (2025-10-24T15:56:40Z)
- Optimization of Infectious Disease Intervention Measures Based on Reinforcement Learning - Empirical analysis based on UK COVID-19 epidemic data [1.2637032027754087]
We establish a decision-making framework based on an individual agent-based transmission model. Covasim, a detailed and widely used agent-based disease transmission model, was modified to support reinforcement learning research.
arXiv Detail & Related papers (2025-05-07T06:23:26Z)
- Agent-Based Model: Simulating a Virus Expansion Based on the Acceptance of Containment Measures [65.62256987706128]
Compartmental epidemiological models categorize individuals based on their disease status.
We propose an ABM architecture that combines an adapted SEIRD model with a decision-making model for citizens.
We illustrate the designed model by examining the progression of SARS-CoV-2 infections in A Coruna, Spain.
arXiv Detail & Related papers (2023-07-28T08:01:05Z)
- Epidemic Control on a Large-Scale-Agent-Based Epidemiology Model using Deep Deterministic Policy Gradient [0.7244731714427565]
Lockdowns, rapid vaccination programs, school closures, and economic stimulus can have positive or unintended negative consequences.
Current research to model and determine an optimal intervention automatically through round-tripping is limited by the simulation objectives, scale (a few thousand individuals), model types that are not suited for intervention studies, and the number of intervention strategies they can explore (discrete vs continuous).
We address these challenges using a Deep Deterministic Policy Gradient (DDPG) based policy optimization framework on a large-scale (100,000 individual) epidemiological agent-based simulation.
arXiv Detail & Related papers (2023-04-10T09:26:07Z)
- Evaluating COVID-19 vaccine allocation policies using Bayesian $m$-top exploration [39.47724912690087]
We present a novel technique for evaluating vaccine allocation strategies using a multi-armed bandit framework. $m$-top exploration allows the algorithm to learn $m$ policies for which it expects the highest utility. We consider the Belgian COVID-19 epidemic using the individual-based model STRIDE, where we learn a set of vaccination policies that minimize the number of infections and hospitalisations.
arXiv Detail & Related papers (2023-01-30T12:22:30Z)
- Exploring the Pareto front of multi-objective COVID-19 mitigation policies using reinforcement learning [1.7056617973440933]
Infectious disease outbreaks can have a disruptive impact on public health and societal processes.
Current research focuses on optimizing policies with a single objective, such as the pathogen's attack rate.
We apply deep multi-objective reinforcement learning and build upon a state-of-the-art algorithm to learn a set of solutions.
arXiv Detail & Related papers (2022-04-11T11:55:06Z)
- Compartmental Models for COVID-19 and Control via Policy Interventions [0.0]
We demonstrate an approach to replicate and forecast the spread of the SARS-CoV-2 pandemic using the toolkit of probabilistic programming languages (PPLs).
Our goal is to study the impact of various modeling assumptions and motivate policy interventions enacted to limit the spread of infectious diseases.
We are not epidemiologists; the sole aim of this study is to serve as an exposition of methods, not to directly infer the real-world impact of policy-making for COVID-19.
arXiv Detail & Related papers (2022-03-06T02:50:54Z)
- Adversarial Sample Enhanced Domain Adaptation: A Case Study on Predictive Modeling with Electronic Health Records [57.75125067744978]
We propose a data augmentation method to facilitate domain adaptation.
Adversarially generated samples are used during domain adaptation.
Results confirm the effectiveness of our method and the generality on different tasks.
arXiv Detail & Related papers (2021-01-13T03:20:20Z)
- An Optimal Control Approach to Learning in SIDARTHE Epidemic model [67.22168759751541]
We propose a general approach for learning time-variant parameters of dynamic compartmental models from epidemic data.
We forecast the epidemic evolution in Italy and France.
arXiv Detail & Related papers (2020-10-28T10:58:59Z)
- Steering a Historical Disease Forecasting Model Under a Pandemic: Case of Flu and COVID-19 [75.99038202534628]
We propose CALI-Net, a neural transfer learning architecture which allows us to 'steer' a historical disease forecasting model to new scenarios where flu and COVID co-exist.
Our experiments demonstrate that our approach is successful in adapting a historical forecasting model to the current pandemic.
arXiv Detail & Related papers (2020-09-23T22:35:43Z)
- Multi-Objective Model-based Reinforcement Learning for Infectious Disease Control [19.022696762983017]
Severe infectious diseases such as the novel coronavirus (COVID-19) pose a huge threat to public health.
Stringent control measures, such as school closures and stay-at-home orders, while having significant effects, also bring huge economic losses.
We propose a Multi-Objective Model-based Reinforcement Learning framework to facilitate data-driven decision-making and minimize the overall long-term cost.
arXiv Detail & Related papers (2020-09-09T23:55:27Z)
- When and How to Lift the Lockdown? Global COVID-19 Scenario Analysis and Policy Assessment using Compartmental Gaussian Processes [111.69190108272133]
The coronavirus disease 2019 (COVID-19) global pandemic has led many countries to impose unprecedented lockdown measures.
Data-driven models that predict COVID-19 fatalities under different lockdown policy scenarios are essential.
This paper develops a Bayesian model for predicting the effects of COVID-19 lockdown policies in a global context.
arXiv Detail & Related papers (2020-05-13T18:21:50Z)
This list is automatically generated from the titles and abstracts of the papers in this site.