Finding Counterfactually Optimal Action Sequences in Continuous State
Spaces
- URL: http://arxiv.org/abs/2306.03929v2
- Date: Mon, 6 Nov 2023 11:01:28 GMT
- Title: Finding Counterfactually Optimal Action Sequences in Continuous State
Spaces
- Authors: Stratis Tsirtsis, Manuel Gomez-Rodriguez
- Abstract summary: We describe a sequence of discrete actions and continuous states using finite horizon Markov decision processes.
We then develop a search method based on the $A*$ continuity of the environment's dynamics.
Our method is very efficient in practice, and it has the potential to offer interesting insights for sequential decision making tasks.
- Score: 22.84932480886562
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Whenever a clinician reflects on the efficacy of a sequence of treatment
decisions for a patient, they may try to identify critical time steps where,
had they made different decisions, the patient's health would have improved.
While recent methods at the intersection of causal inference and reinforcement
learning promise to aid human experts, as the clinician above, to
retrospectively analyze sequential decision making processes, they have focused
on environments with finitely many discrete states. However, in many practical
applications, the state of the environment is inherently continuous in nature.
In this paper, we aim to fill this gap. We start by formally characterizing a
sequence of discrete actions and continuous states using finite horizon Markov
decision processes and a broad class of bijective structural causal models.
Building upon this characterization, we formalize the problem of finding
counterfactually optimal action sequences and show that, in general, we cannot
expect to solve it in polynomial time. Then, we develop a search method based
on the $A^*$ algorithm that, under a natural form of Lipschitz continuity of
the environment's dynamics, is guaranteed to return the optimal solution to the
problem. Experiments on real clinical data show that our method is very
efficient in practice, and it has the potential to offer interesting insights
for sequential decision making tasks.
Related papers
- Anomaly Detection via Learning-Based Sequential Controlled Sensing [25.282033825977827]
We address the problem of detecting anomalies among a set of binary processes via learning-based controlled sensing.
To identify the anomalies, the decision-making agent is allowed to observe a subset of the processes at each time instant.
Our objective is to design a sequential selection policy that dynamically determines which processes to observe at each time.
arXiv Detail & Related papers (2023-11-30T07:49:33Z) - Provably Efficient UCB-type Algorithms For Learning Predictive State
Representations [55.00359893021461]
The sequential decision-making problem is statistically learnable if it admits a low-rank structure modeled by predictive state representations (PSRs)
This paper proposes the first known UCB-type approach for PSRs, featuring a novel bonus term that upper bounds the total variation distance between the estimated and true models.
In contrast to existing approaches for PSRs, our UCB-type algorithms enjoy computational tractability, last-iterate guaranteed near-optimal policy, and guaranteed model accuracy.
arXiv Detail & Related papers (2023-07-01T18:35:21Z) - Robust Fitted-Q-Evaluation and Iteration under Sequentially Exogenous
Unobserved Confounders [16.193776814471768]
We study robust policy evaluation and policy optimization in the presence of sequentially-exogenous unobserved confounders.
We provide sample complexity bounds, insights, and show effectiveness both in simulations and on real-world longitudinal healthcare data of treating sepsis.
arXiv Detail & Related papers (2023-02-01T18:40:53Z) - Task-agnostic Continual Hippocampus Segmentation for Smooth Population
Shifts [1.069533806668766]
We explore how such methods perform in a task-agnostic setting with gradual population shifts.
We propose ODEx, a holistic solution that combines out-of-distribution detection with continual learning techniques.
arXiv Detail & Related papers (2022-08-05T14:46:00Z) - Continuous-Time Modeling of Counterfactual Outcomes Using Neural
Controlled Differential Equations [84.42837346400151]
Estimating counterfactual outcomes over time has the potential to unlock personalized healthcare.
Existing causal inference approaches consider regular, discrete-time intervals between observations and treatment decisions.
We propose a controllable simulation environment based on a model of tumor growth for a range of scenarios.
arXiv Detail & Related papers (2022-06-16T17:15:15Z) - Optimal discharge of patients from intensive care via a data-driven
policy learning framework [58.720142291102135]
It is important that the patient discharge task addresses the nuanced trade-off between decreasing a patient's length of stay and the risk of readmission or even death following the discharge decision.
This work introduces an end-to-end general framework for capturing this trade-off to recommend optimal discharge timing decisions.
A data-driven approach is used to derive a parsimonious, discrete state space representation that captures a patient's physiological condition.
arXiv Detail & Related papers (2021-12-17T04:39:33Z) - Finding Regions of Heterogeneity in Decision-Making via Expected
Conditional Covariance [3.9775905909091804]
We present an algorithm for identifying types of contexts with high inter-decision-maker disagreement.
We formalize this as a causal inference problem, seeking a region where the assignment of decision-maker has a large causal effect on the decision.
We apply our algorithm to real-world healthcare datasets, recovering variation that aligns with existing clinical knowledge.
arXiv Detail & Related papers (2021-10-27T15:20:12Z) - Counterfactual Explanations in Sequential Decision Making Under
Uncertainty [27.763369810430653]
We develop methods to find counterfactual explanations for sequential decision making processes.
In our problem formulation, the counterfactual explanation specifies an alternative sequence of actions differing in at most k actions.
We show that our algorithm finds can provide valuable insights to enhance decision making under uncertainty.
arXiv Detail & Related papers (2021-07-06T17:38:19Z) - The Medkit-Learn(ing) Environment: Medical Decision Modelling through
Simulation [81.72197368690031]
We present a new benchmarking suite designed specifically for medical sequential decision making.
The Medkit-Learn(ing) Environment is a publicly available Python package providing simple and easy access to high-fidelity synthetic medical data.
arXiv Detail & Related papers (2021-06-08T10:38:09Z) - Resource Planning for Hospitals Under Special Consideration of the
COVID-19 Pandemic: Optimization and Sensitivity Analysis [87.31348761201716]
Crises like the COVID-19 pandemic pose a serious challenge to health-care institutions.
BaBSim.Hospital is a tool for capacity planning based on discrete event simulation.
We aim to investigate and optimize these parameters to improve BaBSim.Hospital.
arXiv Detail & Related papers (2021-05-16T12:38:35Z) - Explaining Clinical Decision Support Systems in Medical Imaging using
Cycle-Consistent Activation Maximization [112.2628296775395]
Clinical decision support using deep neural networks has become a topic of steadily growing interest.
clinicians are often hesitant to adopt the technology because its underlying decision-making process is considered to be intransparent and difficult to comprehend.
We propose a novel decision explanation scheme based on CycleGAN activation which generates high-quality visualizations of classifier decisions even in smaller data sets.
arXiv Detail & Related papers (2020-10-09T14:39:27Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.