Identifying Patient-Specific Root Causes of Disease
- URL: http://arxiv.org/abs/2205.11627v1
- Date: Mon, 23 May 2022 20:54:24 GMT
- Title: Identifying Patient-Specific Root Causes of Disease
- Authors: Eric V. Strobl, Thomas A. Lasko
- Abstract summary: Complex diseases are caused by a multitude of factors that may differ between patients.
A few highly predictive root causes may nevertheless generate disease within each patient.
- Score: 10.885111578191564
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Complex diseases are caused by a multitude of factors that may differ between
patients. As a result, hypothesis tests comparing all patients to all healthy
controls can detect many significant variables with inconsequential effect
sizes. A few highly predictive root causes may nevertheless generate disease
within each patient. In this paper, we define patient-specific root causes as
variables subject to exogenous "shocks" which go on to perturb an otherwise
healthy system and induce disease. In other words, the variables are associated
with the exogenous errors of a structural equation model (SEM), and these
errors predict a downstream diagnostic label. We quantify predictivity using
sample-specific Shapley values. This derivation allows us to develop a fast
algorithm called Root Causal Inference for identifying patient-specific root
causes by extracting the error terms of a linear SEM and then computing the
Shapley value associated with each error. Experiments highlight considerable
improvements in accuracy because the method uncovers root causes that may have
large effect sizes at the individual level but clinically insignificant effect
sizes at the group level. An R implementation is available at
github.com/ericstrobl/RCI.
Related papers
- How Does Pruning Impact Long-Tailed Multi-Label Medical Image
Classifiers? [49.35105290167996]
Pruning has emerged as a powerful technique for compressing deep neural networks, reducing memory usage and inference time without significantly affecting overall performance.
This work represents a first step toward understanding the impact of pruning on model behavior in deep long-tailed, multi-label medical image classification.
arXiv Detail & Related papers (2023-08-17T20:40:30Z) - A Causal Framework for Decomposing Spurious Variations [68.12191782657437]
We develop tools for decomposing spurious variations in Markovian and Semi-Markovian models.
We prove the first results that allow a non-parametric decomposition of spurious effects.
The described approach has several applications, ranging from explainable and fair AI to questions in epidemiology and medicine.
arXiv Detail & Related papers (2023-06-08T09:40:28Z) - Nonparametric Identifiability of Causal Representations from Unknown
Interventions [63.1354734978244]
We study causal representation learning, the task of inferring latent causal variables and their causal relations from mixtures of the variables.
Our goal is to identify both the ground truth latents and their causal graph up to a set of ambiguities which we show to be irresolvable from interventional data.
arXiv Detail & Related papers (2023-06-01T10:51:58Z) - Counterfactual Formulation of Patient-Specific Root Causes of Disease [7.6146285961466]
Root causes of disease intuitively correspond to root vertices that increase the likelihood of a diagnosis.
Prior work defined patient-specific root causes of disease using an interventionalist account that only climbs to the second rung of Pearl's Ladder of Causation.
We propose a counterfactual definition matching clinical intuition based on fixed factual data alone.
arXiv Detail & Related papers (2023-05-27T20:24:27Z) - Causal Discovery with Stage Variables for Health Time Series [3.0712335337791288]
Causal Discovery with Stage Variables (CDSV) uses stage variables to reweight data from multiple time-series while accounting for different causal relationships in each stage.
In simulated data, CDSV discovers more causes with fewer false discoveries compared to baselines, in eICU it has a lower FDR than baselines, and in MIMIC-III it discovers more clinically relevant causes of high blood pressure.
arXiv Detail & Related papers (2023-05-05T16:30:28Z) - Sample-Specific Root Causal Inference with Latent Variables [10.885111578191564]
Root causal analysis seeks to identify the set of initial perturbations that induce an unwanted outcome.
We rigorously quantified predictivity using Shapley values.
We introduce a corresponding procedure called Extract Errors with Latents (EEL) for recovering the error terms up to contamination.
arXiv Detail & Related papers (2022-10-27T11:33:26Z) - Identifying Patient-Specific Root Causes with the Heteroscedastic Noise
Model [10.885111578191564]
We focus on identifying patient-specific root causes of disease, which we equate to the sample-specific predictivity of the error terms in a structural equation model.
A customized algorithm called Generalized Root Causal Inference (GRCI) is used to extract the error terms correctly.
arXiv Detail & Related papers (2022-05-25T23:51:31Z) - Variance Minimization in the Wasserstein Space for Invariant Causal
Prediction [72.13445677280792]
In this work, we show that the approach taken in ICP may be reformulated as a series of nonparametric tests that scales linearly in the number of predictors.
Each of these tests relies on the minimization of a novel loss function that is derived from tools in optimal transport theory.
We prove under mild assumptions that our method is able to recover the set of identifiable direct causes, and we demonstrate in our experiments that it is competitive with other benchmark causal discovery algorithms.
arXiv Detail & Related papers (2021-10-13T22:30:47Z) - The Causal Neural Connection: Expressiveness, Learnability, and
Inference [125.57815987218756]
An object called structural causal model (SCM) represents a collection of mechanisms and sources of random variation of the system under investigation.
In this paper, we show that the causal hierarchy theorem (Thm. 1, Bareinboim et al., 2020) still holds for neural models.
We introduce a special type of SCM called a neural causal model (NCM), and formalize a new type of inductive bias to encode structural constraints necessary for performing causal inferences.
arXiv Detail & Related papers (2021-07-02T01:55:18Z) - Efficient Causal Inference from Combined Observational and
Interventional Data through Causal Reductions [68.6505592770171]
Unobserved confounding is one of the main challenges when estimating causal effects.
We propose a novel causal reduction method that replaces an arbitrary number of possibly high-dimensional latent confounders.
We propose a learning algorithm to estimate the parameterized reduced model jointly from observational and interventional data.
arXiv Detail & Related papers (2021-03-08T14:29:07Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.