A Two-Stage Feature Selection Approach for Robust Evaluation of
Treatment Effects in High-Dimensional Observational Data
- URL: http://arxiv.org/abs/2111.13800v2
- Date: Tue, 12 Mar 2024 17:25:35 GMT
- Title: A Two-Stage Feature Selection Approach for Robust Evaluation of
Treatment Effects in High-Dimensional Observational Data
- Authors: Md Saiful Islam, Sahil Shikalgar, Md. Noor-E-Alam
- Abstract summary: We propose a novel two-stage feature selection technique called, Outcome Adaptive Elastic Net (OAENet)
OAENet is explicitly designed for making robust causal inference decisions using matching techniques.
Numerical experiments on simulated data demonstrate that OAENet significantly outperforms state-of-the-art methods.
- Score: 1.4710887888397084
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: A Randomized Control Trial (RCT) is considered as the gold standard for
evaluating the effect of any intervention or treatment. However, its
feasibility is often hindered by ethical, economical, and legal considerations,
making observational data a valuable alternative for drawing causal
conclusions. Nevertheless, healthcare observational data presents a difficult
challenge due to its high dimensionality, requiring careful consideration to
ensure unbiased, reliable, and robust causal inferences. To overcome this
challenge, in this study, we propose a novel two-stage feature selection
technique called, Outcome Adaptive Elastic Net (OAENet), explicitly designed
for making robust causal inference decisions using matching techniques. OAENet
offers several key advantages over existing methods: superior performance on
correlated and high-dimensional data compared to the existing methods and the
ability to select specific sets of variables (including confounders and
variables associated only with the outcome). This ensures robustness and
facilitates an unbiased estimate of the causal effect. Numerical experiments on
simulated data demonstrate that OAENet significantly outperforms
state-of-the-art methods by either producing a higher-quality estimate or a
comparable estimate in significantly less time. To illustrate the applicability
of OAENet, we employ large-scale US healthcare data to estimate the effect of
Opioid Use Disorder (OUD) on suicidal behavior. When compared to competing
methods, OAENet closely aligns with existing literature on the relationship
between OUD and suicidal behavior. Performance on both simulated and real-world
data highlights that OAENet notably enhances the accuracy of estimating
treatment effects or evaluating policy decision-making with causal inference.
Related papers
- C-XGBoost: A tree boosting model for causal effect estimation [8.246161706153805]
Causal effect estimation aims at estimating the Average Treatment Effect as well as the Conditional Average Treatment Effect of a treatment to an outcome from the available data.
We propose a new causal inference model, named C-XGBoost, for the prediction of potential outcomes.
arXiv Detail & Related papers (2024-03-31T17:43:37Z) - Efficient adjustment for complex covariates: Gaining efficiency with
DOPE [56.537164957672715]
We propose a framework that accommodates adjustment for any subset of information expressed by the covariates.
Based on our theoretical results, we propose the Debiased Outcome-adapted Propensity Estorimator (DOPE) for efficient estimation of the average treatment effect (ATE)
Our results show that the DOPE provides an efficient and robust methodology for ATE estimation in various observational settings.
arXiv Detail & Related papers (2024-02-20T13:02:51Z) - Benchmarking Bayesian Causal Discovery Methods for Downstream Treatment
Effect Estimation [137.3520153445413]
A notable gap exists in the evaluation of causal discovery methods, where insufficient emphasis is placed on downstream inference.
We evaluate seven established baseline causal discovery methods including a newly proposed method based on GFlowNets.
The results of our study demonstrate that some of the algorithms studied are able to effectively capture a wide range of useful and diverse ATE modes.
arXiv Detail & Related papers (2023-07-11T02:58:10Z) - A Causal Inference Framework for Leveraging External Controls in Hybrid
Trials [1.7942265700058988]
We consider the challenges associated with causal inference in settings where data from a randomized trial is augmented with control data from an external source.
We propose estimators, review efficiency bounds, and an approach for efficient doubly-robust estimation.
We apply the framework to a trial investigating the effect of risdisplam on motor function in patients with spinal muscular atrophy.
arXiv Detail & Related papers (2023-05-15T19:15:32Z) - B-Learner: Quasi-Oracle Bounds on Heterogeneous Causal Effects Under
Hidden Confounding [51.74479522965712]
We propose a meta-learner called the B-Learner, which can efficiently learn sharp bounds on the CATE function under limits on hidden confounding.
We prove its estimates are valid, sharp, efficient, and have a quasi-oracle property with respect to the constituent estimators under more general conditions than existing methods.
arXiv Detail & Related papers (2023-04-20T18:07:19Z) - Causal Inference under Data Restrictions [0.0]
This dissertation focuses on modern causal inference under uncertainty and data restrictions.
It includes applications to neoadjuvant clinical trials, distributed data networks, and robust individualized decision making.
arXiv Detail & Related papers (2023-01-20T20:14:32Z) - CausalBench: A Large-scale Benchmark for Network Inference from
Single-cell Perturbation Data [61.088705993848606]
We introduce CausalBench, a benchmark suite for evaluating causal inference methods on real-world interventional data.
CaulBench incorporates biologically-motivated performance metrics, including new distribution-based interventional metrics.
arXiv Detail & Related papers (2022-10-31T13:04:07Z) - Demarcating Endogenous and Exogenous Opinion Dynamics: An Experimental
Design Approach [27.975266406080152]
In this paper, we design a suite of unsupervised classification methods based on experimental design approaches.
We aim to select the subsets of events which minimize different measures of mean estimation error.
Our experiments range from validating prediction performance on unsanitized and sanitized events to checking the effect of selecting optimal subsets of various sizes.
arXiv Detail & Related papers (2021-02-11T11:38:15Z) - Confounding Feature Acquisition for Causal Effect Estimation [6.174721516017138]
We frame this challenge as a problem of feature acquisition of confounding features for causal inference.
Our goal is to prioritize acquiring values for a fixed and known subset of missing confounders in samples that lead to efficient average treatment effect estimation.
arXiv Detail & Related papers (2020-11-17T16:28:43Z) - Enabling Counterfactual Survival Analysis with Balanced Representations [64.17342727357618]
Survival data are frequently encountered across diverse medical applications, i.e., drug development, risk profiling, and clinical trials.
We propose a theoretically grounded unified framework for counterfactual inference applicable to survival outcomes.
arXiv Detail & Related papers (2020-06-14T01:15:00Z) - Decomposed Adversarial Learned Inference [118.27187231452852]
We propose a novel approach, Decomposed Adversarial Learned Inference (DALI)
DALI explicitly matches prior and conditional distributions in both data and code spaces.
We validate the effectiveness of DALI on the MNIST, CIFAR-10, and CelebA datasets.
arXiv Detail & Related papers (2020-04-21T20:00:35Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.