Causal machine learning for heterogeneous treatment effects in the presence of missing outcome data
- URL: http://arxiv.org/abs/2412.19711v2
- Date: Sun, 13 Apr 2025 10:24:57 GMT
- Title: Causal machine learning for heterogeneous treatment effects in the presence of missing outcome data
- Authors: Matthew Pryce, Karla Diaz-Ordaz, Ruth H. Keogh, Stijn Vansteelandt,
- Abstract summary: We discuss the impact that missing outcome data has on causal machine learning estimators for the conditional average treatment effect (CATE)<n>We propose two de-biased machine learning estimators for the CATE, the mDR-learner and mEP-learner, which address the issue of under-representation.<n>We show that under reasonable conditions, these estimators are oracle efficient, and illustrate their favorable performance through simulated data settings.
- Score: 0.9087641068861047
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: When estimating heterogeneous treatment effects, missing outcome data can complicate treatment effect estimation, causing certain subgroups of the population to be poorly represented. In this work, we discuss this commonly overlooked problem and consider the impact that missing at random (MAR) outcome data has on causal machine learning estimators for the conditional average treatment effect (CATE). We propose two de-biased machine learning estimators for the CATE, the mDR-learner and mEP-learner, which address the issue of under-representation by integrating inverse probability of censoring weights into the DR-learner and EP-learner respectively. We show that under reasonable conditions, these estimators are oracle efficient, and illustrate their favorable performance through simulated data settings, comparing them to existing CATE estimators, including comparison to estimators which use common missing data techniques. We present an example of their application using the GBSG2 trial, exploring treatment effect heterogeneity when comparing hormonal therapies to non-hormonal therapies among breast cancer patients post surgery, and offer guidance on the decisions a practitioner must make when implementing these estimators.
Related papers
- Counterfactual Data Augmentation with Contrastive Learning [27.28511396131235]
We introduce a model-agnostic data augmentation method that imputes the counterfactual outcomes for a selected subset of individuals.
We use contrastive learning to learn a representation space and a similarity measure such that in the learned representation space close individuals identified by the learned similarity measure have similar potential outcomes.
This property ensures reliable imputation of counterfactual outcomes for the individuals with close neighbors from the alternative treatment group.
arXiv Detail & Related papers (2023-11-07T00:36:51Z) - Estimating treatment effects from single-arm trials via latent-variable
modeling [14.083487062917085]
Single-arm trials, where all patients belong to the treatment group, can be a viable alternative but require access to an external control group.
We propose an identifiable deep latent-variable model for this scenario.
Our results show improved performance both for direct treatment effect estimation as well as for effect estimation via patient matching.
arXiv Detail & Related papers (2023-11-06T10:12:54Z) - Efficient estimation of weighted cumulative treatment effects by
double/debiased machine learning [3.086361225427304]
We propose a class of one-step cross-fitted double/debiased machine learning estimators for the weighted cumulative causal effect.
We apply the proposed methods to real-world observational data from a UK primary care database to compare the effects of anti-diabetic drugs on cancer outcomes.
arXiv Detail & Related papers (2023-05-03T18:19:18Z) - B-Learner: Quasi-Oracle Bounds on Heterogeneous Causal Effects Under
Hidden Confounding [51.74479522965712]
We propose a meta-learner called the B-Learner, which can efficiently learn sharp bounds on the CATE function under limits on hidden confounding.
We prove its estimates are valid, sharp, efficient, and have a quasi-oracle property with respect to the constituent estimators under more general conditions than existing methods.
arXiv Detail & Related papers (2023-04-20T18:07:19Z) - Treatment Effect Estimation with Observational Network Data using
Machine Learning [0.0]
Causal inference methods for treatment effect estimation usually assume independent units.
We develop augmented inverse probability (AIPW) for estimation and inference of the direct effect of the treatment with observational data from a single (social) network with spillover effects.
arXiv Detail & Related papers (2022-06-29T12:52:41Z) - BITES: Balanced Individual Treatment Effect for Survival data [0.0]
Estimating the effects of interventions on patient outcome is one of the key aspects of personalized medicine.
Time-to-event data is rarely used for treatment optimization.
We suggest an approach named BITES, which combines a treatment-specific semi-parametric Cox loss with a treatment-balanced deep neural network.
arXiv Detail & Related papers (2022-01-05T10:39:31Z) - Assessment of Treatment Effect Estimators for Heavy-Tailed Data [70.72363097550483]
A central obstacle in the objective assessment of treatment effect (TE) estimators in randomized control trials (RCTs) is the lack of ground truth (or validation set) to test their performance.
We provide a novel cross-validation-like methodology to address this challenge.
We evaluate our methodology across 709 RCTs implemented in the Amazon supply chain.
arXiv Detail & Related papers (2021-12-14T17:53:01Z) - Causal Effect Variational Autoencoder with Uniform Treatment [50.895390968371665]
Causal effect variational autoencoder (CEVAE) are trained to predict the outcome given observational treatment data.
Uniform treatment variational autoencoders (UTVAE) are trained with uniform treatment distribution using importance sampling.
arXiv Detail & Related papers (2021-11-16T17:40:57Z) - SurvITE: Learning Heterogeneous Treatment Effects from Time-to-Event
Data [83.50281440043241]
We study the problem of inferring heterogeneous treatment effects from time-to-event data.
We propose a novel deep learning method for treatment-specific hazard estimation based on balancing representations.
arXiv Detail & Related papers (2021-10-26T20:13:17Z) - Doing Great at Estimating CATE? On the Neglected Assumptions in
Benchmark Comparisons of Treatment Effect Estimators [91.3755431537592]
We show that even in arguably the simplest setting, estimation under ignorability assumptions can be misleading.
We consider two popular machine learning benchmark datasets for evaluation of heterogeneous treatment effect estimators.
We highlight that the inherent characteristics of the benchmark datasets favor some algorithms over others.
arXiv Detail & Related papers (2021-07-28T13:21:27Z) - Estimating heterogeneous survival treatment effect in observational data
using machine learning [9.951103976634407]
Methods for estimating heterogeneous treatment effect in observational data have largely focused on continuous or binary outcomes.
Using flexible machine learning methods in the counterfactual framework is a promising approach to address challenges due to complex individual characteristics.
arXiv Detail & Related papers (2020-08-17T01:02:14Z) - Enabling Counterfactual Survival Analysis with Balanced Representations [64.17342727357618]
Survival data are frequently encountered across diverse medical applications, i.e., drug development, risk profiling, and clinical trials.
We propose a theoretically grounded unified framework for counterfactual inference applicable to survival outcomes.
arXiv Detail & Related papers (2020-06-14T01:15:00Z) - Generalization Bounds and Representation Learning for Estimation of
Potential Outcomes and Causal Effects [61.03579766573421]
We study estimation of individual-level causal effects, such as a single patient's response to alternative medication.
We devise representation learning algorithms that minimize our bound, by regularizing the representation's induced treatment group distance.
We extend these algorithms to simultaneously learn a weighted representation to further reduce treatment group distances.
arXiv Detail & Related papers (2020-01-21T10:16:33Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.