Augmented Learning of Heterogeneous Treatment Effects via Gradient
Boosting Trees
- URL: http://arxiv.org/abs/2302.01367v1
- Date: Thu, 2 Feb 2023 19:10:04 GMT
- Title: Augmented Learning of Heterogeneous Treatment Effects via Gradient
Boosting Trees
- Authors: Heng Chen, Michael L. LeBlanc and James Y. Dai
- Abstract summary: We propose a two-stage statistical learning procedure for estimating Heterogeneous treatment effects (HTE)
The first stage is to estimate the main-effect equivalency of the baseline markers on the outcome, which is then used as an augmentation term in the second stage estimation for HTE.
A permutation test is proposed for global assessment of evidence for HTE.
- Score: 0.2320417845168326
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Heterogeneous treatment effects (HTE) based on patients' genetic or clinical
factors are of significant interest to precision medicine. Simultaneously
modeling HTE and corresponding main effects for randomized clinical trials with
high-dimensional predictive markers is challenging. Motivated by the modified
covariates approach, we propose a two-stage statistical learning procedure for
estimating HTE with optimal efficiency augmentation, generalizing to arbitrary
interaction model and exploiting powerful extreme gradient boosting trees
(XGBoost). Target estimands for HTE are defined in the scale of mean difference
for quantitative outcomes, or risk ratio for binary outcomes, which are the
minimizers of specialized loss functions. The first stage is to estimate the
main-effect equivalency of the baseline markers on the outcome, which is then
used as an augmentation term in the second stage estimation for HTE. The
proposed two-stage procedure is robust to model mis-specification of main
effects and improves efficiency for estimating HTE through nonparametric
function estimation, e.g., XGBoost. A permutation test is proposed for global
assessment of evidence for HTE. An analysis of a genetic study in Prostate
Cancer Prevention Trial led by the SWOG Cancer Research Network, is conducted
to showcase the properties and the utilities of the two-stage method.
Related papers
- C-XGBoost: A tree boosting model for causal effect estimation [8.246161706153805]
Causal effect estimation aims at estimating the Average Treatment Effect as well as the Conditional Average Treatment Effect of a treatment to an outcome from the available data.
We propose a new causal inference model, named C-XGBoost, for the prediction of potential outcomes.
arXiv Detail & Related papers (2024-03-31T17:43:37Z) - A Weighted Prognostic Covariate Adjustment Method for Efficient and
Powerful Treatment Effect Inferences in Randomized Controlled Trials [0.28087862620958753]
A crucial task for a randomized controlled trial (RCT) is to specify a statistical method that can yield an efficient estimator and powerful test for the treatment effect.
Training a generative AI algorithm on historical control data enables one to construct a digital twin generator (DTG) for RCT participants.
DTG generates a probability distribution for RCT participants' potential control outcome.
arXiv Detail & Related papers (2023-09-25T16:14:13Z) - Heterogeneous Treatment Effect Estimation for Observational Data using
Model-based Forests [0.0]
We propose modifications to model-based forests to address the confounding issue in observational data.
We found that this strategy reduces confounding effects in a simulated study with various outcome distributions.
We demonstrate the practical aspects of HTE estimation for survival and ordinal outcomes by an assessment of the potentially heterogeneous effect of Riluzole on the progress of Amyotrophic Lateral Sclerosis.
arXiv Detail & Related papers (2022-10-06T11:49:39Z) - Robust and Agnostic Learning of Conditional Distributional Treatment
Effects [62.44901952244514]
The conditional average treatment effect (CATE) is the best point prediction of individual causal effects.
In aggregate analyses, this is usually addressed by measuring distributional treatment effect (DTE)
We provide a new robust and model-agnostic methodology for learning the conditional DTE (CDTE) for a wide class of problems.
arXiv Detail & Related papers (2022-05-23T17:40:31Z) - Assessment of Treatment Effect Estimators for Heavy-Tailed Data [70.72363097550483]
A central obstacle in the objective assessment of treatment effect (TE) estimators in randomized control trials (RCTs) is the lack of ground truth (or validation set) to test their performance.
We provide a novel cross-validation-like methodology to address this challenge.
We evaluate our methodology across 709 RCTs implemented in the Amazon supply chain.
arXiv Detail & Related papers (2021-12-14T17:53:01Z) - Cardiovascular Disease Prediction using Recursive Feature Elimination
and Gradient Boosting Classification Techniques [0.0]
This paper proposes a proposed gradient boosting (RFE-GB) algorithm in order to obtain accurate heart disease prediction.
The patients health record with important CVD features has been analyzed for the evaluation of the results.
arXiv Detail & Related papers (2021-06-11T16:17:42Z) - Bootstrapping Your Own Positive Sample: Contrastive Learning With
Electronic Health Record Data [62.29031007761901]
This paper proposes a novel contrastive regularized clinical classification model.
We introduce two unique positive sampling strategies specifically tailored for EHR data.
Our framework yields highly competitive experimental results in predicting the mortality risk on real-world COVID-19 EHR data.
arXiv Detail & Related papers (2021-04-07T06:02:04Z) - Bayesian prognostic covariate adjustment [59.75318183140857]
Historical data about disease outcomes can be integrated into the analysis of clinical trials in many ways.
We build on existing literature that uses prognostic scores from a predictive model to increase the efficiency of treatment effect estimates.
arXiv Detail & Related papers (2020-12-24T05:19:03Z) - Increasing the efficiency of randomized trial estimates via linear
adjustment for a prognostic score [59.75318183140857]
Estimating causal effects from randomized experiments is central to clinical research.
Most methods for historical borrowing achieve reductions in variance by sacrificing strict type-I error rate control.
arXiv Detail & Related papers (2020-12-17T21:10:10Z) - Improved Preterm Prediction Based on Optimized Synthetic Sampling of EHG
Signal [3.0625456792807424]
The inter-relationship between uterine contraction and electrical activities makes uterine electrohysterogram (EHG) a promising direction for preterm detection and prediction.
Due the scarcity of EHG signals, especially those of preterm patients, synthetic algorithms are applied to create artificial samples of preterm type.
arXiv Detail & Related papers (2020-07-03T01:12:31Z) - Learning for Dose Allocation in Adaptive Clinical Trials with Safety
Constraints [84.09488581365484]
Phase I dose-finding trials are increasingly challenging as the relationship between efficacy and toxicity of new compounds becomes more complex.
Most commonly used methods in practice focus on identifying a Maximum Tolerated Dose (MTD) by learning only from toxicity events.
We present a novel adaptive clinical trial methodology that aims at maximizing the cumulative efficacies while satisfying the toxicity safety constraint with high probability.
arXiv Detail & Related papers (2020-06-09T03:06:45Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.