Prognostic Covariate Adjustment for Logistic Regression in Randomized
Controlled Trials
- URL: http://arxiv.org/abs/2402.18900v1
- Date: Thu, 29 Feb 2024 06:53:16 GMT
- Title: Prognostic Covariate Adjustment for Logistic Regression in Randomized
Controlled Trials
- Authors: Yunfan Li and Arman Sabbaghi and Jonathan R. Walsh and Charles K.
Fisher
- Abstract summary: We show that prognostic score adjustment can increase the power of the Wald test for the conditional odds ratio under a fixed sample size.
We utilize g-computation to expand the scope of prognostic score adjustment to inferences on the marginal risk difference, relative risk, and odds ratio estimands.
- Score: 1.5020330976600735
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Randomized controlled trials (RCTs) with binary primary endpoints introduce
novel challenges for inferring the causal effects of treatments. The most
significant challenge is non-collapsibility, in which the conditional odds
ratio estimand under covariate adjustment differs from the unconditional
estimand in the logistic regression analysis of RCT data. This issue gives rise
to apparent paradoxes, such as the variance of the estimator for the
conditional odds ratio from a covariate-adjusted model being greater than the
variance of the estimator from the unadjusted model. We address this challenge
in the context of adjustment based on predictions of control outcomes from
generative artificial intelligence (AI) algorithms, which are referred to as
prognostic scores. We demonstrate that prognostic score adjustment in logistic
regression increases the power of the Wald test for the conditional odds ratio
under a fixed sample size, or alternatively reduces the necessary sample size
to achieve a desired power, compared to the unadjusted analysis. We derive
formulae for prospective calculations of the power gain and sample size
reduction that can result from adjustment for the prognostic score.
Furthermore, we utilize g-computation to expand the scope of prognostic score
adjustment to inferences on the marginal risk difference, relative risk, and
odds ratio estimands. We demonstrate the validity of our formulae via extensive
simulation studies that encompass different types of logistic regression model
specifications. Our simulation studies also indicate how prognostic score
adjustment can reduce the variance of g-computation estimators for the marginal
estimands while maintaining frequentist properties such as asymptotic
unbiasedness and Type I error rate control. Our methodology can ultimately
enable more definitive and conclusive analyses for RCTs with binary primary
endpoints.
Related papers
- Deep Learning Methods for the Noniterative Conditional Expectation G-Formula for Causal Inference from Complex Observational Data [3.0958655016140892]
The g-formula can be used to estimate causal effects of sustained treatment strategies using observational data.
Parametric models are subject to model misspecification, which may result in biased causal estimates.
We propose a unified deep learning framework for the NICE g-formula estimator.
arXiv Detail & Related papers (2024-10-28T21:00:46Z) - Risk and cross validation in ridge regression with correlated samples [72.59731158970894]
We provide training examples for the in- and out-of-sample risks of ridge regression when the data points have arbitrary correlations.
We further extend our analysis to the case where the test point has non-trivial correlations with the training set, setting often encountered in time series forecasting.
We validate our theory across a variety of high dimensional data.
arXiv Detail & Related papers (2024-08-08T17:27:29Z) - High Precision Causal Model Evaluation with Conditional Randomization [10.23470075454725]
We introduce a novel low-variance estimator for causal error, dubbed as the pairs estimator.
By applying the same IPW estimator to both the model and true experimental effects, our estimator effectively cancels out the variance due to IPW and achieves a smaller variance.
Our method offers a simple yet powerful solution to evaluate causal inference models in conditional randomization settings without complicated modification of the IPW estimator itself.
arXiv Detail & Related papers (2023-11-03T13:22:27Z) - Selective Nonparametric Regression via Testing [54.20569354303575]
We develop an abstention procedure via testing the hypothesis on the value of the conditional variance at a given point.
Unlike existing methods, the proposed one allows to account not only for the value of the variance itself but also for the uncertainty of the corresponding variance predictor.
arXiv Detail & Related papers (2023-09-28T13:04:11Z) - Improved Policy Evaluation for Randomized Trials of Algorithmic Resource
Allocation [54.72195809248172]
We present a new estimator leveraging our proposed novel concept, that involves retrospective reshuffling of participants across experimental arms at the end of an RCT.
We prove theoretically that such an estimator is more accurate than common estimators based on sample means.
arXiv Detail & Related papers (2023-02-06T05:17:22Z) - Modeling High-Dimensional Data with Unknown Cut Points: A Fusion
Penalized Logistic Threshold Regression [2.520538806201793]
In traditional logistic regression models, the link function is often assumed to be linear and continuous in predictors.
We consider a threshold model that all continuous features are discretized into ordinal levels, which further determine the binary responses.
We find the lasso model is well suited in the problem of early detection and prediction for chronic disease like diabetes.
arXiv Detail & Related papers (2022-02-17T04:16:40Z) - Bayesian prognostic covariate adjustment [59.75318183140857]
Historical data about disease outcomes can be integrated into the analysis of clinical trials in many ways.
We build on existing literature that uses prognostic scores from a predictive model to increase the efficiency of treatment effect estimates.
arXiv Detail & Related papers (2020-12-24T05:19:03Z) - Increasing the efficiency of randomized trial estimates via linear
adjustment for a prognostic score [59.75318183140857]
Estimating causal effects from randomized experiments is central to clinical research.
Most methods for historical borrowing achieve reductions in variance by sacrificing strict type-I error rate control.
arXiv Detail & Related papers (2020-12-17T21:10:10Z) - On Signal-to-Noise Ratio Issues in Variational Inference for Deep
Gaussian Processes [55.62520135103578]
We show that the gradient estimates used in training Deep Gaussian Processes (DGPs) with importance-weighted variational inference are susceptible to signal-to-noise ratio (SNR) issues.
We show that our fix can lead to consistent improvements in the predictive performance of DGP models.
arXiv Detail & Related papers (2020-11-01T14:38:02Z) - On Model Identification and Out-of-Sample Prediction of Principal
Component Regression: Applications to Synthetic Controls [20.96904429337912]
We analyze principal component regression (PCR) in a high-dimensional error-in-variables setting with fixed design.
We establish non-asymptotic out-of-sample prediction guarantees that improve upon the best known rates.
arXiv Detail & Related papers (2020-10-27T17:07:36Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.