Inference for High Dimensional Censored Quantile Regression
- URL: http://arxiv.org/abs/2107.10959v1
- Date: Thu, 22 Jul 2021 23:57:06 GMT
- Title: Inference for High Dimensional Censored Quantile Regression
- Authors: Zhe Fei, Qi Zheng, Hyokyoung G. Hong, Yi Li
- Abstract summary: This paper proposes a novel procedure to draw inference on all predictors within the framework of global censored quantile regression.
We show that our procedure can properly quantify the uncertainty of the estimates in high dimensional settings.
We apply our method to analyze the heterogeneous effects of SNPs residing in lung cancer pathways on patients' survival.
- Score: 8.993036560782137
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: With the availability of high dimensional genetic biomarkers, it is of
interest to identify heterogeneous effects of these predictors on patients'
survival, along with proper statistical inference. Censored quantile regression
has emerged as a powerful tool for detecting heterogeneous effects of
covariates on survival outcomes. To our knowledge, there is little work
available to draw inference on the effects of high dimensional predictors for
censored quantile regression. This paper proposes a novel procedure to draw
inference on all predictors within the framework of global censored quantile
regression, which investigates covariate-response associations over an interval
of quantile levels, instead of a few discrete values. The proposed estimator
combines a sequence of low dimensional model estimates that are based on
multi-sample splittings and variable selection. We show that, under some
regularity conditions, the estimator is consistent and asymptotically follows a
Gaussian process indexed by the quantile level. Simulation studies indicate
that our procedure can properly quantify the uncertainty of the estimates in
high dimensional settings. We apply our method to analyze the heterogeneous
effects of SNPs residing in lung cancer pathways on patients' survival, using
the Boston Lung Cancer Survival Cohort, a cancer epidemiology study on the
molecular mechanism of lung cancer.
Related papers
- Semiparametric conformal prediction [79.6147286161434]
Risk-sensitive applications require well-calibrated prediction sets over multiple, potentially correlated target variables.
We treat the scores as random vectors and aim to construct the prediction set accounting for their joint correlation structure.
We report desired coverage and competitive efficiency on a range of real-world regression problems.
arXiv Detail & Related papers (2024-11-04T14:29:02Z) - A sparse PAC-Bayesian approach for high-dimensional quantile prediction [0.0]
This paper presents a novel probabilistic machine learning approach for high-dimensional quantile prediction.
It uses a pseudo-Bayesian framework with a scaled Student-t prior and Langevin Monte Carlo for efficient computation.
Its effectiveness is validated through simulations and real-world data, where it performs competitively against established frequentist and Bayesian techniques.
arXiv Detail & Related papers (2024-09-03T08:01:01Z) - Risk and cross validation in ridge regression with correlated samples [72.59731158970894]
We provide training examples for the in- and out-of-sample risks of ridge regression when the data points have arbitrary correlations.
We further extend our analysis to the case where the test point has non-trivial correlations with the training set, setting often encountered in time series forecasting.
We validate our theory across a variety of high dimensional data.
arXiv Detail & Related papers (2024-08-08T17:27:29Z) - Causal Inference for Genomic Data with Multiple Heterogeneous Outcomes [1.5845117761091052]
We propose a generic semiparametric framework for doubly robust estimation with multiple derived outcomes.
We specialize the analysis to standardized average treatment effects and quantile treatment effects.
arXiv Detail & Related papers (2024-04-14T02:07:14Z) - Estimating Unknown Population Sizes Using the Hypergeometric Distribution [1.03590082373586]
We tackle the challenge of estimating discrete distributions when both the total population size and the sizes of its constituent categories are unknown.
We develop our approach to account for a data generating process where the ground-truth is a mixture of distributions conditional on a continuous latent variable.
Empirical data simulation demonstrates that our method outperforms other likelihood functions used to model count data.
arXiv Detail & Related papers (2024-02-22T01:53:56Z) - Bayesian Quantile Regression with Subset Selection: A Posterior Summarization Perspective [0.0]
Quantile regression is a powerful tool in epidemiological studies where interest lies in inferring how different exposures affect specific percentiles of the distribution of a health or life outcome.
Existing methods either estimate conditional quantiles separately for each quantile of interest or estimate the entire conditional distribution using semi- or non-parametric models.
We pose the fundamental problems of linear quantile estimation, uncertainty quantification, and subset selection from a Bayesian decision analysis perspective.
Our approach introduces a quantile-focused squared error loss, which enables efficient, closed-form computing and maintains a close relationship with Wasserstein-based density estimation.
arXiv Detail & Related papers (2023-11-03T17:19:31Z) - Kernel Cox partially linear regression: building predictive models for
cancer patients' survival [4.230753712933184]
We build a kernel Cox proportional hazards semi-parametric model and propose a novel regularized garrotized kernel machine (RegGKM) method to fit the model.
We use the kernel machine method to describe the complex relationship between survival and predictors, while automatically removing irrelevant parametric and non-parametric predictors.
Our results can help classify patients into groups with different death risks, facilitating treatment for better clinical outcomes.
arXiv Detail & Related papers (2023-10-11T04:27:54Z) - Selective Nonparametric Regression via Testing [54.20569354303575]
We develop an abstention procedure via testing the hypothesis on the value of the conditional variance at a given point.
Unlike existing methods, the proposed one allows to account not only for the value of the variance itself but also for the uncertainty of the corresponding variance predictor.
arXiv Detail & Related papers (2023-09-28T13:04:11Z) - Quantifying predictive uncertainty of aphasia severity in stroke patients with sparse heteroscedastic Bayesian high-dimensional regression [47.1405366895538]
Sparse linear regression methods for high-dimensional data commonly assume that residuals have constant variance, which can be violated in practice.
This paper proposes estimating high-dimensional heteroscedastic linear regression models using a heteroscedastic partitioned empirical Bayes Expectation Conditional Maximization algorithm.
arXiv Detail & Related papers (2023-09-15T22:06:29Z) - Bayesian Uncertainty Estimation of Learned Variational MRI
Reconstruction [63.202627467245584]
We introduce a Bayesian variational framework to quantify the model-immanent (epistemic) uncertainty.
We demonstrate that our approach yields competitive results for undersampled MRI reconstruction.
arXiv Detail & Related papers (2021-02-12T18:08:14Z) - Increasing the efficiency of randomized trial estimates via linear
adjustment for a prognostic score [59.75318183140857]
Estimating causal effects from randomized experiments is central to clinical research.
Most methods for historical borrowing achieve reductions in variance by sacrificing strict type-I error rate control.
arXiv Detail & Related papers (2020-12-17T21:10:10Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.