Variational Bayes for high-dimensional proportional hazards models with
applications to gene expression variable selection
- URL: http://arxiv.org/abs/2112.10270v1
- Date: Sun, 19 Dec 2021 22:10:41 GMT
- Title: Variational Bayes for high-dimensional proportional hazards models with
applications to gene expression variable selection
- Authors: Michael Komodromos, Eric Aboagye, Marina Evangelou, Sarah Filippi,
Kolyan Ray
- Abstract summary: We propose a variational Bayesian proportional hazards model for prediction and variable selection regarding high-dimensional survival data.
Our method, based on a mean-field variational approximation, overcomes the high computational cost of MCMC.
We demonstrate how the proposed method can be used for variable selection on two transcriptomic datasets with censored survival outcomes.
- Score: 3.8761064607384195
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: We propose a variational Bayesian proportional hazards model for prediction
and variable selection regarding high-dimensional survival data. Our method,
based on a mean-field variational approximation, overcomes the high
computational cost of MCMC whilst retaining the useful features, providing
excellent point estimates and offering a natural mechanism for variable
selection via posterior inclusion probabilities. The performance of our
proposed method is assessed via extensive simulations and compared against
other state-of-the-art Bayesian variable selection methods, demonstrating
comparable or better performance. Finally, we demonstrate how the proposed
method can be used for variable selection on two transcriptomic datasets with
censored survival outcomes, where we identify genes with pre-existing
biological interpretations.
Related papers
- Seeing Unseen: Discover Novel Biomedical Concepts via
Geometry-Constrained Probabilistic Modeling [53.7117640028211]
We present a geometry-constrained probabilistic modeling treatment to resolve the identified issues.
We incorporate a suite of critical geometric properties to impose proper constraints on the layout of constructed embedding space.
A spectral graph-theoretic method is devised to estimate the number of potential novel classes.
arXiv Detail & Related papers (2024-03-02T00:56:05Z) - Selective Nonparametric Regression via Testing [54.20569354303575]
We develop an abstention procedure via testing the hypothesis on the value of the conditional variance at a given point.
Unlike existing methods, the proposed one allows to account not only for the value of the variance itself but also for the uncertainty of the corresponding variance predictor.
arXiv Detail & Related papers (2023-09-28T13:04:11Z) - Variational Classification [51.2541371924591]
We derive a variational objective to train the model, analogous to the evidence lower bound (ELBO) used to train variational auto-encoders.
Treating inputs to the softmax layer as samples of a latent variable, our abstracted perspective reveals a potential inconsistency.
We induce a chosen latent distribution, instead of the implicit assumption found in a standard softmax layer.
arXiv Detail & Related papers (2023-05-17T17:47:19Z) - Copula Entropy based Variable Selection for Survival Analysis [2.3980064191633232]
We propose to apply the Copula Entropy (CE)-based method for variable selection to survival analysis.
The idea is to measure the correlation between variables and time-to-event with CE and then select variables according to their CE value.
arXiv Detail & Related papers (2022-09-04T08:14:07Z) - Bayesian Variable Selection in a Million Dimensions [7.366246663367533]
We introduce an efficient MCMC scheme whose cost per iteration is sublinear in P.
We show how this scheme can be extended to generalized linear models for count data.
In experiments we demonstrate the effectiveness of our methods, including on cancer and maize genomic data.
arXiv Detail & Related papers (2022-08-02T00:11:15Z) - A Variational Bayesian Approach to Learning Latent Variables for
Acoustic Knowledge Transfer [55.20627066525205]
We propose a variational Bayesian (VB) approach to learning distributions of latent variables in deep neural network (DNN) models.
Our proposed VB approach can obtain good improvements on target devices, and consistently outperforms 13 state-of-the-art knowledge transfer algorithms.
arXiv Detail & Related papers (2021-10-16T15:54:01Z) - Fast Bayesian Variable Selection in Binomial and Negative Binomial
Regression [9.774282306558465]
We introduce an efficient MCMC scheme for variable selection in binomial and negative binomial regression, that exploits logistic regression as a special case.
In experiments we demonstrate the effectiveness of our approach, including on data with seventeen thousand covariates.
arXiv Detail & Related papers (2021-06-28T20:54:41Z) - Variable selection with missing data in both covariates and outcomes:
Imputation and machine learning [1.0333430439241666]
The missing data issue is ubiquitous in health studies.
Machine learning methods weaken parametric assumptions.
XGBoost and BART have the overall best performance across various settings.
arXiv Detail & Related papers (2021-04-06T20:18:29Z) - Efficient Ensemble Model Generation for Uncertainty Estimation with
Bayesian Approximation in Segmentation [74.06904875527556]
We propose a generic and efficient segmentation framework to construct ensemble segmentation models.
In the proposed method, ensemble models can be efficiently generated by using the layer selection method.
We also devise a new pixel-wise uncertainty loss, which improves the predictive performance.
arXiv Detail & Related papers (2020-05-21T16:08:38Z) - Interpretable random forest models through forward variable selection [0.0]
We develop a forward variable selection method using the continuous ranked probability score (CRPS) as the loss function.
We demonstrate an application of our method to statistical post-processing of daily maximum temperature forecasts in the Netherlands.
arXiv Detail & Related papers (2020-05-11T13:56:49Z) - Decision-Making with Auto-Encoding Variational Bayes [71.44735417472043]
We show that a posterior approximation distinct from the variational distribution should be used for making decisions.
Motivated by these theoretical results, we propose learning several approximate proposals for the best model.
In addition to toy examples, we present a full-fledged case study of single-cell RNA sequencing.
arXiv Detail & Related papers (2020-02-17T19:23:36Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.