Covariance regression with random forests
- URL: http://arxiv.org/abs/2209.08173v3
- Date: Thu, 11 May 2023 15:32:42 GMT
- Title: Covariance regression with random forests
- Authors: Cansu Alakus, Denis Larocque, Aurelie Labbe
- Abstract summary: CovRegRF is implemented in a freely available R package on CRAN.
An application of the proposed method to thyroid disease data is also presented.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Capturing the conditional covariances or correlations among the elements of a
multivariate response vector based on covariates is important to various fields
including neuroscience, epidemiology and biomedicine. We propose a new method
called Covariance Regression with Random Forests (CovRegRF) to estimate the
covariance matrix of a multivariate response given a set of covariates, using a
random forest framework. Random forest trees are built with a splitting rule
specially designed to maximize the difference between the sample covariance
matrix estimates of the child nodes. We also propose a significance test for
the partial effect of a subset of covariates. We evaluate the performance of
the proposed method and significance test through a simulation study which
shows that the proposed method provides accurate covariance matrix estimates
and that the Type-1 error is well controlled. An application of the proposed
method to thyroid disease data is also presented. CovRegRF is implemented in a
freely available R package on CRAN.
Related papers
- Semiparametric conformal prediction [79.6147286161434]
Risk-sensitive applications require well-calibrated prediction sets over multiple, potentially correlated target variables.
We treat the scores as random vectors and aim to construct the prediction set accounting for their joint correlation structure.
We report desired coverage and competitive efficiency on a range of real-world regression problems.
arXiv Detail & Related papers (2024-11-04T14:29:02Z) - Selective Nonparametric Regression via Testing [54.20569354303575]
We develop an abstention procedure via testing the hypothesis on the value of the conditional variance at a given point.
Unlike existing methods, the proposed one allows to account not only for the value of the variance itself but also for the uncertainty of the corresponding variance predictor.
arXiv Detail & Related papers (2023-09-28T13:04:11Z) - An ADMM approach for multi-response regression with overlapping groups
and interaction effects [0.0]
We propose MADMMplasso, a novel regularized regression method.
For parameter estimation, we develop an ADMM algorithm that allows us to implement the overlapping groups in a simple way.
The results from the simulations and analysis of a pharmacogenomic screen data set show that the proposed method has an advantage in handling correlated responses and interaction effects.
arXiv Detail & Related papers (2023-03-20T14:43:22Z) - Risk Consistent Multi-Class Learning from Label Proportions [64.0125322353281]
This study addresses a multiclass learning from label proportions (MCLLP) setting in which training instances are provided in bags.
Most existing MCLLP methods impose bag-wise constraints on the prediction of instances or assign them pseudo-labels.
A risk-consistent method is proposed for instance classification using the empirical risk minimization framework.
arXiv Detail & Related papers (2022-03-24T03:49:04Z) - CARMS: Categorical-Antithetic-REINFORCE Multi-Sample Gradient Estimator [60.799183326613395]
We propose an unbiased estimator for categorical random variables based on multiple mutually negatively correlated (jointly antithetic) samples.
CARMS combines REINFORCE with copula based sampling to avoid duplicate samples and reduce its variance, while keeping the estimator unbiased using importance sampling.
We evaluate CARMS on several benchmark datasets on a generative modeling task, as well as a structured output prediction task, and find it to outperform competing methods including a strong self-control baseline.
arXiv Detail & Related papers (2021-10-26T20:14:30Z) - Inference for High Dimensional Censored Quantile Regression [8.993036560782137]
This paper proposes a novel procedure to draw inference on all predictors within the framework of global censored quantile regression.
We show that our procedure can properly quantify the uncertainty of the estimates in high dimensional settings.
We apply our method to analyze the heterogeneous effects of SNPs residing in lung cancer pathways on patients' survival.
arXiv Detail & Related papers (2021-07-22T23:57:06Z) - AdaPT-GMM: Powerful and robust covariate-assisted multiple testing [0.7614628596146599]
We propose a new empirical Bayes method for co-assisted multiple testing with false discovery rate (FDR) control.
Our method refines the adaptive p-value thresholding (AdaPT) procedure by generalizing its masking scheme.
We show in extensive simulations and real data examples that our new method, which we call AdaPT-GMM, consistently delivers high power.
arXiv Detail & Related papers (2021-06-30T05:06:18Z) - On the Use of Minimum Penalties in Statistical Learning [2.1320960069210475]
We propose a framework to simultaneously estimate regression coefficients associated with a multivariate regression model and relationships between outcome variables.
An iterative algorithm that generalizes current state art methods is proposed as a solution.
We extend the proposed MinPen framework to other exponential family loss functions, with a specific focus on multiple binomial responses.
arXiv Detail & Related papers (2021-06-09T16:15:46Z) - Conditional canonical correlation estimation based on covariates with
random forests [0.0]
We propose a new method called Random Forest with Canonical Correlation Analysis (RFCCA) to estimate the conditional canonical correlations between two sets of variables.
The proposed method and the global significance test is evaluated through simulation studies that show it provides accurate canonical correlation estimations and well-controlled Type-1 error.
arXiv Detail & Related papers (2020-11-23T17:09:46Z) - Estimating Gradients for Discrete Random Variables by Sampling without
Replacement [93.09326095997336]
We derive an unbiased estimator for expectations over discrete random variables based on sampling without replacement.
We show that our estimator can be derived as the Rao-Blackwellization of three different estimators.
arXiv Detail & Related papers (2020-02-14T14:15:18Z) - Adaptive Correlated Monte Carlo for Contextual Categorical Sequence
Generation [77.7420231319632]
We adapt contextual generation of categorical sequences to a policy gradient estimator, which evaluates a set of correlated Monte Carlo (MC) rollouts for variance control.
We also demonstrate the use of correlated MC rollouts for binary-tree softmax models, which reduce the high generation cost in large vocabulary scenarios.
arXiv Detail & Related papers (2019-12-31T03:01:55Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.