PowerGraph: Using neural networks and principal components to
multivariate statistical power trade-offs
- URL: http://arxiv.org/abs/2201.00719v1
- Date: Wed, 29 Dec 2021 19:06:29 GMT
- Title: PowerGraph: Using neural networks and principal components to
multivariate statistical power trade-offs
- Authors: Ajinkya K Mulay, Sean Lane and Erin Hennes
- Abstract summary: A priori statistical power estimation for planned studies with multiple model parameters is inherently a multivariate problem.
Explicit solutions in such cases are either impractical or impossible to solve, leaving researchers with the prevailing method of simulating power.
This paper explores the efficient estimation and graphing of statistical power for a study over varying model parameter combinations.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: It is increasingly acknowledged that a priori statistical power estimation
for planned studies with multiple model parameters is inherently a multivariate
problem. Power for individual parameters of interest cannot be reliably
estimated univariately because sampling variably in, correlation with, and
variance explained relative to one parameter will impact the power for another
parameter, all usual univariate considerations being equal. Explicit solutions
in such cases, especially for models with many parameters, are either
impractical or impossible to solve, leaving researchers with the prevailing
method of simulating power. However, point estimates for a vector of model
parameters are uncertain, and the impact of inaccuracy is unknown. In such
cases, sensitivity analysis is recommended such that multiple combinations of
possible observable parameter vectors are simulated to understand power
trade-offs. A limitation to this approach is that it is computationally
expensive to generate sufficient sensitivity combinations to accurately map the
power trade-off function in increasingly high dimensional spaces for the models
that social scientists estimate. This paper explores the efficient estimation
and graphing of statistical power for a study over varying model parameter
combinations. Optimally powering a study is crucial to ensure a minimum
probability of finding the hypothesized effect. We first demonstrate the impact
of varying parameter values on power for specific hypotheses of interest and
quantify the computational intensity of computing such a graph for a given
level of precision. Finally, we propose a simple and generalizable machine
learning inspired solution to cut the computational cost to less than 7\% of
what could be called a brute force approach. [abridged]
Related papers
- Variational Bayesian surrogate modelling with application to robust design optimisation [0.9626666671366836]
Surrogate models provide a quick-to-evaluate approximation to complex computational models.
We consider Bayesian inference for constructing statistical surrogates with input uncertainties and dimensionality reduction.
We demonstrate intrinsic and robust structural optimisation problems where cost functions depend on a weighted sum of the mean and standard deviation of model outputs.
arXiv Detail & Related papers (2024-04-23T09:22:35Z) - Determining the significance and relative importance of parameters of a
simulated quenching algorithm using statistical tools [0.0]
In this paper the ANOVA (ANalysis Of the VAriance) method is used to carry out an exhaustive analysis of a simulated annealing based method.
The significance and relative importance of the parameters regarding the obtained results, as well as suitable values for each of these, were obtained using ANOVA and post-hoc Tukey HSD test.
arXiv Detail & Related papers (2024-02-08T16:34:00Z) - Structured Radial Basis Function Network: Modelling Diversity for
Multiple Hypotheses Prediction [51.82628081279621]
Multi-modal regression is important in forecasting nonstationary processes or with a complex mixture of distributions.
A Structured Radial Basis Function Network is presented as an ensemble of multiple hypotheses predictors for regression problems.
It is proved that this structured model can efficiently interpolate this tessellation and approximate the multiple hypotheses target distribution.
arXiv Detail & Related papers (2023-09-02T01:27:53Z) - Learning to Bound Counterfactual Inference in Structural Causal Models
from Observational and Randomised Data [64.96984404868411]
We derive a likelihood characterisation for the overall data that leads us to extend a previous EM-based algorithm.
The new algorithm learns to approximate the (unidentifiability) region of model parameters from such mixed data sources.
It delivers interval approximations to counterfactual results, which collapse to points in the identifiable case.
arXiv Detail & Related papers (2022-12-06T12:42:11Z) - Sparse high-dimensional linear regression with a partitioned empirical
Bayes ECM algorithm [62.997667081978825]
We propose a computationally efficient and powerful Bayesian approach for sparse high-dimensional linear regression.
Minimal prior assumptions on the parameters are used through the use of plug-in empirical Bayes estimates.
The proposed approach is implemented in the R package probe.
arXiv Detail & Related papers (2022-09-16T19:15:50Z) - Data-Driven Influence Functions for Optimization-Based Causal Inference [105.5385525290466]
We study a constructive algorithm that approximates Gateaux derivatives for statistical functionals by finite differencing.
We study the case where probability distributions are not known a priori but need to be estimated from data.
arXiv Detail & Related papers (2022-08-29T16:16:22Z) - Sparse Bayesian Learning for Complex-Valued Rational Approximations [0.03392423750246091]
Surrogate models are used to alleviate the computational burden in engineering tasks.
These models show a strongly non-linear dependence on their input parameters.
We apply a sparse learning approach to the rational approximation.
arXiv Detail & Related papers (2022-06-06T12:06:13Z) - Long Story Short: Omitted Variable Bias in Causal Machine Learning [26.60315380737132]
We develop a theory of omitted variable bias for a wide range of common causal parameters.
We show how simple plausibility judgments on the maximum explanatory power of omitted variables are sufficient to bound the magnitude of the bias.
We provide flexible and efficient statistical inference methods for the bounds, which can leverage modern machine learning algorithms for estimation.
arXiv Detail & Related papers (2021-12-26T15:38:23Z) - Multivariate Probabilistic Regression with Natural Gradient Boosting [63.58097881421937]
We propose a Natural Gradient Boosting (NGBoost) approach based on nonparametrically modeling the conditional parameters of the multivariate predictive distribution.
Our method is robust, works out-of-the-box without extensive tuning, is modular with respect to the assumed target distribution, and performs competitively in comparison to existing approaches.
arXiv Detail & Related papers (2021-06-07T17:44:49Z) - Instability, Computational Efficiency and Statistical Accuracy [101.32305022521024]
We develop a framework that yields statistical accuracy based on interplay between the deterministic convergence rate of the algorithm at the population level, and its degree of (instability) when applied to an empirical object based on $n$ samples.
We provide applications of our general results to several concrete classes of models, including Gaussian mixture estimation, non-linear regression models, and informative non-response models.
arXiv Detail & Related papers (2020-05-22T22:30:52Z) - Optimal statistical inference in the presence of systematic
uncertainties using neural network optimization based on binned Poisson
likelihoods with nuisance parameters [0.0]
This work presents a novel strategy to construct the dimensionality reduction with neural networks for feature engineering.
We discuss how this approach results in an estimate of the parameters of interest that is close to optimal.
arXiv Detail & Related papers (2020-03-16T13:27:18Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.