Additive interaction modelling using I-priors
- URL: http://arxiv.org/abs/2007.15766v4
- Date: Tue, 13 Jun 2023 15:22:31 GMT
- Title: Additive interaction modelling using I-priors
- Authors: Wicher Bergsma and Haziq Jamil
- Abstract summary: We introduce a parsimonious specification of models with interactions, which has two benefits.
It reduces the number of scale parameters and thus facilitates the estimation of models with interactions.
- Score: 0.571097144710995
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Additive regression models with interactions are widely studied in the
literature, using methods such as splines or Gaussian process regression.
However, these methods can pose challenges for estimation and model selection,
due to the presence of many smoothing parameters and the lack of suitable
criteria. We propose to address these challenges by extending the I-prior
methodology (Bergsma, 2020) to multiple covariates, which may be
multidimensional. The I-prior methodology has some advantages over other
methods, such as Gaussian process regression and Tikhonov regularization, both
theoretically and practically. In particular, the I-prior is a proper prior, is
based on minimal assumptions, yields an admissible posterior mean, and
estimation of the scale (or smoothing) parameters can be done using an EM
algorithm with simple E and M steps. Moreover, we introduce a parsimonious
specification of models with interactions, which has two benefits: (i) it
reduces the number of scale parameters and thus facilitates the estimation of
models with interactions, and (ii) it enables straightforward model selection
(among models with different interactions) based on the marginal likelihood.
Related papers
- SMILE: Zero-Shot Sparse Mixture of Low-Rank Experts Construction From Pre-Trained Foundation Models [85.67096251281191]
We present an innovative approach to model fusion called zero-shot Sparse MIxture of Low-rank Experts (SMILE) construction.
SMILE allows for the upscaling of source models into an MoE model without extra data or further training.
We conduct extensive experiments across diverse scenarios, such as image classification and text generation tasks, using full fine-tuning and LoRA fine-tuning.
arXiv Detail & Related papers (2024-08-19T17:32:15Z) - Solving Inverse Problems with Model Mismatch using Untrained Neural Networks within Model-based Architectures [14.551812310439004]
We introduce an untrained forward model residual block within the model-based architecture to match the data consistency in the measurement domain for each instance.
Our approach offers a unified solution that is less parameter-sensitive, requires no additional data, and enables simultaneous fitting of the forward model and reconstruction in a single pass.
arXiv Detail & Related papers (2024-03-07T19:02:13Z) - Multi-Response Heteroscedastic Gaussian Process Models and Their
Inference [1.52292571922932]
We propose a novel framework for the modeling of heteroscedastic covariance functions.
We employ variational inference to approximate the posterior and facilitate posterior predictive modeling.
We show that our proposed framework offers a robust and versatile tool for a wide array of applications.
arXiv Detail & Related papers (2023-08-29T15:06:47Z) - Sparse high-dimensional linear regression with a partitioned empirical
Bayes ECM algorithm [62.997667081978825]
We propose a computationally efficient and powerful Bayesian approach for sparse high-dimensional linear regression.
Minimal prior assumptions on the parameters are used through the use of plug-in empirical Bayes estimates.
The proposed approach is implemented in the R package probe.
arXiv Detail & Related papers (2022-09-16T19:15:50Z) - Time varying regression with hidden linear dynamics [74.9914602730208]
We revisit a model for time-varying linear regression that assumes the unknown parameters evolve according to a linear dynamical system.
Counterintuitively, we show that when the underlying dynamics are stable the parameters of this model can be estimated from data by combining just two ordinary least squares estimates.
arXiv Detail & Related papers (2021-12-29T23:37:06Z) - A Variational Inference Approach to Inverse Problems with Gamma
Hyperpriors [60.489902135153415]
This paper introduces a variational iterative alternating scheme for hierarchical inverse problems with gamma hyperpriors.
The proposed variational inference approach yields accurate reconstruction, provides meaningful uncertainty quantification, and is easy to implement.
arXiv Detail & Related papers (2021-11-26T06:33:29Z) - MINIMALIST: Mutual INformatIon Maximization for Amortized Likelihood
Inference from Sampled Trajectories [61.3299263929289]
Simulation-based inference enables learning the parameters of a model even when its likelihood cannot be computed in practice.
One class of methods uses data simulated with different parameters to infer an amortized estimator for the likelihood-to-evidence ratio.
We show that this approach can be formulated in terms of mutual information between model parameters and simulated data.
arXiv Detail & Related papers (2021-06-03T12:59:16Z) - Towards a Better Understanding of Linear Models for Recommendation [28.422943262159933]
We derivation and analysis the closed-form solutions for two basic regression and matrix factorization approaches.
We introduce a new learning algorithm in searching (hyper) parameters for the closed-form solution.
The experimental results demonstrate that the basic models and their closed-form solutions are indeed quite competitive against the state-of-the-art models.
arXiv Detail & Related papers (2021-05-27T04:17:04Z) - Gaussian Process Models with Low-Rank Correlation Matrices for Both
Continuous and Categorical Inputs [0.0]
We introduce a method that uses low-rank approximations of cross-correlation matrices in mixed continuous and categorical Gaussian Process models.
Low-Rank Correlation (LRC) offers the ability to flexibly adapt the number of parameters to the problem at hand by choosing an appropriate rank of the approximation.
arXiv Detail & Related papers (2020-10-06T09:38:35Z) - Estimation of Switched Markov Polynomial NARX models [75.91002178647165]
We identify a class of models for hybrid dynamical systems characterized by nonlinear autoregressive (NARX) components.
The proposed approach is demonstrated on a SMNARX problem composed by three nonlinear sub-models with specific regressors.
arXiv Detail & Related papers (2020-09-29T15:00:47Z) - Bayesian System ID: Optimal management of parameter, model, and
measurement uncertainty [0.0]
We evaluate the robustness of a probabilistic formulation of system identification (ID) to sparse, noisy, and indirect data.
We show that the log posterior has improved geometric properties compared with the objective function surfaces of traditional methods.
arXiv Detail & Related papers (2020-03-04T22:48:30Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.