Conjugate priors for count and rounded data regression
- URL: http://arxiv.org/abs/2110.12316v1
- Date: Sat, 23 Oct 2021 23:26:01 GMT
- Title: Conjugate priors for count and rounded data regression
- Authors: Daniel R. Kowal
- Abstract summary: We introduce conjugate priors that enable closed-form posterior inference.
Key posterior and predictive functionals are computable analytically or via direct Monte Carlo simulation.
These tools are broadly useful for linear regression, nonlinear models via basis expansions, and model and variable selection.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Discrete data are abundant and often arise as counts or rounded data.
However, even for linear regression models, conjugate priors and closed-form
posteriors are typically unavailable, thereby necessitating approximations or
Markov chain Monte Carlo for posterior inference. For a broad class of count
and rounded data regression models, we introduce conjugate priors that enable
closed-form posterior inference. Key posterior and predictive functionals are
computable analytically or via direct Monte Carlo simulation. Crucially, the
predictive distributions are discrete to match the support of the data and can
be evaluated or simulated jointly across multiple covariate values. These tools
are broadly useful for linear regression, nonlinear models via basis
expansions, and model and variable selection. Multiple simulation studies
demonstrate significant advantages in computing, predictive modeling, and
selection relative to existing alternatives.
Related papers
- von Mises Quasi-Processes for Bayesian Circular Regression [57.88921637944379]
We explore a family of expressive and interpretable distributions over circle-valued random functions.
The resulting probability model has connections with continuous spin models in statistical physics.
For posterior inference, we introduce a new Stratonovich-like augmentation that lends itself to fast Markov Chain Monte Carlo sampling.
arXiv Detail & Related papers (2024-06-19T01:57:21Z) - Diffusion posterior sampling for simulation-based inference in tall data settings [53.17563688225137]
Simulation-based inference ( SBI) is capable of approximating the posterior distribution that relates input parameters to a given observation.
In this work, we consider a tall data extension in which multiple observations are available to better infer the parameters of the model.
We compare our method to recently proposed competing approaches on various numerical experiments and demonstrate its superiority in terms of numerical stability and computational cost.
arXiv Detail & Related papers (2024-04-11T09:23:36Z) - Fusion of Gaussian Processes Predictions with Monte Carlo Sampling [61.31380086717422]
In science and engineering, we often work with models designed for accurate prediction of variables of interest.
Recognizing that these models are approximations of reality, it becomes desirable to apply multiple models to the same data and integrate their outcomes.
arXiv Detail & Related papers (2024-03-03T04:21:21Z) - Adaptive Optimization for Prediction with Missing Data [6.800113478497425]
We show that some adaptive linear regression models are equivalent to learning an imputation rule and a downstream linear regression model simultaneously.
In settings where data is strongly not missing at random, our methods achieve a 2-10% improvement in out-of-sample accuracy.
arXiv Detail & Related papers (2024-02-02T16:35:51Z) - Conformal inference for regression on Riemannian Manifolds [49.7719149179179]
We investigate prediction sets for regression scenarios when the response variable, denoted by $Y$, resides in a manifold, and the covariable, denoted by X, lies in Euclidean space.
We prove the almost sure convergence of the empirical version of these regions on the manifold to their population counterparts.
arXiv Detail & Related papers (2023-10-12T10:56:25Z) - Engression: Extrapolation through the Lens of Distributional Regression [2.519266955671697]
We propose a neural network-based distributional regression methodology called engression'
An engression model is generative in the sense that we can sample from the fitted conditional distribution and is also suitable for high-dimensional outcomes.
We show that engression can successfully perform extrapolation under some assumptions such as monotonicity, whereas traditional regression approaches such as least-squares or quantile regression fall short under the same assumptions.
arXiv Detail & Related papers (2023-07-03T08:19:00Z) - Probabilistic Unrolling: Scalable, Inverse-Free Maximum Likelihood
Estimation for Latent Gaussian Models [69.22568644711113]
We introduce probabilistic unrolling, a method that combines Monte Carlo sampling with iterative linear solvers to circumvent matrix inversions.
Our theoretical analyses reveal that unrolling and backpropagation through the iterations of the solver can accelerate gradient estimation for maximum likelihood estimation.
In experiments on simulated and real data, we demonstrate that probabilistic unrolling learns latent Gaussian models up to an order of magnitude faster than gradient EM, with minimal losses in model performance.
arXiv Detail & Related papers (2023-06-05T21:08:34Z) - ecpc: An R-package for generic co-data models for high-dimensional
prediction [0.0]
R-package ecpc originally accommodated various and possibly multiple co-data sources.
We present an extension to the method and software for generic co-data models.
We show how ridge penalties may be transformed to elastic net penalties with the R-package squeezy.
arXiv Detail & Related papers (2022-05-16T12:55:19Z) - Two-step penalised logistic regression for multi-omic data with an
application to cardiometabolic syndrome [62.997667081978825]
We implement a two-step approach to multi-omic logistic regression in which variable selection is performed on each layer separately.
Our approach should be preferred if the goal is to select as many relevant predictors as possible.
Our proposed approach allows us to identify features that characterise cardiometabolic syndrome at the molecular level.
arXiv Detail & Related papers (2020-08-01T10:36:27Z) - Variable fusion for Bayesian linear regression via spike-and-slab priors [0.0]
This paper presents a novel variable fusion method in terms of Bayesian linear regression models.
A spike-and-slab prior is tailored to perform variable fusion.
Simulation studies and a real data analysis show that our proposed method achieves better performance than previous methods.
arXiv Detail & Related papers (2020-03-30T09:38:00Z) - Review of Probability Distributions for Modeling Count Data [0.0]
Generalized linear models enable direct modeling of counts in a regression context.
When counts contain only relative information, multinomial or Dirichlet-multinomial models can be more appropriate.
arXiv Detail & Related papers (2020-01-10T18:28:19Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.