Determinantal Point Processes Implicitly Regularize Semi-parametric
Regression Problems
- URL: http://arxiv.org/abs/2011.06964v2
- Date: Tue, 9 Mar 2021 13:47:11 GMT
- Title: Determinantal Point Processes Implicitly Regularize Semi-parametric
Regression Problems
- Authors: Micha\"el Fanuel, Joachim Schreurs, Johan A.K. Suykens
- Abstract summary: We discuss the use of a finite Determinantal Point Process (DPP) for approximating semi-parametric models.
With the help of this formalism, we derive a key identity illustrating the implicit regularization effect of determinantal sampling.
Also, a novel projected Nystr"om approximation is defined and used to derive a bound on the expected risk for the corresponding approximation.
- Score: 13.136143245702915
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Semi-parametric regression models are used in several applications which
require comprehensibility without sacrificing accuracy. Typical examples are
spline interpolation in geophysics, or non-linear time series problems, where
the system includes a linear and non-linear component. We discuss here the use
of a finite Determinantal Point Process (DPP) for approximating semi-parametric
models. Recently, Barthelm\'e, Tremblay, Usevich, and Amblard introduced a
novel representation of some finite DPPs. These authors formulated extended
L-ensembles that can conveniently represent partial-projection DPPs and suggest
their use for optimal interpolation. With the help of this formalism, we derive
a key identity illustrating the implicit regularization effect of determinantal
sampling for semi-parametric regression and interpolation. Also, a novel
projected Nystr\"om approximation is defined and used to derive a bound on the
expected risk for the corresponding approximation of semi-parametric
regression. This work naturally extends similar results obtained for kernel
ridge regression.
Related papers
- Multivariate root-n-consistent smoothing parameter free matching estimators and estimators of inverse density weighted expectations [51.000851088730684]
We develop novel modifications of nearest-neighbor and matching estimators which converge at the parametric $sqrt n $-rate.
We stress that our estimators do not involve nonparametric function estimators and in particular do not rely on sample-size dependent parameters smoothing.
arXiv Detail & Related papers (2024-07-11T13:28:34Z) - Overparameterized Multiple Linear Regression as Hyper-Curve Fitting [0.0]
It is proven that a linear model will produce exact predictions even in the presence of nonlinear dependencies that violate the model assumptions.
The hyper-curve approach is especially suited for the regularization of problems with noise in predictor variables and can be used to remove noisy and "improper" predictors from the model.
arXiv Detail & Related papers (2024-04-11T15:43:11Z) - Gradient-based bilevel optimization for multi-penalty Ridge regression
through matrix differential calculus [0.46040036610482665]
We introduce a gradient-based approach to the problem of linear regression with l2-regularization.
We show that our approach outperforms LASSO, Ridge, and Elastic Net regression.
The analytical of the gradient proves to be more efficient in terms of computational time compared to automatic differentiation.
arXiv Detail & Related papers (2023-11-23T20:03:51Z) - Monte Carlo inference for semiparametric Bayesian regression [5.488491124945426]
This paper introduces a simple, general, and efficient strategy for joint posterior inference of an unknown transformation and all regression model parameters.
It delivers (1) joint posterior consistency under general conditions, including multiple model misspecifications, and (2) efficient Monte Carlo (not Markov chain Monte Carlo) inference for the transformation and all parameters for important special cases.
arXiv Detail & Related papers (2023-06-08T18:42:42Z) - Kernel-based off-policy estimation without overlap: Instance optimality
beyond semiparametric efficiency [53.90687548731265]
We study optimal procedures for estimating a linear functional based on observational data.
For any convex and symmetric function class $mathcalF$, we derive a non-asymptotic local minimax bound on the mean-squared error.
arXiv Detail & Related papers (2023-01-16T02:57:37Z) - Sparse high-dimensional linear regression with a partitioned empirical
Bayes ECM algorithm [62.997667081978825]
We propose a computationally efficient and powerful Bayesian approach for sparse high-dimensional linear regression.
Minimal prior assumptions on the parameters are used through the use of plug-in empirical Bayes estimates.
The proposed approach is implemented in the R package probe.
arXiv Detail & Related papers (2022-09-16T19:15:50Z) - Inverting brain grey matter models with likelihood-free inference: a
tool for trustable cytoarchitecture measurements [62.997667081978825]
characterisation of the brain grey matter cytoarchitecture with quantitative sensitivity to soma density and volume remains an unsolved challenge in dMRI.
We propose a new forward model, specifically a new system of equations, requiring a few relatively sparse b-shells.
We then apply modern tools from Bayesian analysis known as likelihood-free inference (LFI) to invert our proposed model.
arXiv Detail & Related papers (2021-11-15T09:08:27Z) - Understanding Implicit Regularization in Over-Parameterized Single Index
Model [55.41685740015095]
We design regularization-free algorithms for the high-dimensional single index model.
We provide theoretical guarantees for the induced implicit regularization phenomenon.
arXiv Detail & Related papers (2020-07-16T13:27:47Z) - SLEIPNIR: Deterministic and Provably Accurate Feature Expansion for
Gaussian Process Regression with Derivatives [86.01677297601624]
We propose a novel approach for scaling GP regression with derivatives based on quadrature Fourier features.
We prove deterministic, non-asymptotic and exponentially fast decaying error bounds which apply for both the approximated kernel as well as the approximated posterior.
arXiv Detail & Related papers (2020-03-05T14:33:20Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.