Modelling High-Dimensional Categorical Data Using Nonconvex Fusion
Penalties
- URL: http://arxiv.org/abs/2002.12606v5
- Date: Fri, 17 Dec 2021 21:31:08 GMT
- Title: Modelling High-Dimensional Categorical Data Using Nonconvex Fusion
Penalties
- Authors: Benjamin G. Stokell, Rajen D. Shah, Ryan J. Tibshirani
- Abstract summary: Our estimator, called SCOPE, fuses levels together by making their corresponding coefficients exactly equal.
We provide an algorithm for exact and efficient clustering within a non-dimensional block of variables.
- Score: 7.262048441360131
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We propose a method for estimation in high-dimensional linear models with
nominal categorical data. Our estimator, called SCOPE, fuses levels together by
making their corresponding coefficients exactly equal. This is achieved using
the minimax concave penalty on differences between the order statistics of the
coefficients for a categorical variable, thereby clustering the coefficients.
We provide an algorithm for exact and efficient computation of the global
minimum of the resulting nonconvex objective in the case with a single variable
with potentially many levels, and use this within a block coordinate descent
procedure in the multivariate case. We show that an oracle least squares
solution that exploits the unknown level fusions is a limit point of the
coordinate descent with high probability, provided the true levels have a
certain minimum separation; these conditions are known to be minimal in the
univariate case. We demonstrate the favourable performance of SCOPE across a
range of real and simulated datasets. An R package CatReg implementing SCOPE
for linear models and also a version for logistic regression is available on
CRAN.
Related papers
- Semiparametric conformal prediction [79.6147286161434]
Risk-sensitive applications require well-calibrated prediction sets over multiple, potentially correlated target variables.
We treat the scores as random vectors and aim to construct the prediction set accounting for their joint correlation structure.
We report desired coverage and competitive efficiency on a range of real-world regression problems.
arXiv Detail & Related papers (2024-11-04T14:29:02Z) - Branch and Bound to Assess Stability of Regression Coefficients in Uncertain Models [0.6990493129893112]
We introduce our algorithm, along with supporting mathematical results, an example application, and a link to our computer code.
It helps researchers summarize high-dimensional data and assess the stability of regression coefficients in uncertain models.
arXiv Detail & Related papers (2024-08-19T01:37:14Z) - Statistical Limits of Adaptive Linear Models: Low-Dimensional Estimation
and Inference [5.924780594614676]
We show that the error of estimating a single coordinate can be enlarged by a multiple of $sqrtd$ when data are allowed to be arbitrarily adaptive.
We propose a novel estimator for single coordinate inference via solving a Two-stage Adaptive Linear Estimating equation (TALE)
arXiv Detail & Related papers (2023-10-01T00:45:09Z) - AdaCat: Adaptive Categorical Discretization for Autoregressive Models [84.85102013917606]
We propose an efficient, expressive, multimodal parameterization called Adaptive Categorical Discretization (AdaCat)
AdaCat discretizes each dimension of an autoregressive model adaptively, which allows the model to allocate density to fine intervals of interest.
arXiv Detail & Related papers (2022-08-03T17:53:46Z) - Faster One-Sample Stochastic Conditional Gradient Method for Composite
Convex Minimization [61.26619639722804]
We propose a conditional gradient method (CGM) for minimizing convex finite-sum objectives formed as a sum of smooth and non-smooth terms.
The proposed method, equipped with an average gradient (SAG) estimator, requires only one sample per iteration. Nevertheless, it guarantees fast convergence rates on par with more sophisticated variance reduction techniques.
arXiv Detail & Related papers (2022-02-26T19:10:48Z) - Piecewise linear regression and classification [0.20305676256390928]
This paper proposes a method for solving multivariate regression and classification problems using piecewise linear predictors.
A Python implementation of the algorithm described in this paper is available at http://cse.lab.imtlucca.it/bemporad/parc.
arXiv Detail & Related papers (2021-03-10T17:07:57Z) - Autoregressive Score Matching [113.4502004812927]
We propose autoregressive conditional score models (AR-CSM) where we parameterize the joint distribution in terms of the derivatives of univariable log-conditionals (scores)
For AR-CSM models, this divergence between data and model distributions can be computed and optimized efficiently, requiring no expensive sampling or adversarial training.
We show with extensive experimental results that it can be applied to density estimation on synthetic data, image generation, image denoising, and training latent variable models with implicit encoders.
arXiv Detail & Related papers (2020-10-24T07:01:24Z) - Instability, Computational Efficiency and Statistical Accuracy [101.32305022521024]
We develop a framework that yields statistical accuracy based on interplay between the deterministic convergence rate of the algorithm at the population level, and its degree of (instability) when applied to an empirical object based on $n$ samples.
We provide applications of our general results to several concrete classes of models, including Gaussian mixture estimation, non-linear regression models, and informative non-response models.
arXiv Detail & Related papers (2020-05-22T22:30:52Z) - Multiclass classification by sparse multinomial logistic regression [10.312968200748116]
We consider high-dimensional multiclass classification by sparse multinomial logistic regression.
We propose a feature selection procedure based on penalized maximum likelihood with a complexity penalty.
We show that there exist two regimes corresponding to small and large number of classes.
arXiv Detail & Related papers (2020-03-04T08:44:48Z) - Adaptive Correlated Monte Carlo for Contextual Categorical Sequence
Generation [77.7420231319632]
We adapt contextual generation of categorical sequences to a policy gradient estimator, which evaluates a set of correlated Monte Carlo (MC) rollouts for variance control.
We also demonstrate the use of correlated MC rollouts for binary-tree softmax models, which reduce the high generation cost in large vocabulary scenarios.
arXiv Detail & Related papers (2019-12-31T03:01:55Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.