A Theoretical Study of The Effects of Adversarial Attacks on Sparse
  Regression
        - URL: http://arxiv.org/abs/2212.11209v2
- Date: Thu, 22 Dec 2022 09:43:32 GMT
- Title: A Theoretical Study of The Effects of Adversarial Attacks on Sparse
  Regression
- Authors: Deepak Maurya, Jean Honorio
- Abstract summary: We use the primal-dual witness paradigm to provide provable performance guarantees for the support of the estimated regression parameter vector.
Our theoretical analysis shows the counter-intuitive result that an adversary can influence sample complexity by corrupting the irrelevant features.
- Score: 28.776950569604026
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract:   This paper analyzes $\ell_1$ regularized linear regression under the
challenging scenario of having only adversarially corrupted data for training.
We use the primal-dual witness paradigm to provide provable performance
guarantees for the support of the estimated regression parameter vector to
match the actual parameter. Our theoretical analysis shows the
counter-intuitive result that an adversary can influence sample complexity by
corrupting the irrelevant features, i.e., those corresponding to zero
coefficients of the regression parameter vector, which, consequently, do not
affect the dependent variable. As any adversarially robust algorithm has its
limitations, our theoretical analysis identifies the regimes under which the
learning algorithm and adversary can dominate over each other. It helps us to
analyze these fundamental limits and address critical scientific questions of
which parameters (like mutual incoherence, the maximum and minimum eigenvalue
of the covariance matrix, and the budget of adversarial perturbation) play a
role in the high or low probability of success of the LASSO algorithm. Also,
the derived sample complexity is logarithmic with respect to the size of the
regression parameter vector, and our theoretical claims are validated by
empirical analysis on synthetic and real-world datasets.
 
      
        Related papers
        - Bridging Internal Probability and Self-Consistency for Effective and   Efficient LLM Reasoning [53.25336975467293]
 We present the first theoretical error decomposition analysis of methods such as perplexity and self-consistency.
Our analysis reveals a fundamental trade-off: perplexity methods suffer from substantial model error due to the absence of a proper consistency function.
We propose Reasoning-Pruning Perplexity Consistency (RPC), which integrates perplexity with self-consistency, and Reasoning Pruning, which eliminates low-probability reasoning paths.
 arXiv  Detail & Related papers  (2025-02-01T18:09:49Z)
- High-dimensional logistic regression with missing data: Imputation,   regularization, and universality [7.167672851569787]
 We study high-dimensional, ridge-regularized logistic regression.
We provide exact characterizations of both the prediction error and the estimation error.
 arXiv  Detail & Related papers  (2024-10-01T21:41:21Z)
- Scaling and renormalization in high-dimensional regression [72.59731158970894]
 We present a unifying perspective on recent results on ridge regression.<n>We use the basic tools of random matrix theory and free probability, aimed at readers with backgrounds in physics and deep learning.<n>Our results extend and provide a unifying perspective on earlier models of scaling laws.
 arXiv  Detail & Related papers  (2024-05-01T15:59:00Z)
- Selective Nonparametric Regression via Testing [54.20569354303575]
 We develop an abstention procedure via testing the hypothesis on the value of the conditional variance at a given point.
Unlike existing methods, the proposed one allows to account not only for the value of the variance itself but also for the uncertainty of the corresponding variance predictor.
 arXiv  Detail & Related papers  (2023-09-28T13:04:11Z)
- Advancing Counterfactual Inference through Nonlinear Quantile Regression [77.28323341329461]
 We propose a framework for efficient and effective counterfactual inference implemented with neural networks.
The proposed approach enhances the capacity to generalize estimated counterfactual outcomes to unseen data.
 Empirical results conducted on multiple datasets offer compelling support for our theoretical assertions.
 arXiv  Detail & Related papers  (2023-06-09T08:30:51Z)
- Understanding Augmentation-based Self-Supervised Representation Learning
  via RKHS Approximation and Regression [53.15502562048627]
 Recent work has built the connection between self-supervised learning and the approximation of the top eigenspace of a graph Laplacian operator.
This work delves into a statistical analysis of augmentation-based pretraining.
 arXiv  Detail & Related papers  (2023-06-01T15:18:55Z)
- Errors-in-variables Fr\'echet Regression with Low-rank Covariate
  Approximation [2.1756081703276]
 Fr'echet regression has emerged as a promising approach for regression analysis involving non-Euclidean response variables.
Our proposed framework combines the concepts of global Fr'echet regression and principal component regression, aiming to improve the efficiency and accuracy of the regression estimator.
 arXiv  Detail & Related papers  (2023-05-16T08:37:54Z)
- Robust Regularized Low-Rank Matrix Models for Regression and
  Classification [14.698622796774634]
 We propose a framework for matrix variate regression models based on a rank constraint, vector regularization (e.g., sparsity), and a general loss function.
We show that the algorithm is guaranteed to converge, all accumulation points of the algorithm have estimation errors in the order of $O(sqrtn)$ally and substantially attaining the minimax rate.
 arXiv  Detail & Related papers  (2022-05-14T18:03:48Z)
- Long Story Short: Omitted Variable Bias in Causal Machine Learning [26.60315380737132]
 We develop a theory of omitted variable bias for a wide range of common causal parameters.
We show how simple plausibility judgments on the maximum explanatory power of omitted variables are sufficient to bound the magnitude of the bias.
We provide flexible and efficient statistical inference methods for the bounds, which can leverage modern machine learning algorithms for estimation.
 arXiv  Detail & Related papers  (2021-12-26T15:38:23Z)
- The Predictive Normalized Maximum Likelihood for Over-parameterized
  Linear Regression with Norm Constraint: Regret and Double Descent [12.929639356256928]
 We show that modern machine learning models do not obey a trade-off between the complexity of a prediction rule and its ability to generalize.
We use the recently proposed predictive normalized maximum likelihood (pNML) which is the min-max regret solution for individual data.
We demonstrate the use of the pNML regret as a point-wise learnability measure on synthetic data and that it can successfully predict the double-decent phenomenon.
 arXiv  Detail & Related papers  (2021-02-14T15:49:04Z)
- Understanding Implicit Regularization in Over-Parameterized Single Index
  Model [55.41685740015095]
 We design regularization-free algorithms for the high-dimensional single index model.
We provide theoretical guarantees for the induced implicit regularization phenomenon.
 arXiv  Detail & Related papers  (2020-07-16T13:27:47Z)
- An Investigation of Why Overparameterization Exacerbates Spurious
  Correlations [98.3066727301239]
 We identify two key properties of the training data that drive this behavior.
We show how the inductive bias of models towards "memorizing" fewer examples can cause over parameterization to hurt.
 arXiv  Detail & Related papers  (2020-05-09T01:59:13Z)
- Asymptotic Analysis of an Ensemble of Randomly Projected Linear
  Discriminants [94.46276668068327]
 In [1], an ensemble of randomly projected linear discriminants is used to classify datasets.
We develop a consistent estimator of the misclassification probability as an alternative to the computationally-costly cross-validation estimator.
We also demonstrate the use of our estimator for tuning the projection dimension on both real and synthetic data.
 arXiv  Detail & Related papers  (2020-04-17T12:47:04Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
       
     
           This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.