Conditional Gradients for the Approximately Vanishing Ideal
- URL: http://arxiv.org/abs/2202.03349v3
- Date: Wed, 9 Feb 2022 13:12:35 GMT
- Title: Conditional Gradients for the Approximately Vanishing Ideal
- Authors: E. Wirth, S. Pokutta
- Abstract summary: Conditional Gradients Approximately Vanishing Ideal algorithm (CGAVI)
We introduce the Conditional Gradients Approximately Vanishing Ideal algorithm (CGAVI) for the construction of a set of generators of the approximately vanishing ideal.
The constructed set of generators captures structures in data and gives rise to a feature map that can be used in combination with a linear classifier for supervised learning.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The vanishing ideal of a set of points $X\subseteq \mathbb{R}^n$ is the set
of polynomials that evaluate to $0$ over all points $\mathbf{x} \in X$ and
admits an efficient representation by a finite set of polynomials called
generators. To accommodate the noise in the data set, we introduce the
Conditional Gradients Approximately Vanishing Ideal algorithm (CGAVI) for the
construction of the set of generators of the approximately vanishing ideal. The
constructed set of generators captures polynomial structures in data and gives
rise to a feature map that can, for example, be used in combination with a
linear classifier for supervised learning. In CGAVI, we construct the set of
generators by solving specific instances of (constrained) convex optimization
problems with the Pairwise Frank-Wolfe algorithm (PFW). Among other things, the
constructed generators inherit the LASSO generalization bound and vanish not only
on the training data but also on out-of-sample data. Moreover, CGAVI admits a
compact representation of the approximately vanishing ideal by constructing few
generators with sparse coefficient vectors.
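To make the core construction concrete, the following is a minimal, illustrative Python sketch (not the authors' implementation): for a single candidate monomial, it fits an l1-constrained coefficient vector with vanilla Frank-Wolfe so that the candidate plus a combination of lower-order monomials approximately vanishes on noisy circle data. CGAVI itself uses the Pairwise Frank-Wolfe variant and a systematic traversal of candidate monomials, both omitted here; the names `A`, `b`, `tau`, and the tolerance `psi` are illustrative assumptions, not the paper's notation.

```python
import numpy as np

def frank_wolfe_l1(A, b, tau, iters=200):
    """Minimize (1/m) * ||A @ c + b||^2 over the l1 ball {c : ||c||_1 <= tau}."""
    m, n = A.shape
    c = np.zeros(n)
    for k in range(iters):
        grad = (2.0 / m) * (A.T @ (A @ c + b))   # gradient of the quadratic loss
        i = int(np.argmax(np.abs(grad)))         # linear minimization oracle over the
        vertex = np.zeros(n)                     # l1 ball: best vertex is -tau*sign(g_i)*e_i
        vertex[i] = -tau * np.sign(grad[i])
        gamma = 2.0 / (k + 2.0)                  # standard open-loop FW step size
        c = (1.0 - gamma) * c + gamma * vertex
    return c

# Toy data: noisy samples from the unit circle, so x^2 + y^2 - 1 should
# approximately vanish on X.
rng = np.random.default_rng(0)
theta = rng.uniform(0.0, 2.0 * np.pi, size=200)
X = np.stack([np.cos(theta), np.sin(theta)], axis=1) + 0.01 * rng.normal(size=(200, 2))

# Candidate monomial t = x^2 with coefficient fixed to 1; the remaining
# lower-order monomials are O = {1, x, y, xy, y^2}.
b = X[:, 0] ** 2
A = np.stack([np.ones(len(X)), X[:, 0], X[:, 1], X[:, 0] * X[:, 1], X[:, 1] ** 2], axis=1)

c = frank_wolfe_l1(A, b, tau=2.0)
mse = float(np.mean((A @ c + b) ** 2))
print("coefficients over {1, x, y, xy, y^2}:", np.round(c, 3))
print("mean squared evaluation over X:", mse)
# If mse falls below a chosen tolerance (psi), the polynomial t + sum_i c_i * O_i
# is kept as a generator; evaluating the kept generators on new points yields
# the feature map fed to a linear classifier.
```

On this toy set the recovered coefficients approximate x^2 + y^2 - 1, and the l1 constraint is what induces the sparse coefficient vectors mentioned in the abstract.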
Related papers
- Mutual-energy inner product optimization method for constructing feature coordinates and image classification in Machine Learning [0.0]
This paper proposes the mutual-energy inner product optimization method for constructing a feature coordinate system.
By expressing the mutual-energy inner product as a series of eigenfunctions, it shows a significant advantage in enhancing low-frequency features.
A stable and efficient sequential linearization algorithm is constructed to solve the optimization model.
arXiv Detail & Related papers (2024-11-09T07:26:03Z) - Covering Number of Real Algebraic Varieties and Beyond: Improved Bounds and Applications [8.438718130535296]
We prove upper bounds on the covering number of sets in Euclidean space.
We show that our bounds improve the best known general bound by Yomdin-Comte.
We illustrate the power of the result on three computational applications.
arXiv Detail & Related papers (2023-11-09T03:06:59Z) - Sample Complexity for Quadratic Bandits: Hessian Dependent Bounds and Optimal Algorithms [64.10576998630981]
We show the first tight characterization of the optimal Hessian-dependent sample complexity.
A Hessian-independent algorithm universally achieves the optimal sample complexities for all Hessian instances.
The optimal sample complexities achieved by our algorithm remain valid for heavy-tailed noise distributions.
arXiv Detail & Related papers (2023-06-21T17:03:22Z) - An inexact LPA for DC composite optimization and application to matrix completions with outliers [5.746154410100363]
This paper concerns a class of composite optimization problems.
By leveraging the composite structure, we provide a condition for the potential function to have the KL property of exponent $1/2$ at the iterate sequence.
arXiv Detail & Related papers (2023-03-29T16:15:34Z) - Efficient construction of involutory linear combinations of anti-commuting Pauli generators for large-scale iterative qubit coupled cluster calculations [0.0]
We present an efficient method for construction of a fully anti-commutative set of Pauli generators.
The algorithm complexity is linear in the size of the X set and quadratic in the number of qubits.
The resulting anti-commutative sets are used to construct the qubit coupled cluster Ansatz.
arXiv Detail & Related papers (2023-01-25T16:52:02Z) - Approximate Vanishing Ideal Computations at Scale [21.01267270902429]
We focus on scaling up the Oracle Approximate Vanishing Ideal algorithm (OAVI).
OAVI is an attractive preprocessing technique for large-scale machine learning.
arXiv Detail & Related papers (2022-07-04T07:17:28Z) - Unfolding Projection-free SDP Relaxation of Binary Graph Classifier via GDPA Linearization [59.87663954467815]
Algorithm unfolding creates an interpretable and parsimonious neural network architecture by implementing each iteration of a model-based algorithm as a neural layer.
In this paper, leveraging a recent linear algebraic theorem called Gershgorin disc perfect alignment (GDPA), we unroll a projection-free algorithm for the semi-definite programming relaxation (SDR) of a binary graph classifier.
Experimental results show that our unrolled network outperformed pure model-based graph classifiers, and achieved comparable performance to pure data-driven networks but using far fewer parameters.
arXiv Detail & Related papers (2021-09-10T07:01:15Z) - Optimal Gradient-based Algorithms for Non-concave Bandit Optimization [76.57464214864756]
This work considers a large family of bandit problems where the unknown underlying reward function is non-concave.
Our algorithms are based on a unified zeroth-order optimization paradigm that applies in great generality.
We show that the standard optimistic algorithms are sub-optimal by dimension factors.
arXiv Detail & Related papers (2021-07-09T16:04:24Z) - Analysis of Truncated Orthogonal Iteration for Sparse Eigenvector Problems [78.95866278697777]
We propose two variants of the Truncated Orthogonal Iteration to compute multiple leading eigenvectors with sparsity constraints simultaneously.
We then apply our algorithms to solve the sparse principal component analysis problem for a wide range of test datasets.
arXiv Detail & Related papers (2021-03-24T23:11:32Z) - Bilinear Classes: A Structural Framework for Provable Generalization in RL [119.42509700822484]
Bilinear Classes is a new structural framework which permits generalization in reinforcement learning.
The framework incorporates nearly all existing models in which a polynomial sample complexity is achievable.
Our main result provides an RL algorithm with polynomial sample complexity for Bilinear Classes.
arXiv Detail & Related papers (2021-03-19T16:34:20Z) - Multi-View Spectral Clustering with High-Order Optimal Neighborhood Laplacian Matrix [57.11971786407279]
Multi-view spectral clustering can effectively reveal the intrinsic cluster structure among data.
This paper proposes a multi-view spectral clustering algorithm that learns a high-order optimal neighborhood Laplacian matrix.
Our proposed algorithm generates the optimal Laplacian matrix by searching the neighborhood of the linear combination of both the first-order and high-order base Laplacian matrices.
arXiv Detail & Related papers (2020-08-31T12:28:40Z)
This list is automatically generated from the titles and abstracts of the papers on this site.
This site does not guarantee the quality of the information and is not responsible for any consequences of its use.