When is Momentum Extragradient Optimal? A Polynomial-Based Analysis
- URL: http://arxiv.org/abs/2211.04659v3
- Date: Sat, 10 Feb 2024 23:26:03 GMT
- Title: When is Momentum Extragradient Optimal? A Polynomial-Based Analysis
- Authors: Junhyung Lyle Kim, Gauthier Gidel, Anastasios Kyrillidis, Fabian
Pedregosa
- Abstract summary: Game dynamics involve complex interactions reflected by the eigenvalues of the game vector field's Jacobian scattered across the complex plane.
This complexity can cause the simple method to diverge, even for bilinear games, while the extragradient method achieves convergence.
We use a gradient-based analysis to identify three distinct scenarios where this method exhibits further accelerated convergence.
- Score: 35.327135714647824
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: The extragradient method has gained popularity due to its robust convergence
properties for differentiable games. Unlike single-objective optimization, game
dynamics involve complex interactions reflected by the eigenvalues of the game
vector field's Jacobian scattered across the complex plane. This complexity can
cause the simple gradient method to diverge, even for bilinear games, while the
extragradient method achieves convergence. Building on the recently proven
accelerated convergence of the momentum extragradient method for bilinear games
\citep{azizian2020accelerating}, we use a polynomial-based analysis to identify
three distinct scenarios where this method exhibits further accelerated
convergence. These scenarios encompass situations where the eigenvalues reside
on the (positive) real line, lie on the real line alongside complex conjugates,
or exist solely as complex conjugates. Furthermore, we derive the
hyperparameters for each scenario that achieve the fastest convergence rate.
Related papers
- Online Learning Guided Quasi-Newton Methods with Global Non-Asymptotic Convergence [20.766358513158206]
We prove a global convergence rate of $O(min1/k,sqrtd/k1.25)$ in terms of the duality gap.
These results are the first global convergence results to demonstrate a provable advantage of a quasi-Newton method over the extragradient method.
arXiv Detail & Related papers (2024-10-03T16:08:16Z) - Convergence of Adam Under Relaxed Assumptions [72.24779199744954]
We show that Adam converges to $epsilon$-stationary points with $O(epsilon-4)$ gradient complexity under far more realistic conditions.
We also propose a variance-reduced version of Adam with an accelerated gradient complexity of $O(epsilon-3)$.
arXiv Detail & Related papers (2023-04-27T06:27:37Z) - Min-Max Optimization Made Simple: Approximating the Proximal Point
Method via Contraction Maps [77.8999425439444]
We present a first-order method that admits near-optimal convergence rates for convex/concave min-max problems.
Our work is based on the fact that the update rule of the Proximal Point method can be approximated up to accuracy.
arXiv Detail & Related papers (2023-01-10T12:18:47Z) - On Asymptotic Linear Convergence of Projected Gradient Descent for
Constrained Least Squares [22.851500417035947]
This manuscript presents a unified framework for the analysis of projected gradient descent in the context of constrained least squares.
We present a recipe for the convergence analysis of PGD and demonstrate it via a beginning-to-end application of the recipe on four fundamental problems.
arXiv Detail & Related papers (2021-12-22T09:49:51Z) - A Discrete Variational Derivation of Accelerated Methods in Optimization [68.8204255655161]
We introduce variational which allow us to derive different methods for optimization.
We derive two families of optimization methods in one-to-one correspondence.
The preservation of symplecticity of autonomous systems occurs here solely on the fibers.
arXiv Detail & Related papers (2021-06-04T20:21:53Z) - Better Regularization for Sequential Decision Spaces: Fast Convergence
Rates for Nash, Correlated, and Team Equilibria [121.36609493711292]
We study the application of iterative first-order methods to the problem of computing equilibria of large-scale two-player extensive-form games.
By instantiating first-order methods with our regularizers, we develop the first accelerated first-order methods for computing correlated equilibra and ex-ante coordinated team equilibria.
arXiv Detail & Related papers (2021-05-27T06:10:24Z) - AMITE: A Novel Polynomial Expansion for Analyzing Neural Network
Nonlinearities [1.8761314918771685]
Polynomial expansions are important in the analysis of neural network nonlinearities.
Existing approaches span classical Taylor and Chebyshev methods.
There are no approaches that provide a consistent method an expansion with all these properties.
arXiv Detail & Related papers (2020-07-13T07:58:47Z) - Stochastic spectral embedding [0.0]
We propose a novel sequential adaptive surrogate modeling method based on "stochastic spectral embedding" (SSE)
We show how the method compares favorably against state-of-the-art sparse chaos expansions on a set of models with different complexity and input dimension.
arXiv Detail & Related papers (2020-04-09T11:00:07Z) - Optimization with Momentum: Dynamical, Control-Theoretic, and Symplectic
Perspectives [97.16266088683061]
The article rigorously establishes why symplectic discretization schemes are important for momentum-based optimization algorithms.
It provides a characterization of algorithms that exhibit accelerated convergence.
arXiv Detail & Related papers (2020-02-28T00:32:47Z) - Convergence of a Stochastic Gradient Method with Momentum for Non-Smooth
Non-Convex Optimization [25.680334940504405]
This paper establishes the convergence of the rate of a non-smooth subient method with momentum for constrained problems.
For problems, we show how the unconstrained case can be analyzed under weaker assumptions than the state-of-the-art.
arXiv Detail & Related papers (2020-02-13T12:10:17Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.