A Decentralized Adaptive Momentum Method for Solving a Class of Min-Max
Optimization Problems
- URL: http://arxiv.org/abs/2106.06075v1
- Date: Thu, 10 Jun 2021 22:32:01 GMT
- Title: A Decentralized Adaptive Momentum Method for Solving a Class of Min-Max
Optimization Problems
- Authors: Babak Barazandeh, Tianjian Huang, George Michailidis
- Abstract summary: We develop a decentralized adaptive momentum (ADAM)-type algorithm for solving min-max optimization problem.
We obtain non-asymptotic rates of convergence of the proposed algorithm for finding a (stochastic) first-order Nash equilibrium point.
- Score: 9.653157073271021
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Min-max saddle point games have recently been intensely studied, due to their
wide range of applications, including training Generative Adversarial
Networks~(GANs). However, most of the recent efforts for solving them are
limited to special regimes such as convex-concave games. Further, it is
customarily assumed that the underlying optimization problem is solved either
by a single machine or in the case of multiple machines connected in
centralized fashion, wherein each one communicates with a central node. The
latter approach becomes challenging, when the underlying communications network
has low bandwidth. In addition, privacy considerations may dictate that certain
nodes can communicate with a subset of other nodes. Hence, it is of interest to
develop methods that solve min-max games in a decentralized manner. To that
end, we develop a decentralized adaptive momentum (ADAM)-type algorithm for
solving min-max optimization problem under the condition that the objective
function satisfies a Minty Variational Inequality condition, which is a
generalization to convex-concave case. The proposed method overcomes
shortcomings of recent non-adaptive gradient-based decentralized algorithms for
min-max optimization problems that do not perform well in practice and require
careful tuning. In this paper, we obtain non-asymptotic rates of convergence of
the proposed algorithm (coined DADAM$^3$) for finding a (stochastic)
first-order Nash equilibrium point and subsequently evaluate its performance on
training GANs. The extensive empirical evaluation shows that DADAM$^3$
outperforms recently developed methods, including decentralized optimistic
stochastic gradient for solving such min-max problems.
Related papers
- Lower Bounds and Optimal Algorithms for Non-Smooth Convex Decentralized Optimization over Time-Varying Networks [57.24087627267086]
We consider the task of minimizing the sum of convex functions stored in a decentralized manner across the nodes of a communication network.
Lower bounds on the number of decentralized communications and (sub)gradient computations required to solve the problem have been established.
We develop the first optimal algorithm that matches these lower bounds and offers substantially improved theoretical performance compared to the existing state of the art.
arXiv Detail & Related papers (2024-05-28T10:28:45Z) - Can Decentralized Stochastic Minimax Optimization Algorithms Converge
Linearly for Finite-Sum Nonconvex-Nonconcave Problems? [56.62372517641597]
Decentralized minimax optimization has been actively studied in the past few years due to its application in a wide range machine learning.
This paper develops two novel decentralized minimax optimization algorithms for the non-strongly-nonconcave problem.
arXiv Detail & Related papers (2023-04-24T02:19:39Z) - Near-Optimal Decentralized Momentum Method for Nonconvex-PL Minimax
Problems [39.197569803430646]
Minimax optimization plays an important role in many machine learning tasks such as adversarial networks (GANs) and adversarial training.
Although recently a wide variety of optimization methods have been proposed to solve the minimax problems, most of them ignore the distributed setting.
arXiv Detail & Related papers (2023-04-21T11:38:41Z) - Decentralized Riemannian Algorithm for Nonconvex Minimax Problems [82.50374560598493]
The minimax algorithms for neural networks have been developed to solve many problems.
In this paper, we propose two types of minimax algorithms.
For the setting, we propose DRSGDA and prove that our method achieves a gradient.
arXiv Detail & Related papers (2023-02-08T01:42:45Z) - Lower Bounds and Optimal Algorithms for Smooth and Strongly Convex
Decentralized Optimization Over Time-Varying Networks [79.16773494166644]
We consider the task of minimizing the sum of smooth and strongly convex functions stored in a decentralized manner across the nodes of a communication network.
We design two optimal algorithms that attain these lower bounds.
We corroborate the theoretical efficiency of these algorithms by performing an experimental comparison with existing state-of-the-art methods.
arXiv Detail & Related papers (2021-06-08T15:54:44Z) - Solving a class of non-convex min-max games using adaptive momentum
methods [9.538456363995161]
Adaptive momentum methods have attracted a lot of attention for deep neural networks.
In this paper, we propose an adaptive momentum min-max optimization problem in adversarial networks.
Experimental results illustrate its superior performance vis-avis methods for solving such problems.
arXiv Detail & Related papers (2021-04-26T16:06:39Z) - Efficient Methods for Structured Nonconvex-Nonconcave Min-Max
Optimization [98.0595480384208]
We propose a generalization extraient spaces which converges to a stationary point.
The algorithm applies not only to general $p$-normed spaces, but also to general $p$-dimensional vector spaces.
arXiv Detail & Related papers (2020-10-31T21:35:42Z) - Second-Order Guarantees in Centralized, Federated and Decentralized
Nonconvex Optimization [64.26238893241322]
Simple algorithms have been shown to lead to good empirical results in many contexts.
Several works have pursued rigorous analytical justification for studying non optimization problems.
A key insight in these analyses is that perturbations play a critical role in allowing local descent algorithms.
arXiv Detail & Related papers (2020-03-31T16:54:22Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.