An efficient nonconvex reformulation of stagewise convex optimization
  problems
        - URL: http://arxiv.org/abs/2010.14322v1
- Date: Tue, 27 Oct 2020 14:30:32 GMT
- Title: An efficient nonconvex reformulation of stagewise convex optimization
  problems
- Authors: Rudy Bunel, Oliver Hinder, Srinadh Bhojanapalli, Krishnamurthy (Dj)
  Dvijotham
- Abstract summary: We develop a non reformulation designed to exploit staged structure problems.
For neural network verification approach obtains spurious local gaps in only a few steps.
It can quickly solve problems faster than both offthe-shelf and specialized solvers.
- Score: 21.61123565780424
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract:   Convex optimization problems with staged structure appear in several
contexts, including optimal control, verification of deep neural networks, and
isotonic regression. Off-the-shelf solvers can solve these problems but may
scale poorly. We develop a nonconvex reformulation designed to exploit this
staged structure. Our reformulation has only simple bound constraints, enabling
solution via projected gradient methods and their accelerated variants. The
method automatically generates a sequence of primal and dual feasible solutions
to the original convex problem, making optimality certification easy. We
establish theoretical properties of the nonconvex formulation, showing that it
is (almost) free of spurious local minima and has the same global optimum as
the convex problem. We modify PGD to avoid spurious local minimizers so it
always converges to the global minimizer. For neural network verification, our
approach obtains small duality gaps in only a few gradient steps. Consequently,
it can quickly solve large-scale verification problems faster than both
off-the-shelf and specialized solvers.
 
      
        Related papers
        - Optimal Guarantees for Algorithmic Reproducibility and Gradient
  Complexity in Convex Optimization [55.115992622028685]
 Previous work suggests that first-order methods would need to trade-off convergence rate (gradient convergence rate) for better.
We demonstrate that both optimal complexity and near-optimal convergence guarantees can be achieved for smooth convex minimization and smooth convex-concave minimax problems.
 arXiv  Detail & Related papers  (2023-10-26T19:56:52Z)
- Constrained Optimization via Exact Augmented Lagrangian and Randomized
  Iterative Sketching [55.28394191394675]
 We develop an adaptive inexact Newton method for equality-constrained nonlinear, nonIBS optimization problems.
We demonstrate the superior performance of our method on benchmark nonlinear problems, constrained logistic regression with data from LVM, and a PDE-constrained problem.
 arXiv  Detail & Related papers  (2023-05-28T06:33:37Z)
- Decentralized gradient descent maximization method for composite
  nonconvex strongly-concave minimax problems [7.5573375809946395]
 We make the first attempt on solving NCSC minimax problems that can focus on both stationary nonsmooth terms.
Our algorithm is designed based on a novel reformulation of the minimax problem.
 arXiv  Detail & Related papers  (2023-04-05T13:54:43Z)
- Smooth over-parameterized solvers for non-smooth structured optimization [3.756550107432323]
 Non-smoothness encodes structural constraints on the solutions, such as sparsity, group sparsity, low-rank edges and sharp edges.
We operate a non-weighted but smooth overparametrization of the underlying nonsmooth optimization problems.
Our main contribution is to apply the Variable Projection (VarPro) which defines a new formulation by explicitly minimizing over part of the variables.
 arXiv  Detail & Related papers  (2022-05-03T09:23:07Z)
- Efficient Algorithms for High-Dimensional Convex Subspace Optimization
  via Strict Complementarity [19.24470467199451]
 We consider optimization problems in which the goal is find a $k$ subspace of $realsn$, $k$, which minimizes a convex smooth loss.
While this problem is highly in high-dimensional regimes, it is difficult to find a global optimal solution.
In this paper we present a natural.
determinate optimal dimension relaxation for which convergence to the.
global is straightforward.
 arXiv  Detail & Related papers  (2022-02-08T17:36:43Z)
- Fast Convex Optimization for Two-Layer ReLU Networks: Equivalent Model
  Classes and Cone Decompositions [41.337814204665364]
 We develop algorithms for convex optimization of two-layer neural networks with ReLU activation functions.
We show that convex gated ReLU models obtain data-dependent approximation bounds for the ReLU training problem.
 arXiv  Detail & Related papers  (2022-02-02T23:50:53Z)
- A framework for bilevel optimization that enables stochastic and global   variance reduction algorithms [21.67411847762289]
 Bilevel optimization is a problem of minimizing a value function which involves the arg-minimum of another function.
We introduce a novel framework, in which the solution of the inner problem, the solution of the linear system, and the main variable evolve at the same time.
We demonstrate that SABA, an adaptation of the celebrated SAGA algorithm in our framework, has $O(frac1T)$ convergence rate.
 arXiv  Detail & Related papers  (2022-01-31T18:17:25Z)
- On Constraints in First-Order Optimization: A View from Non-Smooth
  Dynamical Systems [99.59934203759754]
 We introduce a class of first-order methods for smooth constrained optimization.
Two distinctive features of our approach are that projections or optimizations over the entire feasible set are avoided.
The resulting algorithmic procedure is simple to implement even when constraints are nonlinear.
 arXiv  Detail & Related papers  (2021-07-17T11:45:13Z)
- A Feasible Level Proximal Point Method for Nonconvex Sparse Constrained
  Optimization [25.73397307080647]
 We present a new model of a general convex or non objective machine machine objectives.
We propose an algorithm that solves a constraint with gradually relaxed point levels of each subproblem.
We demonstrate the effectiveness of our new numerical scale problems.
 arXiv  Detail & Related papers  (2020-10-23T05:24:05Z)
- Balancing Rates and Variance via Adaptive Batch-Size for Stochastic
  Optimization Problems [120.21685755278509]
 In this work, we seek to balance the fact that attenuating step-size is required for exact convergence with the fact that constant step-size learns faster in time up to an error.
Rather than fixing the minibatch the step-size at the outset, we propose to allow parameters to evolve adaptively.
 arXiv  Detail & Related papers  (2020-07-02T16:02:02Z)
- The Hidden Convex Optimization Landscape of Two-Layer ReLU Neural
  Networks: an Exact Characterization of the Optimal Solutions [51.60996023961886]
 We prove that finding all globally optimal two-layer ReLU neural networks can be performed by solving a convex optimization program with cone constraints.
Our analysis is novel, characterizes all optimal solutions, and does not leverage duality-based analysis which was recently used to lift neural network training into convex spaces.
 arXiv  Detail & Related papers  (2020-06-10T15:38:30Z)
- Second-Order Guarantees in Centralized, Federated and Decentralized
  Nonconvex Optimization [64.26238893241322]
 Simple algorithms have been shown to lead to good empirical results in many contexts.
Several works have pursued rigorous analytical justification for studying non optimization problems.
A key insight in these analyses is that perturbations play a critical role in allowing local descent algorithms.
 arXiv  Detail & Related papers  (2020-03-31T16:54:22Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
       
     
           This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.