Discounted Adaptive Online Learning: Towards Better Regularization
        - URL: http://arxiv.org/abs/2402.02720v2
- Date: Tue, 18 Jun 2024 18:47:21 GMT
- Title: Discounted Adaptive Online Learning: Towards Better Regularization
- Authors: Zhiyu Zhang, David Bombara, Heng Yang, 
- Abstract summary: We study online learning in adversarial nonstationary environments.
We propose an adaptive (i.e., instance optimal) algorithm that improves the widespread non-adaptive baseline.
We also consider the (Gibbs and Candes, 2021)-style online conformal prediction problem.
- Score: 5.5899168074961265
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract:   We study online learning in adversarial nonstationary environments. Since the future can be very different from the past, a critical challenge is to gracefully forget the history while new data comes in. To formalize this intuition, we revisit the discounted regret in online convex optimization, and propose an adaptive (i.e., instance optimal), FTRL-based algorithm that improves the widespread non-adaptive baseline -- gradient descent with a constant learning rate. From a practical perspective, this refines the classical idea of regularization in lifelong learning: we show that designing good regularizers can be guided by the principled theory of adaptive online optimization.   Complementing this result, we also consider the (Gibbs and Cand\`es, 2021)-style online conformal prediction problem, where the goal is to sequentially predict the uncertainty sets of a black-box machine learning model. We show that the FTRL nature of our algorithm can simplify the conventional gradient-descent-based analysis, leading to instance-dependent performance guarantees. 
 
      
        Related papers
        - Optimizers Qualitatively Alter Solutions And We Should Leverage This [62.662640460717476]
 Deep Neural Networks (DNNs) can not guarantee convergence to a unique global minimum of the loss when using only local information, such as SGD.<n>We argue that the community should aim at understanding the biases of already existing methods, as well as aim to build new DNNs with the explicit intent of inducing certain properties of the solution.
 arXiv  Detail & Related papers  (2025-07-16T13:33:31Z)
- Online Learning-guided Learning Rate Adaptation via Gradient Alignment [25.688764889273237]
 The performance of an on large-scale deep learning models depends critically on fine-tuning the learning rate.<n>We propose a principled framework called GALA (Gradient Alignment-based Adaptation) which adjusts by tracking the alignment between consecutive gradients and a local curvature estimate.<n>When paired with an online learning algorithm such as Follow-the-Regularized-Leader, our method produces a flexible, adaptive learning schedule.
 arXiv  Detail & Related papers  (2025-06-10T03:46:41Z)
- Online Decision-Focused Learning [63.83903681295497]
 Decision-focused learning (DFL) is an increasingly popular paradigm for training predictive models whose outputs are used in decision-making tasks.<n>We investigate DFL in dynamic environments where the objective function does not evolve over time.<n>We establish bounds on the expected dynamic regret, both when decision space is a simplex and when it is a general bounded convex polytope.
 arXiv  Detail & Related papers  (2025-05-19T10:40:30Z)
- Incorporating Surrogate Gradient Norm to Improve Offline Optimization   Techniques [8.750390242872138]
 We develop a model-agnostic approach to offline optimization.
We show that reducing surrogate sharpness on the offline dataset provably reduces its generalized sharpness on unseen data.
Our analysis extends existing theories from bounding generalized prediction loss (on unseen data) with loss sharpness to bounding the worst-case generalized surrogate sharpness with its empirical estimate on training data.
 arXiv  Detail & Related papers  (2025-03-06T09:24:23Z)
- Online-BLS: An Accurate and Efficient Online Broad Learning System for   Data Stream Classification [52.251569042852815]
 We introduce an online broad learning system framework with closed-form solutions for each online update.
We design an effective weight estimation algorithm and an efficient online updating strategy.
Our framework is naturally extended to data stream scenarios with concept drift and exceeds state-of-the-art baselines.
 arXiv  Detail & Related papers  (2025-01-28T13:21:59Z)
- Adaptive Conformal Inference by Betting [51.272991377903274]
 We consider the problem of adaptive conformal inference without any assumptions about the data generating process.
Existing approaches for adaptive conformal inference are based on optimizing the pinball loss using variants of online gradient descent.
We propose a different approach for adaptive conformal inference that leverages parameter-free online convex optimization techniques.
 arXiv  Detail & Related papers  (2024-12-26T18:42:08Z)
- Gradient-Variation Online Learning under Generalized Smoothness [56.38427425920781]
 gradient-variation online learning aims to achieve regret guarantees that scale with variations in gradients of online functions.
Recent efforts in neural network optimization suggest a generalized smoothness condition, allowing smoothness to correlate with gradient norms.
We provide the applications for fast-rate convergence in games and extended adversarial optimization.
 arXiv  Detail & Related papers  (2024-08-17T02:22:08Z)
- Improving Adaptive Online Learning Using Refined Discretization [44.646191058243645]
 We study unconstrained Online Linear Optimization with Lipschitz losses.
Motivated by the pursuit of instance optimality, we propose a new algorithm.
Central to these results is a continuous time approach to online learning.
 arXiv  Detail & Related papers  (2023-09-27T21:54:52Z)
- Model-based Offline Imitation Learning with Non-expert Data [7.615595533111191]
 We propose a scalable model-based offline imitation learning algorithmic framework that leverages datasets collected by both suboptimal and optimal policies.
We show that the proposed method textitalways outperforms Behavioral Cloning in the low data regime on simulated continuous control domains.
 arXiv  Detail & Related papers  (2022-06-11T13:08:08Z)
- Adaptive Fairness-Aware Online Meta-Learning for Changing Environments [29.073555722548956]
 The fairness-aware online learning framework has arisen as a powerful tool for the continual lifelong learning setting.
Existing methods make heavy use of the i.i.d assumption for data and hence provide static regret analysis for the framework.
We propose a novel adaptive fairness-aware online meta-learning algorithm, namely FairSAOML, which is able to adapt to changing environments in both bias control and model precision.
 arXiv  Detail & Related papers  (2022-05-20T15:29:38Z)
- Near-optimal Offline Reinforcement Learning with Linear Representation:
  Leveraging Variance Information with Pessimism [65.46524775457928]
 offline reinforcement learning seeks to utilize offline/historical data to optimize sequential decision-making strategies.
We study the statistical limits of offline reinforcement learning with linear model representations.
 arXiv  Detail & Related papers  (2022-03-11T09:00:12Z)
- Domain-Adjusted Regression or: ERM May Already Learn Features Sufficient
  for Out-of-Distribution Generalization [52.7137956951533]
 We argue that devising simpler methods for learning predictors on existing features is a promising direction for future research.
We introduce Domain-Adjusted Regression (DARE), a convex objective for learning a linear predictor that is provably robust under a new model of distribution shift.
Under a natural model, we prove that the DARE solution is the minimax-optimal predictor for a constrained set of test distributions.
 arXiv  Detail & Related papers  (2022-02-14T16:42:16Z)
- Last Layer Marginal Likelihood for Invariance Learning [12.00078928875924]
 We introduce a new lower bound to the marginal likelihood, which allows us to perform inference for a larger class of likelihood functions.
We work towards bringing this approach to neural networks by using an architecture with a Gaussian process in the last layer.
 arXiv  Detail & Related papers  (2021-06-14T15:40:51Z)
- COMBO: Conservative Offline Model-Based Policy Optimization [120.55713363569845]
 Uncertainty estimation with complex models, such as deep neural networks, can be difficult and unreliable.
We develop a new model-based offline RL algorithm, COMBO, that regularizes the value function on out-of-support state-actions.
We find that COMBO consistently performs as well or better as compared to prior offline model-free and model-based methods.
 arXiv  Detail & Related papers  (2021-02-16T18:50:32Z)
- LQF: Linear Quadratic Fine-Tuning [114.3840147070712]
 We present the first method for linearizing a pre-trained model that achieves comparable performance to non-linear fine-tuning.
LQF consists of simple modifications to the architecture, loss function and optimization typically used for classification.
 arXiv  Detail & Related papers  (2020-12-21T06:40:20Z)
- Extrapolation for Large-batch Training in Deep Learning [72.61259487233214]
 We show that a host of variations can be covered in a unified framework that we propose.
We prove the convergence of this novel scheme and rigorously evaluate its empirical performance on ResNet, LSTM, and Transformer.
 arXiv  Detail & Related papers  (2020-06-10T08:22:41Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
       
     
           This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.