Learning to optimize by multi-gradient for multi-objective optimization
- URL: http://arxiv.org/abs/2311.00559v1
- Date: Wed, 1 Nov 2023 14:55:54 GMT
- Title: Learning to optimize by multi-gradient for multi-objective optimization
- Authors: Linxi Yang, Xinmin Yang, Liping Tang
- Abstract summary: We introduce a new automatic learning paradigm for optimizing MOO problems, and propose a multi-gradient learning to optimize (ML2O) method.
As a learning-based method, ML2O acquires knowledge of local landscapes by leveraging information from the current step.
We show that our learned optimizer outperforms hand-designed competitors on training multi-task learning (MTL) neural networks.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The development of artificial intelligence (AI) for science has led to the
emergence of learning-based research paradigms, necessitating a compelling
reevaluation of the design of multi-objective optimization (MOO) methods. The
new generation of MOO methods should be rooted in automated learning rather than
manual design. In this paper, we introduce a new automatic learning paradigm
for optimizing MOO problems, and propose a multi-gradient learning to optimize
(ML2O) method, which automatically learns a generator (or mappings) from
multiple gradients to update directions. As a learning-based method, ML2O
acquires knowledge of local landscapes by leveraging information from the
current step and incorporates global experience extracted from historical
iteration trajectory data. By introducing a new guarding mechanism, we propose
a guarded multi-gradient learning to optimize (GML2O) method, and prove that
the iterative sequence generated by GML2O converges to a Pareto critical point.
The experimental results demonstrate that our learned optimizer outperforms
hand-designed competitors when training multi-task learning (MTL) neural networks.
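To make the idea concrete, the following is a minimal, hypothetical PyTorch sketch of a guarded learned multi-gradient update for two objectives. The recurrent generator, the guard condition (accept the learned direction only when it is a common descent direction), and the fallback to the classical MGDA min-norm direction are illustrative assumptions, not the paper's actual architecture or guarding mechanism; the names `GradientGenerator`, `mgda_direction`, and `guarded_step` are hypothetical.

```python
import torch
import torch.nn as nn


class GradientGenerator(nn.Module):
    """Learned mapping from per-objective gradients to an update direction.

    The LSTM hidden state is carried across iterations, standing in for the
    "global experience" extracted from the historical trajectory.
    """

    def __init__(self, num_objectives: int, hidden_size: int = 32):
        super().__init__()
        self.rnn = nn.LSTMCell(num_objectives, hidden_size)
        self.head = nn.Linear(hidden_size, 1)

    def forward(self, grads, state=None):
        # grads: (num_params, num_objectives) -- one row per scalar parameter.
        h, c = self.rnn(grads, state)
        direction = self.head(h).squeeze(-1)          # (num_params,)
        return direction, (h, c)


def mgda_direction(grads):
    """Classical min-norm multi-gradient direction (two-objective closed form)."""
    g1, g2 = grads[:, 0], grads[:, 1]
    diff = g1 - g2
    alpha = torch.clamp(torch.dot(g2 - g1, g2) / (torch.dot(diff, diff) + 1e-12),
                        0.0, 1.0)
    return -(alpha * g1 + (1.0 - alpha) * g2)


def guarded_step(params, grads, generator, state, lr=1e-2):
    """One guarded update: keep the learned direction only if it decreases every
    objective to first order; otherwise fall back to the MGDA direction."""
    with torch.no_grad():
        d, state = generator(grads, state)
        if not torch.all(grads.t() @ d < 0):          # guard condition
            d = mgda_direction(grads)                 # provably safe fallback
        params.add_(lr * d)
    return params, state
```

In a full ML2O/GML2O setup the generator itself would be meta-trained, for example by unrolling `guarded_step` over a window of iterations and backpropagating the accumulated task losses; that outer loop is omitted from this sketch.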
Related papers
- Unlearning as multi-task optimization: A normalized gradient difference approach with an adaptive learning rate [105.86576388991713]
We introduce a normalized gradient difference (NGDiff) algorithm, enabling us to have better control over the trade-off between the objectives.
We provide a theoretical analysis and empirically demonstrate the superior performance of NGDiff among state-of-the-art unlearning methods on the TOFU and MUSE datasets.
arXiv Detail & Related papers (2024-10-29T14:41:44Z)
- Unleashing the Potential of Large Language Models as Prompt Optimizers: An Analogical Analysis with Gradient-based Model Optimizers [108.72225067368592]
We propose a novel perspective to investigate the design of large language models (LLMs)-based prompts.
We identify two pivotal factors in model parameter learning: update direction and update method.
In particular, we borrow the theoretical framework and learning methods from gradient-based optimization to design improved strategies.
arXiv Detail & Related papers (2024-02-27T15:05:32Z)
- Learning Large-scale Neural Fields via Context Pruned Meta-Learning [60.93679437452872]
We introduce an efficient optimization-based meta-learning technique for large-scale neural field training.
We show how gradient re-scaling at meta-test time allows the learning of extremely high-quality neural fields.
Our framework is model-agnostic, intuitive, straightforward to implement, and shows significant reconstruction improvements for a wide range of signals.
arXiv Detail & Related papers (2023-02-01T17:32:16Z)
- Learning to Optimize: A Primer and A Benchmark [94.29436694770953]
Learning to optimize (L2O) is an emerging approach that leverages machine learning to develop optimization methods.
This article is poised to be the first comprehensive survey and benchmark of L2O for continuous optimization.
arXiv Detail & Related papers (2021-03-23T20:46:20Z)
- Interleaving Learning, with Application to Neural Architecture Search [12.317568257671427]
We propose a novel machine learning framework referred to as interleaving learning (IL).
In our framework, a set of models collaboratively learn a data encoder in an interleaving fashion.
We apply interleaving learning to search neural architectures for image classification on CIFAR-10, CIFAR-100, and ImageNet.
arXiv Detail & Related papers (2021-03-12T00:54:22Z)
- Meta-Learning with Neural Tangent Kernels [58.06951624702086]
We propose the first meta-learning paradigm in the Reproducing Kernel Hilbert Space (RKHS) induced by the meta-model's Neural Tangent Kernel (NTK).
Within this paradigm, we introduce two meta-learning algorithms, which no longer need a sub-optimal iterative inner-loop adaptation as in the MAML framework.
We achieve this goal by 1) replacing the adaptation with a fast-adaptive regularizer in the RKHS; and 2) solving the adaptation analytically based on the NTK theory.
arXiv Detail & Related papers (2021-02-07T20:53:23Z)
- MTL2L: A Context Aware Neural Optimiser [25.114351877091785]
Multi-Task Learning to Learn (MTL2L) is a context aware neural optimiser which self-modifies its optimisation rules based on input data.
We show that MTL2L is capable of updating learners to classify on data of an unseen input-domain at the meta-testing phase.
arXiv Detail & Related papers (2020-07-18T06:37:30Z)
- Transfer Learning without Knowing: Reprogramming Black-box Machine Learning Models with Scarce Data and Limited Resources [78.72922528736011]
We propose a novel approach, black-box adversarial reprogramming (BAR), that repurposes a well-trained black-box machine learning model.
Using zeroth-order optimization and multi-label mapping techniques, BAR can reprogram a black-box ML model based solely on its input-output responses (a generic zeroth-order sketch follows this list).
BAR outperforms state-of-the-art methods and yields comparable performance to the vanilla adversarial reprogramming method.
arXiv Detail & Related papers (2020-07-17T01:52:34Z)
- Gradient Monitored Reinforcement Learning [0.0]
We focus on the enhancement of training and evaluation performance in reinforcement learning algorithms.
We propose an approach to steer the learning in the weight parameters of a neural network based on the dynamic development and feedback from the training process itself.
arXiv Detail & Related papers (2020-05-25T13:45:47Z)
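The BAR entry above relies on zeroth-order optimization, i.e. estimating gradients from input-output queries alone. The sketch below is a generic illustration of that technique using two-sided finite differences along random directions; it is not BAR's actual estimator or reprogramming pipeline, and `zeroth_order_grad` is a hypothetical helper.

```python
import numpy as np


def zeroth_order_grad(loss_fn, x, num_dirs=20, mu=1e-3):
    """Estimate the gradient of loss_fn at x from function values only,
    averaging two-sided finite differences along random Gaussian directions."""
    grad = np.zeros_like(x)
    for _ in range(num_dirs):
        u = np.random.randn(*x.shape)
        grad += (loss_fn(x + mu * u) - loss_fn(x - mu * u)) / (2.0 * mu) * u
    return grad / num_dirs


# Toy usage: minimize a "black-box" quadratic using only its function values.
loss = lambda z: float(np.sum((z - 3.0) ** 2))
x = np.zeros(5)
for _ in range(200):
    x -= 0.05 * zeroth_order_grad(loss, x)
print(np.round(x, 2))  # approaches [3. 3. 3. 3. 3.]
```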