Bayesian Optimization for Developmental Robotics with Meta-Learning by
Parameters Bounds Reduction
- URL: http://arxiv.org/abs/2007.15375v1
- Date: Thu, 30 Jul 2020 10:55:56 GMT
- Title: Bayesian Optimization for Developmental Robotics with Meta-Learning by
Parameters Bounds Reduction
- Authors: Maxime Petit, Emmanuel Dellandrea and Liming Chen
- Abstract summary: We present a developmental framework based on long-term memory and reasoning modules (Bayesian optimisation, visual similarity and parameters bounds reduction)
We used a simulator to create bin-picking tasks for 8 different objects (7 in simulation and one with real setup, without and with meta-learning with experiences coming from other similar objects) achieving goods results despite a very small optimization budget, with a better performance reached when meta-learning is used.
- Score: 6.19424794628672
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In robotics, methods and softwares usually require optimizations of
hyperparameters in order to be efficient for specific tasks, for instance
industrial bin-picking from homogeneous heaps of different objects. We present
a developmental framework based on long-term memory and reasoning modules
(Bayesian Optimisation, visual similarity and parameters bounds reduction)
allowing a robot to use meta-learning mechanism increasing the efficiency of
such continuous and constrained parameters optimizations. The new optimization,
viewed as a learning for the robot, can take advantage of past experiences
(stored in the episodic and procedural memories) to shrink the search space by
using reduced parameters bounds computed from the best optimizations realized
by the robot with similar tasks of the new one (e.g. bin-picking from an
homogenous heap of a similar object, based on visual similarity of objects
stored in the semantic memory). As example, we have confronted the system to
the constrained optimizations of 9 continuous hyperparameters for a
professional software (Kamido) in industrial robotic arm bin-picking tasks, a
step that is needed each time to handle correctly new object. We used a
simulator to create bin-picking tasks for 8 different objects (7 in simulation
and one with real setup, without and with meta-learning with experiences coming
from other similar objects) achieving goods results despite a very small
optimization budget, with a better performance reached when meta-learning is
used (84.3% vs 78.9% of success overall, with a small budget of 30 iterations
for each optimization) for every object tested (p-value=0.036).
Related papers
- Unlearning as multi-task optimization: A normalized gradient difference approach with an adaptive learning rate [105.86576388991713]
We introduce a normalized gradient difference (NGDiff) algorithm, enabling us to have better control over the trade-off between the objectives.
We provide a theoretical analysis and empirically demonstrate the superior performance of NGDiff among state-of-the-art unlearning methods on the TOFU and MUSE datasets.
arXiv Detail & Related papers (2024-10-29T14:41:44Z) - CURE: Simulation-Augmented Auto-Tuning in Robotics [15.943773140929856]
This paper proposes CURE -- a method that identifies causally relevant configuration options.
CURE abstracts the causal relationships between various configuration options and robot performance objectives.
We demonstrate the effectiveness and transferability of CURE by conducting experiments in both physical robots and simulation.
arXiv Detail & Related papers (2024-02-08T04:27:14Z) - AdaLomo: Low-memory Optimization with Adaptive Learning Rate [59.64965955386855]
We introduce low-memory optimization with adaptive learning rate (AdaLomo) for large language models.
AdaLomo results on par with AdamW, while significantly reducing memory requirements, thereby lowering the hardware barrier to training large language models.
arXiv Detail & Related papers (2023-10-16T09:04:28Z) - Design Optimizer for Planar Soft-Growing Robot Manipulators [1.1888144645004388]
This work presents a novel approach for design optimization of soft-growing robots.
I optimize the kinematic chain of a soft manipulator to reach targets and avoid unnecessary overuse of material and resources.
I tested the proposed method on different tasks to access its optimality, which showed significant performance in solving the problem.
arXiv Detail & Related papers (2023-10-05T08:23:17Z) - Energy-efficient Task Adaptation for NLP Edge Inference Leveraging
Heterogeneous Memory Architectures [68.91874045918112]
adapter-ALBERT is an efficient model optimization for maximal data reuse across different tasks.
We demonstrate the advantage of mapping the model to a heterogeneous on-chip memory architecture by performing simulations on a validated NLP edge accelerator.
arXiv Detail & Related papers (2023-03-25T14:40:59Z) - Agent-based Collaborative Random Search for Hyper-parameter Tuning and
Global Function Optimization [0.0]
This paper proposes an agent-based collaborative technique for finding near-optimal values for any arbitrary set of hyper- parameters in a machine learning model.
The behavior of the presented model, specifically against the changes in its design parameters, is investigated in both machine learning and global function optimization applications.
arXiv Detail & Related papers (2023-03-03T21:10:17Z) - Visualization and Optimization Techniques for High Dimensional Parameter
Spaces [4.111899441919165]
We propose a novel approach to create an auto-tuning framework for storage systems optimization combining both direct optimization techniques and visual analytics research.
Our system was developed in tight collaboration with a group of systems performance researchers and its final effectiveness was evaluated with expert interviews, a comparative user study, and two case studies.
arXiv Detail & Related papers (2022-04-28T23:01:15Z) - DiffSkill: Skill Abstraction from Differentiable Physics for Deformable
Object Manipulations with Tools [96.38972082580294]
DiffSkill is a novel framework that uses a differentiable physics simulator for skill abstraction to solve deformable object manipulation tasks.
In particular, we first obtain short-horizon skills using individual tools from a gradient-based simulator.
We then learn a neural skill abstractor from the demonstration trajectories which takes RGBD images as input.
arXiv Detail & Related papers (2022-03-31T17:59:38Z) - Conservative Objective Models for Effective Offline Model-Based
Optimization [78.19085445065845]
Computational design problems arise in a number of settings, from synthetic biology to computer architectures.
We propose a method that learns a model of the objective function that lower bounds the actual value of the ground-truth objective on out-of-distribution inputs.
COMs are simple to implement and outperform a number of existing methods on a wide range of MBO problems.
arXiv Detail & Related papers (2021-07-14T17:55:28Z) - Bayesian Optimization with High-Dimensional Outputs [42.311308135418805]
In practice, we often wish to optimize objectives defined over many correlated outcomes (or tasks)
We devise an efficient technique for exact multi-task GP sampling that combines exploiting Kronecker structure in the covariance matrices with Matheron's identity.
We demonstrate how this unlocks a new class of applications for Bayesian Optimization across a range of tasks in science and engineering.
arXiv Detail & Related papers (2021-06-24T13:15:12Z) - AutoSimulate: (Quickly) Learning Synthetic Data Generation [70.82315853981838]
We propose an efficient alternative for optimal synthetic data generation based on a novel differentiable approximation of the objective.
We demonstrate that the proposed method finds the optimal data distribution faster (up to $50times$), with significantly reduced training data generation (up to $30times$) and better accuracy ($+8.7%$) on real-world test datasets than previous methods.
arXiv Detail & Related papers (2020-08-16T11:36:11Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.