ELSIM: End-to-end learning of reusable skills through intrinsic
motivation
- URL: http://arxiv.org/abs/2006.12903v1
- Date: Tue, 23 Jun 2020 11:20:46 GMT
- Title: ELSIM: End-to-end learning of reusable skills through intrinsic
motivation
- Authors: Arthur Aubret, Laetitia Matignon and Salima Hassas
- Abstract summary: We present a novel reinforcement learning architecture which hierarchically learns and represents self-generated skills in an end-to-end way.
With this architecture, an agent focuses only on task-rewarded skills while keeping the learning process of skills bottom-up.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Taking inspiration from developmental learning, we present a novel
reinforcement learning architecture which hierarchically learns and represents
self-generated skills in an end-to-end way. With this architecture, an agent
focuses only on task-rewarded skills while keeping the learning process of
skills bottom-up. This bottom-up approach allows to learn skills that 1- are
transferable across tasks, 2- improves exploration when rewards are sparse. To
do so, we combine a previously defined mutual information objective with a
novel curriculum learning algorithm, creating an unlimited and explorable tree
of skills. We test our agent on simple gridworld environments to understand and
visualize how the agent distinguishes between its skills. Then we show that our
approach can scale on more difficult MuJoCo environments in which our agent is
able to build a representation of skills which improve over a baseline both
transfer learning and exploration when rewards are sparse.
Related papers
- Agentic Skill Discovery [19.5703917813767]
Language-conditioned robotic skills make it possible to apply the high-level reasoning of Large Language Models (LLMs) to low-level robotic control.
A remaining challenge is to acquire a diverse set of fundamental skills.
We introduce a novel framework for skill discovery that is entirely driven by LLMs.
arXiv Detail & Related papers (2024-05-23T19:44:03Z) - Bootstrap Your Own Skills: Learning to Solve New Tasks with Large
Language Model Guidance [66.615355754712]
BOSS learns to accomplish new tasks by performing "skill bootstrapping"
We demonstrate through experiments in realistic household environments that agents trained with our LLM-guided bootstrapping procedure outperform those trained with naive bootstrapping.
arXiv Detail & Related papers (2023-10-16T02:43:47Z) - Hierarchical Empowerment: Towards Tractable Empowerment-Based Skill
Learning [65.41865750258775]
General purpose agents will require large repertoires of skills.
We introduce a new framework, Hierarchical Empowerment, that makes computing empowerment more tractable.
In a popular ant navigation domain, our four level agents are able to learn skills that cover a surface area over two orders of magnitude larger than prior work.
arXiv Detail & Related papers (2023-07-06T02:27:05Z) - Choreographer: Learning and Adapting Skills in Imagination [60.09911483010824]
We present Choreographer, a model-based agent that exploits its world model to learn and adapt skills in imagination.
Our method decouples the exploration and skill learning processes, being able to discover skills in the latent state space of the model.
Choreographer is able to learn skills both from offline data, and by collecting data simultaneously with an exploration policy.
arXiv Detail & Related papers (2022-11-23T23:31:14Z) - SKILL-IL: Disentangling Skill and Knowledge in Multitask Imitation
Learning [21.222568055417717]
Humans are able to transfer skills and knowledge. If we can cycle to work and drive to the store, we can also cycle to the store and drive to work.
We take inspiration from this and hypothesize the latent memory of a policy network can be disentangled into two partitions.
These contain either the knowledge of the environmental context for the task or the generalizable skill needed to solve the task.
arXiv Detail & Related papers (2022-05-06T10:38:01Z) - Hierarchical Skills for Efficient Exploration [70.62309286348057]
In reinforcement learning, pre-trained low-level skills have the potential to greatly facilitate exploration.
Prior knowledge of the downstream task is required to strike the right balance between generality (fine-grained control) and specificity (faster learning) in skill design.
We propose a hierarchical skill learning framework that acquires skills of varying complexity in an unsupervised manner.
arXiv Detail & Related papers (2021-10-20T22:29:32Z) - Multi-task curriculum learning in a complex, visual, hard-exploration
domain: Minecraft [18.845438529816004]
We explore curriculum learning in a complex, visual domain with many hard exploration challenges: Minecraft.
We find that learning progress is a reliable measure of learnability for automatically constructing an effective curriculum.
arXiv Detail & Related papers (2021-06-28T17:50:40Z) - Latent Skill Planning for Exploration and Transfer [49.25525932162891]
In this paper, we investigate how these two approaches can be integrated into a single reinforcement learning agent.
We leverage the idea of partial amortization for fast adaptation at test time.
We demonstrate the benefits of our design decisions across a suite of challenging locomotion tasks.
arXiv Detail & Related papers (2020-11-27T18:40:03Z) - Emergent Real-World Robotic Skills via Unsupervised Off-Policy
Reinforcement Learning [81.12201426668894]
We develop efficient reinforcement learning methods that acquire diverse skills without any reward function, and then repurpose these skills for downstream tasks.
We show that our proposed algorithm provides substantial improvement in learning efficiency, making reward-free real-world training feasible.
We also demonstrate that the learned skills can be composed using model predictive control for goal-oriented navigation, without any additional training.
arXiv Detail & Related papers (2020-04-27T17:38:53Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.