Hindsight States: Blending Sim and Real Task Elements for Efficient
Reinforcement Learning
- URL: http://arxiv.org/abs/2303.02234v1
- Date: Fri, 3 Mar 2023 21:55:04 GMT
- Title: Hindsight States: Blending Sim and Real Task Elements for Efficient
Reinforcement Learning
- Authors: Simon Guist, Jan Schneider, Alexander Dittrich, Vincent Berenz,
Bernhard Sch\"olkopf, Dieter B\"uchler
- Abstract summary: In robotics, one approach to generate training data builds on simulations based on dynamics models derived from first principles.
Here, we leverage the imbalance in complexity of the dynamics to learn more sample-efficiently.
We validate our method on several challenging simulated tasks and demonstrate that it improves learning both alone and when combined with an existing hindsight algorithm.
- Score: 61.3506230781327
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Reinforcement learning has shown great potential in solving complex tasks
when large amounts of data can be generated with little effort. In robotics,
one approach to generate training data builds on simulations based on dynamics
models derived from first principles. However, for tasks that, for instance,
involve complex soft robots, devising such models is substantially more
challenging. Being able to train effectively in increasingly complicated
scenarios with reinforcement learning enables to take advantage of complex
systems such as soft robots. Here, we leverage the imbalance in complexity of
the dynamics to learn more sample-efficiently. We (i) abstract the task into
distinct components, (ii) off-load the simple dynamics parts into the
simulation, and (iii) multiply these virtual parts to generate more data in
hindsight. Our new method, Hindsight States (HiS), uses this data and selects
the most useful transitions for training. It can be used with an arbitrary
off-policy algorithm. We validate our method on several challenging simulated
tasks and demonstrate that it improves learning both alone and when combined
with an existing hindsight algorithm, Hindsight Experience Replay (HER).
Finally, we evaluate HiS on a physical system and show that it boosts
performance on a complex table tennis task with a muscular robot. Videos and
code of the experiments can be found on webdav.tuebingen.mpg.de/his/.
Related papers
- VITAL: Visual Teleoperation to Enhance Robot Learning through Human-in-the-Loop Corrections [10.49712834719005]
We propose a low-cost visual teleoperation system for bimanual manipulation tasks, called VITAL.
Our approach leverages affordable hardware and visual processing techniques to collect demonstrations.
We enhance the generalizability and robustness of the learned policies by utilizing both real and simulated environments.
arXiv Detail & Related papers (2024-07-30T23:29:47Z) - Offline Imitation Learning Through Graph Search and Retrieval [57.57306578140857]
Imitation learning is a powerful machine learning algorithm for a robot to acquire manipulation skills.
We propose GSR, a simple yet effective algorithm that learns from suboptimal demonstrations through Graph Search and Retrieval.
GSR can achieve a 10% to 30% higher success rate and over 30% higher proficiency compared to baselines.
arXiv Detail & Related papers (2024-07-22T06:12:21Z) - GeRM: A Generalist Robotic Model with Mixture-of-experts for Quadruped Robot [27.410618312830497]
In this paper, we propose GeRM (Generalist Robotic Model)
We utilize offline reinforcement learning to optimize data utilization strategies.
We employ a transformer-based VLA network to process multi-modal inputs and output actions.
arXiv Detail & Related papers (2024-03-20T07:36:43Z) - REBOOT: Reuse Data for Bootstrapping Efficient Real-World Dexterous
Manipulation [61.7171775202833]
We introduce an efficient system for learning dexterous manipulation skills withReinforcement learning.
The main idea of our approach is the integration of recent advances in sample-efficient RL and replay buffer bootstrapping.
Our system completes the real-world training cycle by incorporating learned resets via an imitation-based pickup policy.
arXiv Detail & Related papers (2023-09-06T19:05:31Z) - Sim2real Transfer Learning for Point Cloud Segmentation: An Industrial
Application Case on Autonomous Disassembly [55.41644538483948]
We present an industrial application case that uses sim2real transfer learning for point cloud data.
We provide insights on how to generate and process synthetic point cloud data.
A novel patch-based attention network is proposed additionally to tackle this problem.
arXiv Detail & Related papers (2023-01-12T14:00:37Z) - Reset-Free Reinforcement Learning via Multi-Task Learning: Learning
Dexterous Manipulation Behaviors without Human Intervention [67.1936055742498]
We show that multi-task learning can effectively scale reset-free learning schemes to much more complex problems.
This work shows the ability to learn dexterous manipulation behaviors in the real world with RL without any human intervention.
arXiv Detail & Related papers (2021-04-22T17:38:27Z) - Data-Efficient Learning for Complex and Real-Time Physical Problem
Solving using Augmented Simulation [49.631034790080406]
We present a task for navigating a marble to the center of a circular maze.
We present a model that learns to move a marble in the complex environment within minutes of interacting with the real system.
arXiv Detail & Related papers (2020-11-14T02:03:08Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.