Bridging the Last Mile in Sim-to-Real Robot Perception via Bayesian
Active Learning
- URL: http://arxiv.org/abs/2109.11547v1
- Date: Thu, 23 Sep 2021 14:45:40 GMT
- Title: Bridging the Last Mile in Sim-to-Real Robot Perception via Bayesian
Active Learning
- Authors: Jianxiang Feng, Jongseok Lee, Maximilian Durner, Rudolph Triebel
- Abstract summary: We propose a pipeline that relies on deep Bayesian active learning and aims to minimize the manual annotation efforts.
In our experiments on two object detectiondata sets, we show that the labeling effort required to bridge thereality gap can be reduced to a small amount.
- Score: 34.910660020436424
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Learning from synthetic data is popular in avariety of robotic vision tasks
such as object detection, becauselarge amount of data can be generated without
annotationsby humans. However, when relying only on synthetic data,we encounter
the well-known problem of the simulation-to-reality (Sim-to-Real) gap, which is
hard to resolve completelyin practice. For such cases, real human-annotated
data isnecessary to bridge this gap, and in our work we focus on howto acquire
this data efficiently. Therefore, we propose a Sim-to-Real pipeline that relies
on deep Bayesian active learningand aims to minimize the manual annotation
efforts. We devisea learning paradigm that autonomously selects the data thatis
considered useful for the human expert to annotate. Toachieve this, a Bayesian
Neural Network (BNN) object detectorproviding reliable uncertain estimates is
adapted to infer theinformativeness of the unlabeled data, in order to
performactive learning. In our experiments on two object detectiondata sets, we
show that the labeling effort required to bridge thereality gap can be reduced
to a small amount. Furthermore, wedemonstrate the practical effectiveness of
this idea in a graspingtask on an assistive robot.
Related papers
- Information-driven Affordance Discovery for Efficient Robotic Manipulation [14.863105174430087]
We argue that well-directed interactions with the environment can mitigate this problem.
We provide a theoretical justification of our approach and we empirically validate the approach both in simulation and real-world tasks.
Our method, which we dub IDA, enables the efficient discovery of visual affordances for several action primitives.
arXiv Detail & Related papers (2024-05-06T21:25:51Z) - Learning Sim-to-Real Dense Object Descriptors for Robotic Manipulation [4.7246285569677315]
We present Sim-to-Real Dense Object Nets (SRDONs), a dense object descriptor that not only understands the object via appropriate representation but also maps simulated and real data to a unified feature space with pixel consistency.
We demonstrate in experiments that pre-trained SRDONs significantly improve performances on unseen objects and unseen visual environments for various robotic tasks with zero real-world training.
arXiv Detail & Related papers (2023-04-18T02:28:55Z) - Reinforcement Learning from Passive Data via Latent Intentions [86.4969514480008]
We show that passive data can still be used to learn features that accelerate downstream RL.
Our approach learns from passive data by modeling intentions.
Our experiments demonstrate the ability to learn from many forms of passive data, including cross-embodiment video data and YouTube videos.
arXiv Detail & Related papers (2023-04-10T17:59:05Z) - Sim2real Transfer Learning for Point Cloud Segmentation: An Industrial
Application Case on Autonomous Disassembly [55.41644538483948]
We present an industrial application case that uses sim2real transfer learning for point cloud data.
We provide insights on how to generate and process synthetic point cloud data.
A novel patch-based attention network is proposed additionally to tackle this problem.
arXiv Detail & Related papers (2023-01-12T14:00:37Z) - TRAIL: Near-Optimal Imitation Learning with Suboptimal Data [100.83688818427915]
We present training objectives that use offline datasets to learn a factored transition model.
Our theoretical analysis shows that the learned latent action space can boost the sample-efficiency of downstream imitation learning.
To learn the latent action space in practice, we propose TRAIL (Transition-Reparametrized Actions for Imitation Learning), an algorithm that learns an energy-based transition model.
arXiv Detail & Related papers (2021-10-27T21:05:00Z) - What Matters in Learning from Offline Human Demonstrations for Robot
Manipulation [64.43440450794495]
We conduct an extensive study of six offline learning algorithms for robot manipulation.
Our study analyzes the most critical challenges when learning from offline human data.
We highlight opportunities for learning from human datasets.
arXiv Detail & Related papers (2021-08-06T20:48:30Z) - Generating Annotated Training Data for 6D Object Pose Estimation in
Operational Environments with Minimal User Interaction [1.0044401320520304]
We present a proof of concept for a novel approach of autonomously generating annotated training data for 6D object pose estimation.
This approach is designed for learning new objects in operational environments while requiring little interaction and no expertise on the part of the user.
arXiv Detail & Related papers (2021-03-17T14:46:21Z) - Visual Imitation Made Easy [102.36509665008732]
We present an alternate interface for imitation that simplifies the data collection process while allowing for easy transfer to robots.
We use commercially available reacher-grabber assistive tools both as a data collection device and as the robot's end-effector.
We experimentally evaluate on two challenging tasks: non-prehensile pushing and prehensile stacking, with 1000 diverse demonstrations for each task.
arXiv Detail & Related papers (2020-08-11T17:58:50Z) - On the Robustness of Active Learning [0.7340017786387767]
Active Learning is concerned with how to identify the most useful samples for a Machine Learning algorithm to be trained with.
We find that it is often applied with not enough care and domain knowledge.
We propose the new "Sum of Squared Logits" method based on the Simpson diversity index and investigate the effect of using the confusion matrix for balancing in sample selection.
arXiv Detail & Related papers (2020-06-18T09:07:23Z) - Scalable Multi-Task Imitation Learning with Autonomous Improvement [159.9406205002599]
We build an imitation learning system that can continuously improve through autonomous data collection.
We leverage the robot's own trials as demonstrations for tasks other than the one that the robot actually attempted.
In contrast to prior imitation learning approaches, our method can autonomously collect data with sparse supervision for continuous improvement.
arXiv Detail & Related papers (2020-02-25T18:56:42Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.