Waypoint-Based Imitation Learning for Robotic Manipulation
- URL: http://arxiv.org/abs/2307.14326v1
- Date: Wed, 26 Jul 2023 17:45:55 GMT
- Title: Waypoint-Based Imitation Learning for Robotic Manipulation
- Authors: Lucy Xiaoyang Shi, Archit Sharma, Tony Z. Zhao, Chelsea Finn
- Abstract summary: Waypoint labeling is underspecified, and requires additional human supervision.
We propose Automatic Waypoint Extraction (AWE) for imitation learning.
AWE can increase the success rate of state-of-the-art algorithms by up to 25% in simulation and by 4-28% on real-world bimanual manipulation tasks.
- Score: 56.4287610994102
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: While imitation learning methods have seen a resurgent interest for robotic
manipulation, the well-known problem of compounding errors continues to afflict
behavioral cloning (BC). Waypoints can help address this problem by reducing
the horizon of the learning problem for BC, and thus, the errors compounded
over time. However, waypoint labeling is underspecified, and requires
additional human supervision. Can we generate waypoints automatically without
any additional human supervision? Our key insight is that if a trajectory
segment can be approximated by linear motion, the endpoints can be used as
waypoints. We propose Automatic Waypoint Extraction (AWE) for imitation
learning, a preprocessing module to decompose a demonstration into a minimal
set of waypoints which when interpolated linearly can approximate the
trajectory up to a specified error threshold. AWE can be combined with any BC
algorithm, and we find that AWE can increase the success rate of
state-of-the-art algorithms by up to 25% in simulation and by 4-28% on
real-world bimanual manipulation tasks, reducing the decision making horizon by
up to a factor of 10. Videos and code are available at
https://lucys0.github.io/awe/
Related papers
- Dense Optical Tracking: Connecting the Dots [82.79642869586587]
DOT is a novel, simple and efficient method for solving the problem of point tracking in a video.
We show that DOT is significantly more accurate than current optical flow techniques, outperforms sophisticated "universal trackers" like OmniMotion, and is on par with, or better than, the best point tracking algorithms like CoTracker.
arXiv Detail & Related papers (2023-12-01T18:59:59Z) - Neural Potential Field for Obstacle-Aware Local Motion Planning [46.42871544295734]
We propose a neural network model that returns a differentiable collision cost based on robot pose, obstacle map, and robot footprint.
Our architecture includes neural image encoders, which transform obstacle maps and robot footprints into embeddings.
Experiment on Husky UGV mobile robot showed that our approach allows real-time and safe local planning.
arXiv Detail & Related papers (2023-10-25T05:00:21Z) - Fighting Uncertainty with Gradients: Offline Reinforcement Learning via
Diffusion Score Matching [22.461036967440723]
We study smoothed distance to data as an uncertainty metric, and claim that it has two beneficial properties.
We show these gradients can be efficiently learned with score-matching techniques.
We propose Score-Guided Planning (SGP) to enable first-order planning in high-dimensional problems.
arXiv Detail & Related papers (2023-06-24T23:40:58Z) - PointFlowHop: Green and Interpretable Scene Flow Estimation from
Consecutive Point Clouds [49.7285297470392]
An efficient 3D scene flow estimation method called PointFlowHop is proposed in this work.
PointFlowHop takes two consecutive point clouds and determines the 3D flow vectors for every point in the first point cloud.
It decomposes the scene flow estimation task into a set of subtasks, including ego-motion compensation, object association and object-wise motion estimation.
arXiv Detail & Related papers (2023-02-27T23:06:01Z) - Multi-Robot Active Mapping via Neural Bipartite Graph Matching [49.72892929603187]
We study the problem of multi-robot active mapping, which aims for complete scene map construction in minimum time steps.
The key to this problem lies in the goal position estimation to enable more efficient robot movements.
We propose a novel algorithm, namely NeuralCoMapping, which takes advantage of both approaches.
arXiv Detail & Related papers (2022-03-30T14:03:17Z) - Learning Time-optimized Path Tracking with or without Sensory Feedback [5.254093731341154]
We present a learning-based approach that allows a robot to quickly follow a reference path defined in joint space.
The robot is controlled by a neural network that is trained via reinforcement learning using data generated by a physics simulator.
arXiv Detail & Related papers (2022-03-03T19:13:31Z) - Continuous Motion Planning with Temporal Logic Specifications using Deep
Neural Networks [16.296473750342464]
We propose a model-free reinforcement learning method to synthesize control policies for motion planning problems.
The robot is modelled as a discrete Markovtime decision process (MDP) with continuous state and action spaces.
We train deep neural networks to approximate the value function and policy using an actorcritic reinforcement learning method.
arXiv Detail & Related papers (2020-04-02T17:58:03Z) - Key Points Estimation and Point Instance Segmentation Approach for Lane
Detection [65.37887088194022]
We propose a traffic line detection method called Point Instance Network (PINet)
The PINet includes several stacked hourglass networks that are trained simultaneously.
The PINet achieves competitive accuracy and false positive on the TuSimple and Culane datasets.
arXiv Detail & Related papers (2020-02-16T15:51:30Z) - PointHop++: A Lightweight Learning Model on Point Sets for 3D
Classification [55.887502438160304]
The PointHop method was recently proposed by Zhang et al. for 3D point cloud classification with unsupervised feature extraction.
We improve the PointHop method furthermore in two aspects: 1) reducing its model complexity in terms of the model parameter number and 2) ordering discriminant features automatically based on the cross-entropy criterion.
With experiments conducted on the ModelNet40 benchmark dataset, we show that the PointHop++ method performs on par with deep neural network (DNN) solutions and surpasses other unsupervised feature extraction methods.
arXiv Detail & Related papers (2020-02-09T04:49:32Z) - Closed-loop deep learning: generating forward models with
back-propagation [0.0]
A reflex is a simple closed loop control approach which tries to minimise an error but fails to do so because it will always react too late.
An adaptive algorithm can use this error to learn a forward model with the help of predictive cues.
We show how this can be directly achieved by embedding deep learning into a closed loop system and preserving its continuous processing.
arXiv Detail & Related papers (2020-01-09T13:36:57Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.