Hallucinative Topological Memory for Zero-Shot Visual Planning
- URL: http://arxiv.org/abs/2002.12336v1
- Date: Thu, 27 Feb 2020 18:54:42 GMT
- Title: Hallucinative Topological Memory for Zero-Shot Visual Planning
- Authors: Kara Liu, Thanard Kurutach, Christine Tung, Pieter Abbeel, Aviv Tamar
- Abstract summary: In visual planning (VP), an agent learns to plan goal-directed behavior from observations of a dynamical system obtained offline.
Most previous works on VP approached the problem by planning in a learned latent space, resulting in low-quality visual plans.
Here, we propose a simple VP method that plans directly in image space and displays competitive performance.
- Score: 86.20780756832502
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In visual planning (VP), an agent learns to plan goal-directed behavior from
observations of a dynamical system obtained offline, e.g., images obtained from
self-supervised robot interaction. Most previous works on VP approached the
problem by planning in a learned latent space, resulting in low-quality visual
plans, and difficult training algorithms. Here, instead, we propose a simple VP
method that plans directly in image space and displays competitive performance.
We build on the semi-parametric topological memory (SPTM) method: image samples
are treated as nodes in a graph, the graph connectivity is learned from image
sequence data, and planning can be performed using conventional graph search
methods. We propose two modifications on SPTM. First, we train an energy-based
graph connectivity function using contrastive predictive coding that admits
stable training. Second, to allow zero-shot planning in new domains, we learn a
conditional VAE model that generates images given a context of the domain, and
use these hallucinated samples for building the connectivity graph and
planning. We show that this simple approach significantly outperform the
state-of-the-art VP methods, in terms of both plan interpretability and success
rate when using the plan to guide a trajectory-following controller.
Interestingly, our method can pick up non-trivial visual properties of objects,
such as their geometry, and account for it in the plans.
Related papers
- Planning as In-Painting: A Diffusion-Based Embodied Task Planning
Framework for Environments under Uncertainty [56.30846158280031]
Task planning for embodied AI has been one of the most challenging problems.
We propose a task-agnostic method named 'planning as in-painting'
The proposed framework achieves promising performances in various embodied AI tasks.
arXiv Detail & Related papers (2023-12-02T10:07:17Z) - Sequential Manipulation Planning on Scene Graph [90.28117916077073]
We devise a 3D scene graph representation, contact graph+ (cg+), for efficient sequential task planning.
Goal configurations, naturally specified on contact graphs, can be produced by a genetic algorithm with an optimization method.
A task plan is then succinct by computing the Graph Editing Distance (GED) between the initial contact graphs and the goal configurations, which generates graph edit operations corresponding to possible robot actions.
arXiv Detail & Related papers (2022-07-10T02:01:33Z) - Template based Graph Neural Network with Optimal Transport Distances [11.56532171513328]
Current Graph Neural Networks (GNN) architectures rely on two important components: node features embedding through message passing, and aggregation with a specialized form of pooling.
We propose in this work a novel point of view, which places distances to some learnable graph templates at the core of the graph representation.
This distance embedding is constructed thanks to an optimal transport distance: the Fused Gromov-Wasserstein (FGW) distance.
arXiv Detail & Related papers (2022-05-31T12:24:01Z) - Dynamic Graph Representation Learning via Graph Transformer Networks [41.570839291138114]
We propose a Transformer-based dynamic graph learning method named Dynamic Graph Transformer (DGT)
DGT has spatial-temporal encoding to effectively learn graph topology and capture implicit links.
We show that DGT presents superior performance compared with several state-of-the-art baselines.
arXiv Detail & Related papers (2021-11-19T21:44:23Z) - Learning to plan with uncertain topological maps [14.774543742438176]
We train an agent to navigate in 3D environments using a hierarchical strategy including a graph based planner and a local policy.
By performing an empirical analysis of our method in simulated photo-realistic 3D environments, we demonstrate that the inclusion of visual features in the learned neural planner outperforms classical symbolic solutions for graph based planning.
arXiv Detail & Related papers (2020-07-10T09:28:30Z) - Plan2Vec: Unsupervised Representation Learning by Latent Plans [106.37274654231659]
We introduce plan2vec, an unsupervised representation learning approach that is inspired by reinforcement learning.
Plan2vec constructs a weighted graph on an image dataset using near-neighbor distances, and then extrapolates this local metric to a global embedding by distilling path-integral over planned path.
We demonstrate the effectiveness of plan2vec on one simulated and two challenging real-world image datasets.
arXiv Detail & Related papers (2020-05-07T17:52:23Z) - Latent Space Roadmap for Visual Action Planning of Deformable and Rigid
Object Manipulation [74.88956115580388]
Planning is performed in a low-dimensional latent state space that embeds images.
Our framework consists of two main components: a Visual Foresight Module (VFM) that generates a visual plan as a sequence of images, and an Action Proposal Network (APN) that predicts the actions between them.
arXiv Detail & Related papers (2020-03-19T18:43:26Z) - Graph Ordering: Towards the Optimal by Learning [69.72656588714155]
Graph representation learning has achieved a remarkable success in many graph-based applications, such as node classification, prediction, and community detection.
However, for some kind of graph applications, such as graph compression and edge partition, it is very hard to reduce them to some graph representation learning tasks.
In this paper, we propose to attack the graph ordering problem behind such applications by a novel learning approach.
arXiv Detail & Related papers (2020-01-18T09:14:16Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.