Long-Horizon Planning and Execution with Functional Object-Oriented
Networks
- URL: http://arxiv.org/abs/2207.05800v6
- Date: Fri, 2 Jun 2023 17:12:02 GMT
- Title: Long-Horizon Planning and Execution with Functional Object-Oriented
Networks
- Authors: David Paulius, Alejandro Agostini and Dongheui Lee
- Abstract summary: We introduce the idea of exploiting object-level knowledge as a FOON for task planning and execution.
Our approach automatically transforms FOON into PDDL and leverages off-the-shelf planners, action contexts, and robot skills.
We demonstrate our approach on long-horizon tasks in CoppeliaSim and show how learned action contexts can be extended to never-before-seen scenarios.
- Score: 79.94575713911189
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Following work on joint object-action representations, functional
object-oriented networks (FOON) were introduced as a knowledge graph
representation for robots. A FOON contains symbolic concepts useful to a
robot's understanding of tasks and its environment for object-level planning.
Prior to this work, little has been done to show how plans acquired from FOON
can be executed by a robot, as the concepts in a FOON are too abstract for
execution. We thereby introduce the idea of exploiting object-level knowledge
as a FOON for task planning and execution. Our approach automatically
transforms FOON into PDDL and leverages off-the-shelf planners, action
contexts, and robot skills in a hierarchical planning pipeline to generate
executable task plans. We demonstrate our entire approach on long-horizon tasks
in CoppeliaSim and show how learned action contexts can be extended to
never-before-seen scenarios.
Related papers
- Autonomous Behavior Planning For Humanoid Loco-manipulation Through Grounded Language Model [6.9268843428933025]
Large language models (LLMs) have demonstrated powerful planning and reasoning capabilities for comprehension and processing of semantic information.
We propose a novel language-model based framework that enables robots to autonomously plan behaviors and low-level execution under given textual instructions.
arXiv Detail & Related papers (2024-08-15T17:33:32Z) - Embodied Instruction Following in Unknown Environments [66.60163202450954]
We propose an embodied instruction following (EIF) method for complex tasks in the unknown environment.
We build a hierarchical embodied instruction following framework including the high-level task planner and the low-level exploration controller.
For the task planner, we generate the feasible step-by-step plans for human goal accomplishment according to the task completion process and the known visual clues.
arXiv Detail & Related papers (2024-06-17T17:55:40Z) - Embodied Task Planning with Large Language Models [86.63533340293361]
We propose a TAsk Planing Agent (TaPA) in embodied tasks for grounded planning with physical scene constraint.
During inference, we discover the objects in the scene by extending open-vocabulary object detectors to multi-view RGB images collected in different achievable locations.
Experimental results show that the generated plan from our TaPA framework can achieve higher success rate than LLaVA and GPT-3.5 by a sizable margin.
arXiv Detail & Related papers (2023-07-04T17:58:25Z) - Learning to Reason over Scene Graphs: A Case Study of Finetuning GPT-2
into a Robot Language Model for Grounded Task Planning [45.51792981370957]
We investigate the applicability of a smaller class of large language models (LLMs) in robotic task planning by learning to decompose tasks into subgoal specifications for a planner to execute sequentially.
Our method grounds the input of the LLM on the domain that is represented as a scene graph, enabling it to translate human requests into executable robot plans.
Our findings suggest that the knowledge stored in an LLM can be effectively grounded to perform long-horizon task planning, demonstrating the promising potential for the future application of neuro-symbolic planning methods in robotics.
arXiv Detail & Related papers (2023-05-12T18:14:32Z) - Knowledge Retrieval using Functional Object-Oriented Network [0.0]
The functional object-oriented network (FOON) is a knowledge representation for symbolic task planning that takes the shape of a graph.
A graph retrieval methodology is shown to produce manipulation motion sequences from the FOON to accomplish a desired aim.
The outcomes are illustrated using motion sequences created by the FOON to complete the desired objectives in a simulated environment.
arXiv Detail & Related papers (2022-11-06T06:02:29Z) - ProgPrompt: Generating Situated Robot Task Plans using Large Language
Models [68.57918965060787]
Large language models (LLMs) can be used to score potential next actions during task planning.
We present a programmatic LLM prompt structure that enables plan generation functional across situated environments.
arXiv Detail & Related papers (2022-09-22T20:29:49Z) - A Road-map to Robot Task Execution with the Functional Object-Oriented
Network [77.93376696738409]
functional object-oriented network (FOON) is a knowledge graph representation for robots.
Taking the form of a bipartite graph, a FOON contains symbolic or high-level information that would be pertinent to a robot's understanding of its environment and tasks.
arXiv Detail & Related papers (2021-06-01T00:43:04Z) - A Long Horizon Planning Framework for Manipulating Rigid Pointcloud
Objects [25.428781562909606]
We present a framework for solving long-horizon planning problems involving manipulation of rigid objects.
Our method plans in the space of object subgoals and frees the planner from reasoning about robot-object interaction dynamics.
arXiv Detail & Related papers (2020-11-16T18:59:33Z) - Latent Space Roadmap for Visual Action Planning of Deformable and Rigid
Object Manipulation [74.88956115580388]
Planning is performed in a low-dimensional latent state space that embeds images.
Our framework consists of two main components: a Visual Foresight Module (VFM) that generates a visual plan as a sequence of images, and an Action Proposal Network (APN) that predicts the actions between them.
arXiv Detail & Related papers (2020-03-19T18:43:26Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.