Language-guided Task Adaptation for Imitation Learning
- URL: http://arxiv.org/abs/2301.09770v1
- Date: Tue, 24 Jan 2023 00:56:43 GMT
- Title: Language-guided Task Adaptation for Imitation Learning
- Authors: Prasoon Goyal, Raymond J. Mooney, Scott Niekum
- Abstract summary: We introduce a novel setting, wherein an agent needs to learn a task from a demonstration of a related task with the difference between the tasks communicated in natural language.
The proposed setting allows reusing demonstrations from other tasks, by providing low effort language descriptions, and can also be used to provide feedback to correct agent errors.
- Score: 40.1007184209417
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We introduce a novel setting, wherein an agent needs to learn a task from a
demonstration of a related task with the difference between the tasks
communicated in natural language. The proposed setting allows reusing
demonstrations from other tasks, by providing low effort language descriptions,
and can also be used to provide feedback to correct agent errors, which are
both important desiderata for building intelligent agents that assist humans in
daily tasks. To enable progress in this proposed setting, we create two
benchmarks -- Room Rearrangement and Room Navigation -- that cover a diverse
set of task adaptations. Further, we propose a framework that uses a
transformer-based model to reason about the entities in the tasks and their
relationships, to learn a policy for the target task.
Related papers
- Leverage Task Context for Object Affordance Ranking [57.59106517732223]
We build the first large-scale task-oriented affordance ranking dataset with 25 common tasks, over 50k images and more than 661k objects.
Results demonstrate the feasibility of the task context based affordance learning paradigm and the superiority of our model over state-of-the-art models in the fields of saliency ranking and multimodal object detection.
arXiv Detail & Related papers (2024-11-25T04:22:33Z)
- UniverSLU: Universal Spoken Language Understanding for Diverse Tasks with Natural Language Instructions [64.50935101415776]
We build a single model that jointly performs various spoken language understanding (SLU) tasks.
We demonstrate the efficacy of our single multi-task learning model "UniverSLU" for 12 speech classification and sequence generation task types spanning 17 datasets and 9 languages.
arXiv Detail & Related papers (2023-10-04T17:10:23Z)
- Learning Task Embeddings for Teamwork Adaptation in Multi-Agent Reinforcement Learning [13.468555224407764]
We show that a team of agents is able to adapt to novel tasks when provided with task embeddings.
We propose three MATE training paradigms: independent MATE, centralised MATE, and mixed MATE.
We show that the embeddings learned by MATE identify tasks and provide useful information which agents leverage during adaptation to novel tasks.
arXiv Detail & Related papers (2022-07-05T18:23:20Z)
- Fast Inference and Transfer of Compositional Task Structures for Few-shot Task Generalization [101.72755769194677]
We formulate it as a few-shot reinforcement learning problem where a task is characterized by a subtask graph.
Our multi-task subtask graph inferencer (MTSGI) first infers the common high-level task structure in terms of the subtask graph from the training tasks.
Our experiment results on 2D grid-world and complex web navigation domains show that the proposed method can learn and leverage the common underlying structure of the tasks for faster adaptation to the unseen tasks.
arXiv Detail & Related papers (2022-05-25T10:44:25Z)
- One-Shot Learning from a Demonstration with Hierarchical Latent Language [43.140223608960554]
We introduce DescribeWorld, an environment designed to test this sort of generalization skill in grounded agents.
The agent observes a single task demonstration in a Minecraft-like grid world, and is then asked to carry out the same task in a new map.
We find that agents that perform text-based inference are better equipped for the challenge under a random split of tasks.
arXiv Detail & Related papers (2022-03-09T15:36:43Z)
- Multi-Agent Policy Transfer via Task Relationship Modeling [28.421365805638953]
We try to discover and exploit common structures among tasks for more efficient transfer.
We propose to learn effect-based task representations as a common space of tasks, using an alternatively fixed training scheme.
As a result, the proposed method can help transfer learned cooperation knowledge to new tasks after training on a few source tasks.
arXiv Detail & Related papers (2022-03-09T01:49:21Z)
- Learning to Follow Language Instructions with Compositional Policies [22.778677208048475]
We propose a framework that learns to execute natural language instructions in an environment consisting of goal-reaching tasks.
We train a reinforcement learning agent to learn value functions that can be subsequently composed through a Boolean algebra.
We fine-tune a seq2seq model pretrained on web-scale corpora to map language to logical expressions.
arXiv Detail & Related papers (2021-10-09T21:28:26Z)
- Zero-shot Task Adaptation using Natural Language [43.807555235240365]
We propose a novel setting where an agent is given both a demonstration and a description.
Our approach is able to complete more than 95% of target tasks when using template-based descriptions.
arXiv Detail & Related papers (2021-06-05T21:39:04Z)
- Adaptive Procedural Task Generation for Hard-Exploration Problems [78.20918366839399]
We introduce Adaptive Procedural Task Generation (APT-Gen) to facilitate reinforcement learning in hard-exploration problems.
At the heart of our approach is a task generator that learns to create tasks from a parameterized task space via a black-box procedural generation module.
To enable curriculum learning in the absence of a direct indicator of learning progress, we propose to train the task generator by balancing the agent's performance in the generated tasks and the similarity to the target tasks.
arXiv Detail & Related papers (2020-07-01T09:38:51Z)
This list is automatically generated from the titles and abstracts of the papers in this site.