Making Friends in the Dark: Ad Hoc Teamwork Under Partial Observability
- URL: http://arxiv.org/abs/2310.01439v1
- Date: Sat, 30 Sep 2023 16:40:50 GMT
- Title: Making Friends in the Dark: Ad Hoc Teamwork Under Partial Observability
- Authors: Jo\~ao G. Ribeiroa, Cassandro Martinhoa, Alberto Sardinhaa, and
Francisco S. Melo
- Abstract summary: This paper introduces a formal definition of the setting of ad hoc teamwork under partial observability.
Our results in 70 POMDPs from 11 domains show that our approach is not only effective in assisting unknown teammates in solving unknown tasks but is also robust in scaling to more challenging problems.
- Score: 11.786470737937638
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: This paper introduces a formal definition of the setting of ad hoc teamwork
under partial observability and proposes a first-principled model-based
approach which relies only on prior knowledge and partial observations of the
environment in order to perform ad hoc teamwork. We make three distinct
assumptions that set it apart previous works, namely: i) the state of the
environment is always partially observable, ii) the actions of the teammates
are always unavailable to the ad hoc agent and iii) the ad hoc agent has no
access to a reward signal which could be used to learn the task from scratch.
Our results in 70 POMDPs from 11 domains show that our approach is not only
effective in assisting unknown teammates in solving unknown tasks but is also
robust in scaling to more challenging problems.
Related papers
- No Regrets: Investigating and Improving Regret Approximations for Curriculum Discovery [53.08822154199948]
Unsupervised Environment Design (UED) methods have gained recent attention as their adaptive curricula promise to enable agents to be robust to in- and out-of-distribution tasks.
This work investigates how existing UED methods select training environments, focusing on task prioritisation metrics.
We develop a method that directly trains on scenarios with high learnability.
arXiv Detail & Related papers (2024-08-27T14:31:54Z) - Improving Zero-Shot ObjectNav with Generative Communication [60.84730028539513]
We propose a new method for improving zero-shot ObjectNav.
Our approach takes into account that the ground agent may have limited and sometimes obstructed view.
arXiv Detail & Related papers (2024-08-03T22:55:26Z) - Collaborative AI Teaming in Unknown Environments via Active Goal Deduction [22.842601384114058]
Existing approaches for training collaborative agents often require defined and known reward signals.
We propose teaming with unknown agents framework, which leverages kernel density Bayesian inverse learning method for active goal deduction.
We prove that unbiased reward estimates in our framework are sufficient for optimal teaming with unknown agents.
arXiv Detail & Related papers (2024-03-22T16:50:56Z) - Cooperation on the Fly: Exploring Language Agents for Ad Hoc Teamwork in
the Avalon Game [25.823665278297057]
This study focuses on the ad hoc teamwork problem where the agent operates in an environment driven by natural language.
Our findings reveal the potential of LLM agents in team collaboration, highlighting issues related to hallucinations in communication.
To address this issue, we develop CodeAct, a general agent that equips LLM with enhanced memory and code-driven reasoning.
arXiv Detail & Related papers (2023-12-29T08:26:54Z) - ProAgent: Building Proactive Cooperative Agents with Large Language
Models [89.53040828210945]
ProAgent is a novel framework that harnesses large language models to create proactive agents.
ProAgent can analyze the present state, and infer the intentions of teammates from observations.
ProAgent exhibits a high degree of modularity and interpretability, making it easily integrated into various coordination scenarios.
arXiv Detail & Related papers (2023-08-22T10:36:56Z) - Knowledge-based Reasoning and Learning under Partial Observability in Ad
Hoc Teamwork [4.454557728745761]
This paper introduces an architecture that determines an ad hoc agent's behavior based on non-monotonic logical reasoning.
It supports online selection, adaptation, and learning of the models that predict the other agents' behavior.
We show that the performance of our architecture is comparable or better than state of the art data-driven baselines in both simple and complex scenarios.
arXiv Detail & Related papers (2023-06-01T15:21:27Z) - Weakly-supervised HOI Detection via Prior-guided Bi-level Representation
Learning [66.00600682711995]
Human object interaction (HOI) detection plays a crucial role in human-centric scene understanding and serves as a fundamental building-block for many vision tasks.
One generalizable and scalable strategy for HOI detection is to use weak supervision, learning from image-level annotations only.
This is inherently challenging due to ambiguous human-object associations, large search space of detecting HOIs and highly noisy training signal.
We develop a CLIP-guided HOI representation capable of incorporating the prior knowledge at both image level and HOI instance level, and adopt a self-taught mechanism to prune incorrect human-object associations.
arXiv Detail & Related papers (2023-03-02T14:41:31Z) - A General Learning Framework for Open Ad Hoc Teamwork Using Graph-based
Policy Learning [11.998708550268978]
We develop a class of solutions for open ad hoc teamwork under full and partial observability.
We show that our solution can learn efficient policies in open ad hoc teamwork in fully and partially observable cases.
arXiv Detail & Related papers (2022-10-11T13:44:44Z) - Homomorphism Autoencoder -- Learning Group Structured Representations from Observed Transitions [51.71245032890532]
We propose methods enabling an agent acting upon the world to learn internal representations of sensory information consistent with actions that modify it.
In contrast to existing work, our approach does not require prior knowledge of the group and does not restrict the set of actions the agent can perform.
arXiv Detail & Related papers (2022-07-25T11:22:48Z) - Assisting Unknown Teammates in Unknown Tasks: Ad Hoc Teamwork under
Partial Observability [15.995282665634097]
We present a novel online prediction algorithm for the problem setting of ad hoc teamwork under partial observability (ATPO)
ATPO accommodates partial observability, using the agent's observations to identify which task is being performed by the teammates.
Our results show that ATPO is effective and robust in identifying the teammate's task from a large library of possible tasks, efficient at solving it in near-optimal time, and scalable in adapting to increasingly larger problem sizes.
arXiv Detail & Related papers (2022-01-10T18:53:34Z) - Exploring Zero-Shot Emergent Communication in Embodied Multi-Agent
Populations [59.608216900601384]
We study agents that learn to communicate via actuating their joints in a 3D environment.
We show that under realistic assumptions, a non-uniform distribution of intents and a common-knowledge energy cost, these agents can find protocols that generalize to novel partners.
arXiv Detail & Related papers (2020-10-29T19:23:10Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.