Multi-Goal Multi-Agent Path Finding via Decoupled and Integrated Goal
Vertex Ordering
- URL: http://arxiv.org/abs/2009.05161v1
- Date: Thu, 10 Sep 2020 22:27:18 GMT
- Title: Multi-Goal Multi-Agent Path Finding via Decoupled and Integrated Goal
Vertex Ordering
- Authors: Pavel Surynek
- Abstract summary: We introduce multi-goal multi agent path finding (MAPF$MG$) which generalizes the standard discrete multi-agent path finding (MAPF) problem.
We suggest two novel algorithms using different paradigms to address MAPF$MG$: a search-based search algorithm called Hamiltonian-CBS (HCBS) and a compilation-based algorithm built using the SMT paradigm, called SMT-Hamiltonian-CBS (SMT-HCBS)
- Score: 15.99072005190786
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We introduce multi-goal multi agent path finding (MAPF$^{MG}$) which
generalizes the standard discrete multi-agent path finding (MAPF) problem.
While the task in MAPF is to navigate agents in an undirected graph from their
starting vertices to one individual goal vertex per agent, MAPF$^{MG}$ assigns
each agent multiple goal vertices and the task is to visit each of them at
least once. Solving MAPF$^{MG}$ not only requires finding collision free paths
for individual agents but also determining the order of visiting agent's goal
vertices so that common objectives like the sum-of-costs are optimized. We
suggest two novel algorithms using different paradigms to address MAPF$^{MG}$:
a heuristic search-based search algorithm called Hamiltonian-CBS (HCBS) and a
compilation-based algorithm built using the SMT paradigm, called
SMT-Hamiltonian-CBS (SMT-HCBS). Experimental comparison suggests limitations of
compilation-based approach.
Related papers
- MGCBS: An Optimal and Efficient Algorithm for Solving Multi-Goal Multi-Agent Path Finding Problem [5.580214316179672]
The MG-MAPF problem requires each agent to visit pre-assigned multiple goal points at least once without conflict.
We propose the Multi-Goal Conflict-Based Search (MGCBS), which is based on Decoupling the goal Safe interval visiting order search and the Single-agent pathfinding.
Our method can consistently obtain optimal results and execute up to 7 times faster than the state-of-the-art method in our evaluation.
arXiv Detail & Related papers (2024-04-30T12:49:54Z) - Scalable Mechanism Design for Multi-Agent Path Finding [87.40027406028425]
Multi-Agent Path Finding (MAPF) involves determining paths for multiple agents to travel simultaneously and collision-free through a shared area toward given goal locations.
Finding an optimal solution is often computationally infeasible, making the use of approximate, suboptimal algorithms essential.
We introduce the problem of scalable mechanism design for MAPF and propose three strategyproof mechanisms, two of which even use approximate MAPF algorithms.
arXiv Detail & Related papers (2024-01-30T14:26:04Z) - Decentralized Monte Carlo Tree Search for Partially Observable
Multi-agent Pathfinding [49.730902939565986]
Multi-Agent Pathfinding problem involves finding a set of conflict-free paths for a group of agents confined to a graph.
In this study, we focus on the decentralized MAPF setting, where the agents may observe the other agents only locally.
We propose a decentralized multi-agent Monte Carlo Tree Search (MCTS) method for MAPF tasks.
arXiv Detail & Related papers (2023-12-26T06:57:22Z) - DMS*: Minimizing Makespan for Multi-Agent Combinatorial Path Finding [25.756524895372454]
Multi-Agent Combinatorial Path Finding (MCPF) seeks collision-free paths for multiple agents from their initial to goal locations.
Recent work develops methods to address MCPF while minimizing the sum of individual arrival times at goals.
This paper proposes a min-max variant of MCPF, denoted as MCPF-max, that minimizes the makespan of the agents.
arXiv Detail & Related papers (2023-12-11T11:53:31Z) - Monte-Carlo Tree Search for Multi-Agent Pathfinding: Preliminary Results [60.4817465598352]
We introduce an original variant of Monte-Carlo Tree Search (MCTS) tailored to multi-agent pathfinding.
Specifically, we use individual paths to assist the agents with the the goal-reaching behavior.
We also use a dedicated decomposition technique to reduce the branching factor of the tree search procedure.
arXiv Detail & Related papers (2023-07-25T12:33:53Z) - LaCAM: Search-Based Algorithm for Quick Multi-Agent Pathfinding [5.025654873456756]
We propose a novel complete algorithm for multi-agent pathfinding (MAPF) called lazy constraints addition search for MAPF (LaCAM)
LaCAM uses a two-level search to find solutions quickly, even with hundreds of agents or more.
Our experiments reveal that LaCAM is comparable to or outperforms state-of-the-art sub-optimal MAPF algorithms in a variety of scenarios.
arXiv Detail & Related papers (2022-11-24T06:27:18Z) - Adaptive Multi-Goal Exploration [118.40427257364729]
We show how AdaGoal can be used to tackle the objective of learning an $epsilon$-optimal goal-conditioned policy.
AdaGoal is anchored in the high-level algorithmic structure of existing methods for goal-conditioned deep reinforcement learning.
arXiv Detail & Related papers (2021-11-23T17:59:50Z) - Memory-Based Optimization Methods for Model-Agnostic Meta-Learning and
Personalized Federated Learning [56.17603785248675]
Model-agnostic meta-learning (MAML) has become a popular research area.
Existing MAML algorithms rely on the episode' idea by sampling a few tasks and data points to update the meta-model at each iteration.
This paper proposes memory-based algorithms for MAML that converge with vanishing error.
arXiv Detail & Related papers (2021-06-09T08:47:58Z) - A Provably Efficient Sample Collection Strategy for Reinforcement
Learning [123.69175280309226]
One of the challenges in online reinforcement learning (RL) is that the agent needs to trade off the exploration of the environment and the exploitation of the samples to optimize its behavior.
We propose to tackle the exploration-exploitation problem following a decoupled approach composed of: 1) An "objective-specific" algorithm that prescribes how many samples to collect at which states, as if it has access to a generative model (i.e., sparse simulator of the environment); 2) An "objective-agnostic" sample collection responsible for generating the prescribed samples as fast as possible.
arXiv Detail & Related papers (2020-07-13T15:17:35Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.