Monte-Carlo Tree Search for Multi-Agent Pathfinding: Preliminary Results
- URL: http://arxiv.org/abs/2307.13453v1
- Date: Tue, 25 Jul 2023 12:33:53 GMT
- Title: Monte-Carlo Tree Search for Multi-Agent Pathfinding: Preliminary Results
- Authors: Yelisey Pitanov, Alexey Skrynnik, Anton Andreychuk, Konstantin
Yakovlev, Aleksandr Panov
- Abstract summary: We introduce an original variant of Monte-Carlo Tree Search (MCTS) tailored to multi-agent pathfinding.
Specifically, we use individual paths to assist the agents with the the goal-reaching behavior.
We also use a dedicated decomposition technique to reduce the branching factor of the tree search procedure.
- Score: 60.4817465598352
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: In this work we study a well-known and challenging problem of Multi-agent
Pathfinding, when a set of agents is confined to a graph, each agent is
assigned a unique start and goal vertices and the task is to find a set of
collision-free paths (one for each agent) such that each agent reaches its
respective goal. We investigate how to utilize Monte-Carlo Tree Search (MCTS)
to solve the problem. Although MCTS was shown to demonstrate superior
performance in a wide range of problems like playing antagonistic games (e.g.
Go, Chess etc.), discovering faster matrix multiplication algorithms etc., its
application to the problem at hand was not well studied before. To this end we
introduce an original variant of MCTS, tailored to multi-agent pathfinding. The
crux of our approach is how the reward, that guides MCTS, is computed.
Specifically, we use individual paths to assist the agents with the the
goal-reaching behavior, while leaving them freedom to get off the track if it
is needed to avoid collisions. We also use a dedicated decomposition technique
to reduce the branching factor of the tree search procedure. Empirically we
show that the suggested method outperforms the baseline planning algorithm that
invokes heuristic search, e.g. A*, at each re-planning step.
Related papers
- LiteSearch: Efficacious Tree Search for LLM [70.29796112457662]
This study introduces a novel guided tree search algorithm with dynamic node selection and node-level exploration budget.
Experiments conducted on the GSM8K and TabMWP datasets demonstrate that our approach enjoys significantly lower computational costs compared to baseline methods.
arXiv Detail & Related papers (2024-06-29T05:14:04Z) - MGCBS: An Optimal and Efficient Algorithm for Solving Multi-Goal Multi-Agent Path Finding Problem [5.580214316179672]
The MG-MAPF problem requires each agent to visit pre-assigned multiple goal points at least once without conflict.
We propose the Multi-Goal Conflict-Based Search (MGCBS), which is based on Decoupling the goal Safe interval visiting order search and the Single-agent pathfinding.
Our method can consistently obtain optimal results and execute up to 7 times faster than the state-of-the-art method in our evaluation.
arXiv Detail & Related papers (2024-04-30T12:49:54Z) - Optimal and Bounded Suboptimal Any-Angle Multi-agent Pathfinding [13.296796764344169]
We present the first optimal any-angle multi-agent pathfinding algorithm.
Our planner is based on the Continuous Conflict-based Search (CCBS) algorithm and an optimal any-angle variant of the Safe Interval Path Planning (TO-AA-SIPP)
We adapt two techniques from classical MAPF to the any-angle setting, namely Disjoint Splitting and Multi-Constraints.
arXiv Detail & Related papers (2024-04-25T07:41:47Z) - Scalable Mechanism Design for Multi-Agent Path Finding [87.40027406028425]
Multi-Agent Path Finding (MAPF) involves determining paths for multiple agents to travel simultaneously and collision-free through a shared area toward given goal locations.
Finding an optimal solution is often computationally infeasible, making the use of approximate, suboptimal algorithms essential.
We introduce the problem of scalable mechanism design for MAPF and propose three strategyproof mechanisms, two of which even use approximate MAPF algorithms.
arXiv Detail & Related papers (2024-01-30T14:26:04Z) - Decentralized Monte Carlo Tree Search for Partially Observable
Multi-agent Pathfinding [49.730902939565986]
Multi-Agent Pathfinding problem involves finding a set of conflict-free paths for a group of agents confined to a graph.
In this study, we focus on the decentralized MAPF setting, where the agents may observe the other agents only locally.
We propose a decentralized multi-agent Monte Carlo Tree Search (MCTS) method for MAPF tasks.
arXiv Detail & Related papers (2023-12-26T06:57:22Z) - Improved Anonymous Multi-Agent Path Finding Algorithm [3.694001372535405]
We consider an Anonymous Multi-Agent Path-Finding (AMAPF) problem where the set of agents is confined to a graph.
We suggest a specific search algorithm that leverages the idea of exploring the search space not through considering separate search states but rather bulks of them simultaneously.
The resultant AMAPF solver demonstrates superior performance compared to the state-of-the-art competitor and is able to solve all publicly available MAPF instances in less than 30 seconds.
arXiv Detail & Related papers (2023-12-17T00:49:30Z) - Reinforcement Learning for Branch-and-Bound Optimisation using
Retrospective Trajectories [72.15369769265398]
Machine learning has emerged as a promising paradigm for branching.
We propose retro branching; a simple yet effective approach to RL for branching.
We outperform the current state-of-the-art RL branching algorithm by 3-5x and come within 20% of the best IL method's performance on MILPs with 500 constraints and 1000 variables.
arXiv Detail & Related papers (2022-05-28T06:08:07Z) - Multi-Goal Multi-Agent Path Finding via Decoupled and Integrated Goal
Vertex Ordering [15.99072005190786]
We introduce multi-goal multi agent path finding (MAPF$MG$) which generalizes the standard discrete multi-agent path finding (MAPF) problem.
We suggest two novel algorithms using different paradigms to address MAPF$MG$: a search-based search algorithm called Hamiltonian-CBS (HCBS) and a compilation-based algorithm built using the SMT paradigm, called SMT-Hamiltonian-CBS (SMT-HCBS)
arXiv Detail & Related papers (2020-09-10T22:27:18Z) - Dynamic Multi-Robot Task Allocation under Uncertainty and Temporal
Constraints [52.58352707495122]
We present a multi-robot allocation algorithm that decouples the key computational challenges of sequential decision-making under uncertainty and multi-agent coordination.
We validate our results over a wide range of simulations on two distinct domains: multi-arm conveyor belt pick-and-place and multi-drone delivery dispatch in a city.
arXiv Detail & Related papers (2020-05-27T01:10:41Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.