Deadlock-Free Method for Multi-Agent Pickup and Delivery Problem Using
Priority Inheritance with Temporary Priority
- URL: http://arxiv.org/abs/2205.12504v1
- Date: Wed, 25 May 2022 05:45:22 GMT
- Title: Deadlock-Free Method for Multi-Agent Pickup and Delivery Problem Using
Priority Inheritance with Temporary Priority
- Authors: Yukita Fujitani, Tomoki Yamauchi, Yuki Miyashita and Toshiharu
Sugawara
- Abstract summary: This paper proposes a control method for the multi-agent pickup and delivery problem (MAPD problem) by extending the priority inheritance with backtracking (PIBT) method.
PIBT is only applicable to environments that are modeled as a bi-connected area, and if it contains dead-ends, such as tree-shaped paths, PIBT may cause deadlocks.
Our proposed method enables MAPD tasks to be performed in environments with some tree-shaped paths without deadlock while preserving the PIBT feature.
- Score: 2.064612766965483
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: This paper proposes a control method for the multi-agent pickup and delivery
problem (MAPD problem) by extending the priority inheritance with backtracking
(PIBT) method to make it applicable to more general environments. PIBT is an
effective algorithm that introduces a priority to each agent, and at each
timestep, the agents, in descending order of priority, decide their next
neighboring locations in the next timestep through communications only with the
local agents. Unfortunately, PIBT is only applicable to environments that are
modeled as a bi-connected area, and if it contains dead-ends, such as
tree-shaped paths, PIBT may cause deadlocks. However, in the real-world
environment, there are many dead-end paths to locations such as the shelves
where materials are stored as well as loading/unloading locations to
transportation trucks. Our proposed method enables MAPD tasks to be performed
in environments with some tree-shaped paths without deadlock while preserving
the PIBT feature; it does this by allowing the agents to have temporary
priorities and restricting agents' movements in the trees. First, we
demonstrate that agents can always reach their delivery without deadlock. Our
experiments indicate that the proposed method is very efficient, even in
environments where PIBT is not applicable, by comparing them with those
obtained using the well-known token passing method as a baseline.
Related papers
- Decentralized Monte Carlo Tree Search for Partially Observable
Multi-agent Pathfinding [49.730902939565986]
Multi-Agent Pathfinding problem involves finding a set of conflict-free paths for a group of agents confined to a graph.
In this study, we focus on the decentralized MAPF setting, where the agents may observe the other agents only locally.
We propose a decentralized multi-agent Monte Carlo Tree Search (MCTS) method for MAPF tasks.
arXiv Detail & Related papers (2023-12-26T06:57:22Z) - AlberDICE: Addressing Out-Of-Distribution Joint Actions in Offline
Multi-Agent RL via Alternating Stationary Distribution Correction Estimation [65.4532392602682]
One of the main challenges in offline Reinforcement Learning (RL) is the distribution shift that arises from the learned policy deviating from the data collection policy.
This is often addressed by avoiding out-of-distribution (OOD) actions during policy improvement as their presence can lead to substantial performance degradation.
We introduce AlberDICE, an offline MARL algorithm that performs centralized training of individual agents based on stationary distribution optimization.
arXiv Detail & Related papers (2023-11-03T18:56:48Z) - Learn to Follow: Decentralized Lifelong Multi-agent Pathfinding via
Planning and Learning [46.354187895184154]
Multi-agent Pathfinding (MAPF) problem generally asks to find a set of conflict-free paths for a set of agents confined to a graph.
In this work, we investigate the decentralized MAPF setting, when the central controller that posses all the information on the agents' locations and goals is absent.
We focus on the practically important lifelong variant of MAPF, which involves continuously assigning new goals to the agents upon arrival to the previous ones.
arXiv Detail & Related papers (2023-10-02T13:51:32Z) - AI planning in the imagination: High-level planning on learned abstract
search spaces [68.75684174531962]
We propose a new method, called PiZero, that gives an agent the ability to plan in an abstract search space that the agent learns during training.
We evaluate our method on multiple domains, including the traveling salesman problem, Sokoban, 2048, the facility location problem, and Pacman.
arXiv Detail & Related papers (2023-08-16T22:47:16Z) - Robust Multi-Agent Pickup and Delivery with Delays [5.287544737925232]
Multi-Agent Pickup and Delivery (MAPD) is the problem of computing collision-free paths for a group of agents.
Current algorithms for MAPD do not consider many of the practical issues encountered in real applications.
We present two solution approaches that provide robustness guarantees by planning paths that limit the effects of imperfect execution.
arXiv Detail & Related papers (2023-03-30T14:42:41Z) - Continual Test-Time Domain Adaptation [94.51284735268597]
Test-time domain adaptation aims to adapt a source pre-trained model to a target domain without using any source data.
CoTTA is easy to implement and can be readily incorporated in off-the-shelf pre-trained models.
arXiv Detail & Related papers (2022-03-25T11:42:02Z) - Multi-Agent Terraforming: Efficient Multi-Agent Path Finding via
Environment Manipulation [12.401344261399613]
Multi-agent pathfinding is concerned with planning collision-free paths for a team of agents from their start to goal locations in an environment cluttered with obstacles.
We introduce a new extension of MAPF, which we call Terraforming MAPF (tMAPF), where some agents are responsible for moving obstacles to clear the way for other agents.
We present extensions of two state-of-the-art algorithms, CBS and PBS, in order to tackle tMAPF, and demonstrate that they can consistently outperform the best solution possible under a static-obstacle setting.
arXiv Detail & Related papers (2022-03-20T12:18:35Z) - Optimal Multi-Agent Path Finding for Precedence Constrained Planning
Tasks [0.7742297876120561]
We consider an extension to this problem, Precedence Constrained Multi-Agent Path Finding (PC-MAPF)
We propose a novel algorithm, Precedence Constrained Conflict Based Search (PC-CBS), which finds makespan-optimal solutions for this class of problems.
We benchmark the performance of this algorithm over various warehouse assembly, and multi-agent pickup and delivery tasks, and use it to evaluate the sub-optimality of a recently proposed efficient baseline.
arXiv Detail & Related papers (2022-02-08T07:26:45Z) - Standby-Based Deadlock Avoidance Method for Multi-Agent Pickup and
Delivery Tasks [2.3204178451683264]
We propose a deadlock avoidance method called standby-based deadlock avoidance (SBDA)
SBDA uses standby nodes determined in real-time using the articulation-point-finding algorithm.
We demonstrated that our proposed method outperforms a conventional approach.
arXiv Detail & Related papers (2022-01-16T10:28:52Z) - Decentralised Approach for Multi Agent Path Finding [6.599344783327053]
Multi Agent Path Finding (MAPF) requires identification of conflict free paths for spatially-extended agents.
These find application in real world problems like Convoy Movement Problem, Train Scheduling etc.
Our proposed approach, Decentralised Multi Agent Path Finding (DeMAPF), handles MAPF as a sequence of pathplanning and allocation problems.
arXiv Detail & Related papers (2021-06-03T18:07:26Z) - A Feedback Scheme to Reorder a Multi-Agent Execution Schedule by
Persistently Optimizing a Switchable Action Dependency Graph [65.70656676650391]
We consider multiple Automated Guided Vehicles (AGVs) navigating a common workspace to fulfill various intralogistics tasks.
One approach is to construct an Action Dependency Graph (ADG) which encodes the ordering of AGVs as they proceed along their routes.
If the workspace is shared by dynamic obstacles such as humans or third party robots, AGVs can experience large delays.
We present an online method to repeatedly modify acyclic ADG to minimize route completion times of each AGV.
arXiv Detail & Related papers (2020-10-11T14:39:50Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.