A Feedback Scheme to Reorder a Multi-Agent Execution Schedule by
Persistently Optimizing a Switchable Action Dependency Graph
- URL: http://arxiv.org/abs/2010.05254v1
- Date: Sun, 11 Oct 2020 14:39:50 GMT
- Title: A Feedback Scheme to Reorder a Multi-Agent Execution Schedule by
Persistently Optimizing a Switchable Action Dependency Graph
- Authors: Alexander Berndt, Niels Van Duijkeren, Luigi Palmieri and Tamas
Keviczky
- Abstract summary: We consider multiple Automated Guided Vehicles (AGVs) navigating a common workspace to fulfill various intralogistics tasks.
One approach is to construct an Action Dependency Graph (ADG) which encodes the ordering of AGVs as they proceed along their routes.
If the workspace is shared by dynamic obstacles such as humans or third party robots, AGVs can experience large delays.
We present an online method to repeatedly modify acyclic ADG to minimize route completion times of each AGV.
- Score: 65.70656676650391
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In this paper we consider multiple Automated Guided Vehicles (AGVs)
navigating a common workspace to fulfill various intralogistics tasks,
typically formulated as the Multi-Agent Path Finding (MAPF) problem. To keep
plan execution deadlock-free, one approach is to construct an Action Dependency
Graph (ADG) which encodes the ordering of AGVs as they proceed along their
routes. Using this method, delayed AGVs occasionally require others to wait for
them at intersections, thereby affecting the plan execution efficiency. If the
workspace is shared by dynamic obstacles such as humans or third party robots,
AGVs can experience large delays. A common mitigation approach is to re-solve
the MAPF using the current, delayed AGV positions. However, solving the MAPF is
time-consuming, making this approach inefficient, especially for large AGV
teams. In this work, we present an online method to repeatedly modify a given
acyclic ADG to minimize route completion times of each AGV. Our approach
persistently maintains an acyclic ADG, necessary for deadlock-free plan
execution. We evaluate the approach by considering simulations with random
disturbances on the execution and show faster route completion times compared
to the baseline ADG-based execution management approach.
Related papers
- Dynamic Planning for LLM-based Graphical User Interface Automation [48.31532014795368]
We propose a novel approach called Dynamic Planning of Thoughts (D-PoT) for LLM-based GUI agents.
D-PoT involves the dynamic adjustment of planning based on the environmental feedback and execution history.
Experimental results reveal that the proposed D-PoT significantly surpassed the strong GPT-4V baseline by +12.7%.
arXiv Detail & Related papers (2024-10-01T07:49:24Z) - Autonomous Vehicle Controllers From End-to-End Differentiable Simulation [60.05963742334746]
We propose a differentiable simulator and design an analytic policy gradients (APG) approach to training AV controllers.
Our proposed framework brings the differentiable simulator into an end-to-end training loop, where gradients of environment dynamics serve as a useful prior to help the agent learn a more grounded policy.
We find significant improvements in performance and robustness to noise in the dynamics, as well as overall more intuitive human-like handling.
arXiv Detail & Related papers (2024-09-12T11:50:06Z) - Bidirectional Temporal Plan Graph: Enabling Switchable Passing Orders
for More Efficient Multi-Agent Path Finding Plan Execution [7.2988406091449605]
We introduce a new graphical representation called a Bidirectional Temporal Plan Graph (BTPG) which allows switching orders during execution to avoid unnecessary waiting time.
Experimental results show that following BTPGs consistently outperforms following TPGs, reducing unnecessary waits by 8-20%.
arXiv Detail & Related papers (2023-12-30T20:23:27Z) - Edge Generation Scheduling for DAG Tasks Using Deep Reinforcement
Learning [2.365237699556817]
Directed acyclic graph (DAG) tasks are currently adopted in the real-time domain to model complex applications.
We propose a new DAG scheduling framework that attempts to minimize the DAG width by iteratively generating edges.
We evaluate the effectiveness of the proposed algorithm by comparing it with state-of-the-art DAG schedulings and an optimal mixed-integer linear programming baseline.
arXiv Detail & Related papers (2023-08-28T15:19:18Z) - An Online Approach to Solve the Dynamic Vehicle Routing Problem with
Stochastic Trip Requests for Paratransit Services [5.649212162857776]
We propose a fully online approach to solve the dynamic vehicle routing problem (DVRP)
It is difficult to batch paratransit requests together as they are temporally sparse.
We use Monte Carlo tree search to evaluate actions for any given state.
arXiv Detail & Related papers (2022-03-28T22:15:52Z) - Optimal Solving of Constrained Path-Planning Problems with Graph
Convolutional Networks and Optimized Tree Search [12.457788665461312]
We propose a hybrid solving planner that combines machine learning models and an optimal solver.
We conduct experiments on realistic scenarios and show that GCN support enables substantial speedup and smoother scaling to harder problems.
arXiv Detail & Related papers (2021-08-02T16:53:21Z) - Gradient Coding with Dynamic Clustering for Straggler-Tolerant
Distributed Learning [55.052517095437]
gradient descent (GD) is widely employed to parallelize the learning task by distributing the dataset across multiple workers.
A significant performance bottleneck for the per-iteration completion time in distributed synchronous GD is $straggling$ workers.
Coded distributed techniques have been introduced recently to mitigate stragglers and to speed up GD iterations by assigning redundant computations to workers.
We propose a novel dynamic GC scheme, which assigns redundant data to workers to acquire the flexibility to choose from among a set of possible codes depending on the past straggling behavior.
arXiv Detail & Related papers (2021-03-01T18:51:29Z) - Symmetry Breaking for k-Robust Multi-Agent Path Finding [30.645303869311366]
k-Robust Conflict-BasedSearch (k-CBS) is an algorithm that produces coordinated and collision-free plan that is robust for up to k delays.
We introduce a variety of pairwise symmetry breaking constraints, specific to k-robust planning, that can efficiently find compatible and optimal paths for pairs of conflicting agents.
arXiv Detail & Related papers (2021-02-17T11:09:33Z) - Gradient Coding with Dynamic Clustering for Straggler Mitigation [57.9123881133818]
GC-DC regulates the number of straggling workers in each cluster based on the straggler behavior in the previous iteration.
We numerically show that GC-DC provides significant improvements in the average completion time (of each iteration) with no increase in the communication load compared to the original GC scheme.
arXiv Detail & Related papers (2020-11-03T18:52:15Z) - Dynamic Multi-Robot Task Allocation under Uncertainty and Temporal
Constraints [52.58352707495122]
We present a multi-robot allocation algorithm that decouples the key computational challenges of sequential decision-making under uncertainty and multi-agent coordination.
We validate our results over a wide range of simulations on two distinct domains: multi-arm conveyor belt pick-and-place and multi-drone delivery dispatch in a city.
arXiv Detail & Related papers (2020-05-27T01:10:41Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.