An Online Approach to Solve the Dynamic Vehicle Routing Problem with
Stochastic Trip Requests for Paratransit Services
- URL: http://arxiv.org/abs/2203.15127v1
- Date: Mon, 28 Mar 2022 22:15:52 GMT
- Title: An Online Approach to Solve the Dynamic Vehicle Routing Problem with
Stochastic Trip Requests for Paratransit Services
- Authors: Michael Wilbur, Salah Uddin Kadir, Youngseo Kim, Geoffrey Pettet, Ayan
Mukhopadhyay, Philip Pugliese, Samitha Samaranayake, Aron Laszka and Abhishek
Dubey
- Abstract summary: We propose a fully online approach to solve the dynamic vehicle routing problem (DVRP)
It is difficult to batch paratransit requests together as they are temporally sparse.
We use Monte Carlo tree search to evaluate actions for any given state.
- Score: 5.649212162857776
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Many transit agencies operating paratransit and microtransit services have to
respond to trip requests that arrive in real-time, which entails solving hard
combinatorial and sequential decision-making problems under uncertainty. To
avoid decisions that lead to significant inefficiency in the long term,
vehicles should be allocated to requests by optimizing a non-myopic utility
function or by batching requests together and optimizing a myopic utility
function. While the former approach is typically offline, the latter can be
performed online. We point out two major issues with such approaches when
applied to paratransit services in practice. First, it is difficult to batch
paratransit requests together as they are temporally sparse. Second, the
environment in which transit agencies operate changes dynamically (e.g.,
traffic conditions), causing estimates that are learned offline to become
stale. To address these challenges, we propose a fully online approach to solve
the dynamic vehicle routing problem (DVRP) with time windows and stochastic
trip requests that is robust to changing environmental dynamics by
construction. We focus on scenarios where requests are relatively sparse - our
problem is motivated by applications to paratransit services. We formulate DVRP
as a Markov decision process and use Monte Carlo tree search to evaluate
actions for any given state. Accounting for stochastic requests while
optimizing a non-myopic utility function is computationally challenging;
indeed, the action space for such a problem is intractably large in practice.
To tackle the large action space, we leverage the structure of the problem to
design heuristics that can sample promising actions for the tree search. Our
experiments using real-world data from our partner agency show that the
proposed approach outperforms existing state-of-the-art approaches both in
terms of performance and robustness.
Related papers
- A Meta-Engine Framework for Interleaved Task and Motion Planning using Topological Refinements [51.54559117314768]
Task And Motion Planning (TAMP) is the problem of finding a solution to an automated planning problem.
We propose a general and open-source framework for modeling and benchmarking TAMP problems.
We introduce an innovative meta-technique to solve TAMP problems involving moving agents and multiple task-state-dependent obstacles.
arXiv Detail & Related papers (2024-08-11T14:57:57Z) - A Reinforcement Learning Approach for Dynamic Rebalancing in
Bike-Sharing System [11.237099288412558]
Bike-Sharing Systems provide eco-friendly urban mobility, contributing to the alleviation of traffic congestion and healthier lifestyles.
Devising effective rebalancing strategies using vehicles to redistribute bikes among stations is therefore of uttermost importance for operators.
This paper introduces atemporal reinforcement learning algorithm for the dynamic rebalancing problem with multiple vehicles.
arXiv Detail & Related papers (2024-02-05T23:46:42Z) - Multi-Agent Deep Reinforcement Learning for Dynamic Avatar Migration in
AIoT-enabled Vehicular Metaverses with Trajectory Prediction [70.9337170201739]
We propose a model to predict the future trajectories of intelligent vehicles based on their historical data.
We show that our proposed algorithm can effectively reduce the latency of executing avatar tasks by around 25% without prediction.
arXiv Detail & Related papers (2023-06-26T13:27:11Z) - Combinatorial Optimization enriched Machine Learning to solve the
Dynamic Vehicle Routing Problem with Time Windows [5.4807970361321585]
We propose a novel machine learning pipeline that incorporates an optimization layer.
We apply this pipeline to a dynamic vehicle routing problem with waves, which was recently promoted in the EURO Meets NeurIPS Competition at NeurIPS 2022.
Our methodology ranked first in this competition, outperforming all other approaches in solving the proposed dynamic vehicle routing problem.
arXiv Detail & Related papers (2023-04-03T08:23:09Z) - Preference-Aware Delivery Planning for Last-Mile Logistics [3.04585143845864]
We propose a novel hierarchical route with learnable parameters that combines the strength of both the optimization and machine learning approaches.
By using a real-world delivery dataset provided by the Amazon Last Mile Research Challenge, we demonstrate the importance of having both the optimization and the machine learning components.
arXiv Detail & Related papers (2023-03-08T02:10:59Z) - Offline Vehicle Routing Problem with Online Bookings: A Novel Problem
Formulation with Applications to Paratransit [5.8521525578624916]
We introduce a novel formulation of the offline vehicle routing problem with online bookings.
This problem is very challenging computationally since it faces the complexity of considering large sets of requests.
We propose a novel computational approach, which combines an anytime algorithm with a learning-based policy for real-time decisions.
arXiv Detail & Related papers (2022-04-25T23:17:34Z) - Value Function is All You Need: A Unified Learning Framework for Ride
Hailing Platforms [57.21078336887961]
Large ride-hailing platforms, such as DiDi, Uber and Lyft, connect tens of thousands of vehicles in a city to millions of ride demands throughout the day.
We propose a unified value-based dynamic learning framework (V1D3) for tackling both tasks.
arXiv Detail & Related papers (2021-05-18T19:22:24Z) - Where the Action is: Let's make Reinforcement Learning for Stochastic
Dynamic Vehicle Routing Problems work! [0.0]
Demand for real-time, instant mobility and delivery services grows.
dynamic vehicle routing problems (SDVRPs) require anticipatory real-time routing actions.
For solving SDVRPs, joint work of both communities is needed.
arXiv Detail & Related papers (2021-02-28T13:26:35Z) - Multi-intersection Traffic Optimisation: A Benchmark Dataset and a
Strong Baseline [85.9210953301628]
Control of traffic signals is fundamental and critical to alleviate traffic congestion in urban areas.
Because of the high complexity of modelling the problem, experimental settings of current works are often inconsistent.
We propose a novel and strong baseline model based on deep reinforcement learning with the encoder-decoder structure.
arXiv Detail & Related papers (2021-01-24T03:55:39Z) - A Feedback Scheme to Reorder a Multi-Agent Execution Schedule by
Persistently Optimizing a Switchable Action Dependency Graph [65.70656676650391]
We consider multiple Automated Guided Vehicles (AGVs) navigating a common workspace to fulfill various intralogistics tasks.
One approach is to construct an Action Dependency Graph (ADG) which encodes the ordering of AGVs as they proceed along their routes.
If the workspace is shared by dynamic obstacles such as humans or third party robots, AGVs can experience large delays.
We present an online method to repeatedly modify acyclic ADG to minimize route completion times of each AGV.
arXiv Detail & Related papers (2020-10-11T14:39:50Z) - Multi-Agent Routing Value Iteration Network [88.38796921838203]
We propose a graph neural network based model that is able to perform multi-agent routing based on learned value in a sparsely connected graph.
We show that our model trained with only two agents on graphs with a maximum of 25 nodes can easily generalize to situations with more agents and/or nodes.
arXiv Detail & Related papers (2020-07-09T22:16:45Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.