Deep Reinforcement Learning for Solving the Heterogeneous Capacitated
Vehicle Routing Problem
- URL: http://arxiv.org/abs/2110.02629v1
- Date: Wed, 6 Oct 2021 10:05:19 GMT
- Title: Deep Reinforcement Learning for Solving the Heterogeneous Capacitated
Vehicle Routing Problem
- Authors: Jingwen Li, Yining Ma, Ruize Gao, Zhiguang Cao, Andrew Lim, Wen Song,
Jie Zhang
- Abstract summary: Vehicles in real-world scenarios are likely to be heterogeneous with different characteristics that affect their capacity (or travel speed)
We propose a DRL method based on the attention mechanism with a vehicle selection decoder accounting for the heterogeneous fleet constraint and a node selection decoder accounting for the route construction, which learns to construct a solution by automatically selecting both a vehicle and a node for this vehicle at each step.
- Score: 13.389057146418056
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Existing deep reinforcement learning (DRL) based methods for solving the
capacitated vehicle routing problem (CVRP) intrinsically cope with homogeneous
vehicle fleet, in which the fleet is assumed as repetitions of a single
vehicle. Hence, their key to construct a solution solely lies in the selection
of the next node (customer) to visit excluding the selection of vehicle.
However, vehicles in real-world scenarios are likely to be heterogeneous with
different characteristics that affect their capacity (or travel speed),
rendering existing DRL methods less effective. In this paper, we tackle
heterogeneous CVRP (HCVRP), where vehicles are mainly characterized by
different capacities. We consider both min-max and min-sum objectives for
HCVRP, which aim to minimize the longest or total travel time of the vehicle(s)
in the fleet. To solve those problems, we propose a DRL method based on the
attention mechanism with a vehicle selection decoder accounting for the
heterogeneous fleet constraint and a node selection decoder accounting for the
route construction, which learns to construct a solution by automatically
selecting both a vehicle and a node for this vehicle at each step. Experimental
results based on randomly generated instances show that, with desirable
generalization to various problem sizes, our method outperforms the
state-of-the-art DRL method and most of the conventional heuristics, and also
delivers competitive performance against the state-of-the-art heuristic method,
i.e., SISR. Additionally, the results of extended experiments demonstrate that
our method is also able to solve CVRPLib instances with satisfactory
performance.
Related papers
- Learn to Solve Vehicle Routing Problems ASAP: A Neural Optimization Approach for Time-Constrained Vehicle Routing Problems with Finite Vehicle Fleet [0.0]
We propose an NCO approach to solve a time-constrained capacitated VRP with a finite vehicle fleet size.
The method is able to find adequate and cost-efficient solutions, showing both flexibility and robust generalizations.
arXiv Detail & Related papers (2024-11-07T15:16:36Z) - Towards Interactive and Learnable Cooperative Driving Automation: a Large Language Model-Driven Decision-Making Framework [79.088116316919]
Connected Autonomous Vehicles (CAVs) have begun to open road testing around the world, but their safety and efficiency performance in complex scenarios is still not satisfactory.
This paper proposes CoDrivingLLM, an interactive and learnable LLM-driven cooperative driving framework.
arXiv Detail & Related papers (2024-09-19T14:36:00Z) - SmartPathfinder: Pushing the Limits of Heuristic Solutions for Vehicle Routing Problem with Drones Using Reinforcement Learning [14.395184780210913]
Vehicle Problem with Drones (VRPD) seeks to optimize the routing paths for both trucks and drones.
We conduct a comprehensive examination of methods designed for solving VRPD, distilling and standardizing them into core elements.
We then develop a novel reinforcement learning framework that is integrated seamlessly with the solution components.
arXiv Detail & Related papers (2024-04-13T19:10:54Z) - Fair collaborative vehicle routing: A deep multi-agent reinforcement
learning approach [49.00137468773683]
Collaborative vehicle routing occurs when carriers collaborate through sharing their transportation requests and performing transportation requests on behalf of each other.
Traditional game theoretic solution concepts are expensive to calculate as the characteristic function scales exponentially with the number of agents.
We propose to model this problem as a coalitional bargaining game solved using deep multi-agent reinforcement learning.
arXiv Detail & Related papers (2023-10-26T15:42:29Z) - FastRLAP: A System for Learning High-Speed Driving via Deep RL and
Autonomous Practicing [71.76084256567599]
We present a system that enables an autonomous small-scale RC car to drive aggressively from visual observations using reinforcement learning (RL)
Our system, FastRLAP (faster lap), trains autonomously in the real world, without human interventions, and without requiring any simulation or expert demonstrations.
The resulting policies exhibit emergent aggressive driving skills, such as timing braking and acceleration around turns and avoiding areas which impede the robot's motion, approaching the performance of a human driver using a similar first-person interface over the course of training.
arXiv Detail & Related papers (2023-04-19T17:33:47Z) - Unified Automatic Control of Vehicular Systems with Reinforcement
Learning [64.63619662693068]
This article contributes a streamlined methodology for vehicular microsimulation.
It discovers high performance control strategies with minimal manual design.
The study reveals numerous emergent behaviors resembling wave mitigation, traffic signaling, and ramp metering.
arXiv Detail & Related papers (2022-07-30T16:23:45Z) - Supervised Permutation Invariant Networks for Solving the CVRP with
Bounded Fleet Size [3.5235974685889397]
Learning to solve optimization problems, such as the vehicle routing problem, offers great computational advantages.
We propose a powerful supervised deep learning framework that constructs a complete tour plan from scratch while respecting an apriori fixed number of vehicles.
In combination with an efficient post-processing scheme, our supervised approach is not only much faster and easier to train but also competitive results.
arXiv Detail & Related papers (2022-01-05T10:32:18Z) - Learning Vehicle Routing Problems using Policy Optimisation [4.093722933440819]
State-of-the-art approaches learn a policy using reinforcement learning, and the learnt policy acts as a pseudo solver.
These approaches have demonstrated good performance in some cases, but given the large search space typical of routing problem, they can converge too quickly to poor policy.
We propose entropy regularised reinforcement learning (ERRL) that supports exploration by providing more policies.
arXiv Detail & Related papers (2020-12-24T14:18:56Z) - Vehicular Cooperative Perception Through Action Branching and Federated
Reinforcement Learning [101.64598586454571]
A novel framework is proposed to allow reinforcement learning-based vehicular association, resource block (RB) allocation, and content selection of cooperative perception messages (CPMs)
A federated RL approach is introduced in order to speed up the training process across vehicles.
Results show that federated RL improves the training process, where better policies can be achieved within the same amount of time compared to the non-federated approach.
arXiv Detail & Related papers (2020-12-07T02:09:15Z) - A Quantum Annealing Approach for Dynamic Multi-Depot Capacitated Vehicle
Routing Problem [5.057312718525522]
This paper presents a quantum computing algorithm that works on the principle of Adiabatic Quantum Computation (AQC)
It has shown significant computational advantages in solving optimization problems such as vehicle routing problems (VRP) when compared to classical algorithms.
This is an NP-hard optimization problem with real-world applications in the fields of transportation, logistics, and supply chain management.
arXiv Detail & Related papers (2020-05-26T01:47:39Z) - Reinforcement Learning Based Vehicle-cell Association Algorithm for
Highly Mobile Millimeter Wave Communication [53.47785498477648]
This paper investigates the problem of vehicle-cell association in millimeter wave (mmWave) communication networks.
We first formulate the user state (VU) problem as a discrete non-vehicle association optimization problem.
The proposed solution achieves up to 15% gains in terms sum of user complexity and 20% reduction in VUE compared to several baseline designs.
arXiv Detail & Related papers (2020-01-22T08:51:05Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.