Location-routing Optimisation for Urban Logistics Using Mobile Parcel
Locker Based on Hybrid Q-Learning Algorithm
- URL: http://arxiv.org/abs/2110.15485v1
- Date: Fri, 29 Oct 2021 01:27:12 GMT
- Title: Location-routing Optimisation for Urban Logistics Using Mobile Parcel
Locker Based on Hybrid Q-Learning Algorithm
- Authors: Yubin Liu, Qiming Ye, Yuxiang Feng, Jose Escribano-Macias, Panagiotis
Angeloudis
- Abstract summary: Parcel lockers (MPLs) have been introduced by urban logistics operators as a means to reduce traffic congestion and operational cost.
This paper proposes an integer programming model to solve the Location Routing Problem for MPLs.
- Score: 0.0
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Mobile parcel lockers (MPLs) have been recently introduced by urban logistics
operators as a means to reduce traffic congestion and operational cost. Their
capability to relocate their position during the day has the potential to
improve customer accessibility and convenience (if deployed and planned
accordingly), allowing customers to collect parcels at their preferred time
among one of the multiple locations. This paper proposes an integer programming
model to solve the Location Routing Problem for MPLs to determine the optimal
configuration and locker routes. In solving this model, a Hybrid Q-Learning
algorithm-based Method (HQM) integrated with global and local search mechanisms
is developed, the performance of which is examined for different problem sizes
and benchmarked with genetic algorithms. Furthermore, we introduced two route
adjustment strategies to resolve stochastic events that may cause delays. The
results show that HQM achieves 443.41% improvement on average in solution
improvement, compared with the 94.91% improvement of heuristic counterparts,
suggesting HQM enables a more efficient search for better solutions. Finally,
we identify critical factors that contribute to service delays and investigate
their effects.
Related papers
- Research on Travel Route Planing Problems Based on Greedy Algorithm [0.0]
Route planning problem based on the greedy algorithm represents a method of identifying the optimal or near-optimal route between a given start point and end point.
In this paper, the PCA method is employed initially to downscale the city evaluation indexes, extract the key principal components, and then downscale the data.
A route planning algorithm is proposed and optimised based on the greedy algorithm, which provides personalised route customisation according to the different needs of tourists.
arXiv Detail & Related papers (2024-10-17T05:17:01Z) - Spatial-temporal-demand clustering for solving large-scale vehicle
routing problems with time windows [0.0]
We propose a decompose-route-improve (DRI) framework that groups customers using clustering.
Its similarity metric incorporates customers' spatial, temporal, and demand data.
We apply pruned local search (LS) between solved subproblems to improve the overall solution.
arXiv Detail & Related papers (2024-01-20T06:06:01Z) - MARLIN: Soft Actor-Critic based Reinforcement Learning for Congestion
Control in Real Networks [63.24965775030673]
We propose a novel Reinforcement Learning (RL) approach to design generic Congestion Control (CC) algorithms.
Our solution, MARLIN, uses the Soft Actor-Critic algorithm to maximize both entropy and return.
We trained MARLIN on a real network with varying background traffic patterns to overcome the sim-to-real mismatch.
arXiv Detail & Related papers (2023-02-02T18:27:20Z) - Learning to Optimize Permutation Flow Shop Scheduling via Graph-based
Imitation Learning [70.65666982566655]
Permutation flow shop scheduling (PFSS) is widely used in manufacturing systems.
We propose to train the model via expert-driven imitation learning, which accelerates convergence more stably and accurately.
Our model's network parameters are reduced to only 37% of theirs, and the solution gap of our model towards the expert solutions decreases from 6.8% to 1.3% on average.
arXiv Detail & Related papers (2022-10-31T09:46:26Z) - Location-Routing Planning for Last-Mile Deliveries Using Mobile Parcel
Lockers: A Hybrid Q-Learning Network Approach [1.856181262236876]
This study formulates the Mobile Parcel Locker Problem (MPLP)
MPLP determines the optimal stopover location for MPLs throughout the day and plans corresponding delivery routes.
A Hybrid Q-Learning-Network-based Method (HQM) is developed to resolve the computational complexity of the resulting large problem instances.
arXiv Detail & Related papers (2022-09-09T11:59:42Z) - DC-MRTA: Decentralized Multi-Robot Task Allocation and Navigation in
Complex Environments [55.204450019073036]
We present a novel reinforcement learning based task allocation and decentralized navigation algorithm for mobile robots in warehouse environments.
We consider the problem of joint decentralized task allocation and navigation and present a two level approach to solve it.
We observe improvement up to 14% in terms of task completion time and up-to 40% improvement in terms of computing collision-free trajectories for the robots.
arXiv Detail & Related papers (2022-09-07T00:35:27Z) - Off-line approximate dynamic programming for the vehicle routing problem
with stochastic customers and demands via decentralized decision-making [0.0]
This paper studies a variant of the vehicle routing problem (VRP) where both customer locations and demands are uncertain.
The objective is to maximize the served demands while fulfilling vehicle capacities and time restrictions.
We develop a Q-learning algorithm featuring state-of-the-art acceleration techniques such as Replay Memory and Double Q Network.
arXiv Detail & Related papers (2021-09-21T14:28:09Z) - Distributed Multi-agent Meta Learning for Trajectory Design in Wireless
Drone Networks [151.27147513363502]
This paper studies the problem of the trajectory design for a group of energyconstrained drones operating in dynamic wireless network environments.
A value based reinforcement learning (VDRL) solution and a metatraining mechanism is proposed.
arXiv Detail & Related papers (2020-12-06T01:30:12Z) - Combining Propositional Logic Based Decision Diagrams with Decision
Making in Urban Systems [10.781866671930851]
We tackle the problem of multiagent pathfinding under uncertainty and partial observability.
We use propositional logic and integrate them with the RL algorithms to enable fast simulation for RL.
arXiv Detail & Related papers (2020-11-09T13:13:54Z) - Combining Deep Learning and Optimization for Security-Constrained
Optimal Power Flow [94.24763814458686]
Security-constrained optimal power flow (SCOPF) is fundamental in power systems.
Modeling of APR within the SCOPF problem results in complex large-scale mixed-integer programs.
This paper proposes a novel approach that combines deep learning and robust optimization techniques.
arXiv Detail & Related papers (2020-07-14T12:38:21Z) - Reinforcement Learning Based Vehicle-cell Association Algorithm for
Highly Mobile Millimeter Wave Communication [53.47785498477648]
This paper investigates the problem of vehicle-cell association in millimeter wave (mmWave) communication networks.
We first formulate the user state (VU) problem as a discrete non-vehicle association optimization problem.
The proposed solution achieves up to 15% gains in terms sum of user complexity and 20% reduction in VUE compared to several baseline designs.
arXiv Detail & Related papers (2020-01-22T08:51:05Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.