Location-Routing Planning for Last-Mile Deliveries Using Mobile Parcel
Lockers: A Hybrid Q-Learning Network Approach
- URL: http://arxiv.org/abs/2209.04265v1
- Date: Fri, 9 Sep 2022 11:59:42 GMT
- Title: Location-Routing Planning for Last-Mile Deliveries Using Mobile Parcel
Lockers: A Hybrid Q-Learning Network Approach
- Authors: Yubin Liu, Qiming Ye, Jose Escribano-Macias, Yuxiang Feng, Panagiotis
Angeloudis
- Abstract summary: This study formulates the Mobile Parcel Locker Problem (MPLP)
MPLP determines the optimal stopover location for MPLs throughout the day and plans corresponding delivery routes.
A Hybrid Q-Learning-Network-based Method (HQM) is developed to resolve the computational complexity of the resulting large problem instances.
- Score: 1.856181262236876
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Mobile parcel lockers (MPLs) have been recently proposed by logistics
operators as a technology that could help reduce traffic congestion and
operational costs in urban freight distribution. Given their ability to
relocate throughout their area of deployment, they hold the potential to
improve customer accessibility and convenience. In this study, we formulate the
Mobile Parcel Locker Problem (MPLP), a special case of the Location-Routing
Problem (LRP) which determines the optimal stopover location for MPLs
throughout the day and plans corresponding delivery routes. A Hybrid
Q-Learning-Network-based Method (HQM) is developed to resolve the computational
complexity of the resulting large problem instances while escaping local
optima. In addition, the HQM is integrated with global and local search
mechanisms to resolve the dilemma of exploration and exploitation faced by
classic reinforcement learning (RL) methods. We examine the performance of HQM
under different problem sizes (up to 200 nodes) and benchmarked it against the
Genetic Algorithm (GA). Our results indicate that the average reward obtained
by HQM is 1.96 times greater than GA, which demonstrates that HQM has a better
optimisation ability. Finally, we identify critical factors that contribute to
fleet size requirements, travel distances, and service delays. Our findings
outline that the efficiency of MPLs is mainly contingent on the length of time
windows and the deployment of MPL stopovers.
Related papers
- A Graph-based Adversarial Imitation Learning Framework for Reliable & Realtime Fleet Scheduling in Urban Air Mobility [5.19664437943693]
This paper presents a comprehensive optimization formulation of the fleet scheduling problem.
It also identifies the need for alternate solution approaches.
The new imitative approach achieves better mean performance and remarkable improvement in the case of unseen worst-case scenarios.
arXiv Detail & Related papers (2024-07-16T18:51:24Z) - Decentralized Online Learning in Task Assignment Games for Mobile
Crowdsensing [55.07662765269297]
A mobile crowdsensing platform (MCSP) sequentially publishes sensing tasks to the available mobile units (MUs) that signal their willingness to participate in a task by sending sensing offers back to the MCSP.
A stable task assignment must address two challenges: the MCSP's and MUs' conflicting goals, and the uncertainty about the MUs' required efforts and preferences.
To overcome these challenges a novel decentralized approach combining matching theory and online learning, called collision-avoidance multi-armed bandit with strategic free sensing (CA-MAB-SFS) is proposed.
arXiv Detail & Related papers (2023-09-19T13:07:15Z) - Joint Service Caching, Communication and Computing Resource Allocation in Collaborative MEC Systems: A DRL-based Two-timescale Approach [15.16859210403316]
Meeting the strict Quality of Service (QoS) requirements of terminals has imposed a challenge on Multiaccess Edge Computing (MEC) systems.
We propose a collaborative framework that facilitates resource sharing between the edge servers.
We show that our proposed algorithm outperforms the baseline algorithms in terms of the average switching and cache cost.
arXiv Detail & Related papers (2023-07-19T00:27:49Z) - MARLIN: Soft Actor-Critic based Reinforcement Learning for Congestion
Control in Real Networks [63.24965775030673]
We propose a novel Reinforcement Learning (RL) approach to design generic Congestion Control (CC) algorithms.
Our solution, MARLIN, uses the Soft Actor-Critic algorithm to maximize both entropy and return.
We trained MARLIN on a real network with varying background traffic patterns to overcome the sim-to-real mismatch.
arXiv Detail & Related papers (2023-02-02T18:27:20Z) - Multi-Resource Allocation for On-Device Distributed Federated Learning
Systems [79.02994855744848]
This work poses a distributed multi-resource allocation scheme for minimizing the weighted sum of latency and energy consumption in the on-device distributed federated learning (FL) system.
Each mobile device in the system engages the model training process within the specified area and allocates its computation and communication resources for deriving and uploading parameters, respectively.
arXiv Detail & Related papers (2022-11-01T14:16:05Z) - DC-MRTA: Decentralized Multi-Robot Task Allocation and Navigation in
Complex Environments [55.204450019073036]
We present a novel reinforcement learning based task allocation and decentralized navigation algorithm for mobile robots in warehouse environments.
We consider the problem of joint decentralized task allocation and navigation and present a two level approach to solve it.
We observe improvement up to 14% in terms of task completion time and up-to 40% improvement in terms of computing collision-free trajectories for the robots.
arXiv Detail & Related papers (2022-09-07T00:35:27Z) - Fast Continuous and Integer L-shaped Heuristics Through Supervised
Learning [4.521119623956821]
We propose a methodology to accelerate the solution of mixed-integer linear two-stage programs.
We aim at solving problems where the second stage is highly demanding.
Our core idea is to gain large reductions in online solution time while incurring small reductions in first-stage solution accuracy.
arXiv Detail & Related papers (2022-05-02T13:15:32Z) - Collaborative Intelligent Reflecting Surface Networks with Multi-Agent
Reinforcement Learning [63.83425382922157]
Intelligent reflecting surface (IRS) is envisioned to be widely applied in future wireless networks.
In this paper, we investigate a multi-user communication system assisted by cooperative IRS devices with the capability of energy harvesting.
arXiv Detail & Related papers (2022-03-26T20:37:14Z) - Location-routing Optimisation for Urban Logistics Using Mobile Parcel
Locker Based on Hybrid Q-Learning Algorithm [0.0]
Parcel lockers (MPLs) have been introduced by urban logistics operators as a means to reduce traffic congestion and operational cost.
This paper proposes an integer programming model to solve the Location Routing Problem for MPLs.
arXiv Detail & Related papers (2021-10-29T01:27:12Z) - Combining Propositional Logic Based Decision Diagrams with Decision
Making in Urban Systems [10.781866671930851]
We tackle the problem of multiagent pathfinding under uncertainty and partial observability.
We use propositional logic and integrate them with the RL algorithms to enable fast simulation for RL.
arXiv Detail & Related papers (2020-11-09T13:13:54Z) - Reinforcement Learning Based Vehicle-cell Association Algorithm for
Highly Mobile Millimeter Wave Communication [53.47785498477648]
This paper investigates the problem of vehicle-cell association in millimeter wave (mmWave) communication networks.
We first formulate the user state (VU) problem as a discrete non-vehicle association optimization problem.
The proposed solution achieves up to 15% gains in terms sum of user complexity and 20% reduction in VUE compared to several baseline designs.
arXiv Detail & Related papers (2020-01-22T08:51:05Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.