The Stochastic Dynamic Post-Disaster Inventory Allocation Problem with
Trucks and UAVs
- URL: http://arxiv.org/abs/2312.00140v1
- Date: Thu, 30 Nov 2023 19:03:04 GMT
- Title: The Stochastic Dynamic Post-Disaster Inventory Allocation Problem with
Trucks and UAVs
- Authors: Robert van Steenbergen, Wouter van Heeswijk, Martijn Mes
- Abstract summary: Humanitarian logistics operations face increasing difficulties due to rising demands for aid in disaster areas.
This paper investigates the dynamic allocation of scarce relief supplies across multiple affected districts over time.
It introduces a novel dynamic post-disaster inventory allocation problem with trucks and unmanned aerial vehicles delivering relief goods.
- Score: 1.3812010983144802
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Humanitarian logistics operations face increasing difficulties due to rising
demands for aid in disaster areas. This paper investigates the dynamic
allocation of scarce relief supplies across multiple affected districts over
time. It introduces a novel stochastic dynamic post-disaster inventory
allocation problem with trucks and unmanned aerial vehicles delivering relief
goods under uncertain supply and demand. The relevance of this humanitarian
logistics problem lies in the importance of considering the inter-temporal
social impact of deliveries. We achieve this by incorporating deprivation costs
when allocating scarce supplies. Furthermore, we consider the inherent
uncertainties of disaster areas and the potential use of cargo UAVs to enhance
operational efficiency. This study proposes two anticipatory solution methods
based on approximate dynamic programming, specifically decomposed linear value
function approximation and neural network value function approximation to
effectively manage uncertainties in the dynamic allocation process. We compare
DL-VFA and NN-VFA with various state-of-the-art methods (exact re-optimization,
PPO) and results show a 6-8% improvement compared to the best benchmarks.
NN-VFA provides the best performance and captures nonlinearities in the
problem, whereas DL-VFA shows excellent scalability against a minor performance
loss. The experiments reveal that consideration of deprivation costs results in
improved allocation of scarce supplies both across affected districts and over
time. Finally, results show that deploying UAVs can play a crucial role in the
allocation of relief goods, especially in the first stages after a disaster.
The use of UAVs reduces transportation- and deprivation costs together by
16-20% and reduces maximum deprivation times by 19-40%, while maintaining
similar levels of demand coverage, showcasing efficient and effective
operations.
Related papers
- ACE : Off-Policy Actor-Critic with Causality-Aware Entropy Regularization [52.5587113539404]
We introduce a causality-aware entropy term that effectively identifies and prioritizes actions with high potential impacts for efficient exploration.
Our proposed algorithm, ACE: Off-policy Actor-critic with Causality-aware Entropy regularization, demonstrates a substantial performance advantage across 29 diverse continuous control tasks.
arXiv Detail & Related papers (2024-02-22T13:22:06Z) - Posterior Sampling with Delayed Feedback for Reinforcement Learning with
Linear Function Approximation [62.969796245827006]
Delayed-PSVI is an optimistic value-based algorithm that explores the value function space via noise perturbation with posterior sampling.
We show our algorithm achieves $widetildeO(sqrtd3H3 T + d2H2 E[tau]$ worst-case regret in the presence of unknown delays.
We incorporate a gradient-based approximate sampling scheme via Langevin dynamics for Delayed-LPSVI.
arXiv Detail & Related papers (2023-10-29T06:12:43Z) - An Optimistic-Robust Approach for Dynamic Positioning of Omnichannel
Inventories [10.353243563465124]
We introduce a new class of data-driven optimistic-robust bimodal inventory optimization (BIO) strategy.
Our experiments show that significant benefits can be achieved by rethinking traditional approaches to inventory management.
arXiv Detail & Related papers (2023-10-17T23:10:57Z) - Integrated Sensing, Computation, and Communication for UAV-assisted
Federated Edge Learning [52.7230652428711]
Federated edge learning (FEEL) enables privacy-preserving model training through periodic communication between edge devices and the server.
Unmanned Aerial Vehicle (UAV)mounted edge devices are particularly advantageous for FEEL due to their flexibility and mobility in efficient data collection.
arXiv Detail & Related papers (2023-06-05T16:01:33Z) - Route Optimization via Environment-Aware Deep Network and Reinforcement
Learning [7.063811319445716]
We develop a mobile sequential recommendation system to maximize the profitability of vehicle service providers (e.g., taxi drivers)
A reinforcement-learning framework is proposed to tackle this problem, by integrating a self-check mechanism and a deep neural network for customer pick-up point monitoring.
Based on the yellow taxi data in New York City and vicinity before and after the COVID-19 outbreak, we have conducted comprehensive experiments to evaluate the effectiveness of our method.
arXiv Detail & Related papers (2021-11-16T02:19:13Z) - DROP: Deep relocating option policy for optimal ride-hailing vehicle
repositioning [36.31945021412277]
In a ride-hailing system, an optimal relocation of vacant vehicles can significantly reduce fleet idling time and balance the supply-demand distribution.
This study proposes the deep relocating option policy (DROP) that supervises vehicle agents to escape from oversupply areas.
We present a hierarchical learning framework that trains a high-level relocation policy and a set of low-level DROPs.
arXiv Detail & Related papers (2021-09-09T10:20:53Z) - Learning to Optimize Industry-Scale Dynamic Pickup and Delivery Problems [17.076557377480444]
The Dynamic Pickup and Delivery Problem (D PDP) is aimed at dynamically scheduling vehicles among multiple sites in order to minimize the cost when delivery orders are not known a priori.
We propose a data-driven approach, Spatial-Temporal Aided Double Deep Graph Network (ST-DDGN), to solve industry-scale D PDP.
Our method is entirely data driven and thus adaptive, i.e., the relational representation of adjacent vehicles can be learned and corrected by ST-DDGN from data periodically.
arXiv Detail & Related papers (2021-05-27T01:16:00Z) - Distributed CNN Inference on Resource-Constrained UAVs for Surveillance
Systems: Design and Optimization [43.9909417652678]
Unmanned Aerial Vehicles (UAVs) have attracted great interest in the last few years owing to their ability to cover large areas and access difficult and hazardous target zones.
Thanks to the advancements in computer vision and machine learning, UAVs are being adopted for a broad range of solutions and applications.
Deep Neural Networks (DNNs) are progressing toward deeper and complex models that prevent them from being executed on-board.
arXiv Detail & Related papers (2021-05-23T20:19:43Z) - Efficient UAV Trajectory-Planning using Economic Reinforcement Learning [65.91405908268662]
We introduce REPlanner, a novel reinforcement learning algorithm inspired by economic transactions to distribute tasks between UAVs.
We formulate the path planning problem as a multi-agent economic game, where agents can cooperate and compete for resources.
As the system computes task distributions via UAV cooperation, it is highly resilient to any change in the swarm size.
arXiv Detail & Related papers (2021-03-03T20:54:19Z) - SS-SFDA : Self-Supervised Source-Free Domain Adaptation for Road
Segmentation in Hazardous Environments [54.22535063244038]
We present a novel approach for unsupervised road segmentation in adverse weather conditions such as rain or fog.
This includes a new algorithm for source-free domain adaptation (SFDA) using self-supervised learning.
We have evaluated the performance on $6$ datasets corresponding to real and synthetic adverse weather conditions.
arXiv Detail & Related papers (2020-11-27T09:19:03Z) - Data Freshness and Energy-Efficient UAV Navigation Optimization: A Deep
Reinforcement Learning Approach [88.45509934702913]
We design a navigation policy for multiple unmanned aerial vehicles (UAVs) where mobile base stations (BSs) are deployed.
We incorporate different contextual information such as energy and age of information (AoI) constraints to ensure the data freshness at the ground BS.
By applying the proposed trained model, an effective real-time trajectory policy for the UAV-BSs captures the observable network states over time.
arXiv Detail & Related papers (2020-02-21T07:29:15Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.