A Deep Reinforcement Learning Approach for the Meal Delivery Problem
- URL: http://arxiv.org/abs/2104.12000v1
- Date: Sat, 24 Apr 2021 19:01:59 GMT
- Title: A Deep Reinforcement Learning Approach for the Meal Delivery Problem
- Authors: Hadi Jahanshahi, Aysun Bozanta, Mucahit Cevik, Eray Mert Kavuk,
Ay\c{s}e Tosun, Sibel B. Sonuc, Bilgin Kosucu, Ay\c{s}e Ba\c{s}ar
- Abstract summary: We consider a meal delivery service fulfilling dynamic customer requests given a set of couriers over the course of a day.
We model this service as a Markov decision process and use deep reinforcement learning as the solution approach.
Our results present valuable insights on both the courier assignment process and the optimal number of couriers for different order frequencies on a given day.
- Score: 1.5391321019692434
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We consider a meal delivery service fulfilling dynamic customer requests
given a set of couriers over the course of a day. A courier's duty is to
pick-up an order from a restaurant and deliver it to a customer. We model this
service as a Markov decision process and use deep reinforcement learning as the
solution approach. We experiment with the resulting policies on synthetic and
real-world datasets and compare those with the baseline policies. We also
examine the courier utilization for different numbers of couriers. In our
analysis, we specifically focus on the impact of the limited available
resources in the meal delivery problem. Furthermore, we investigate the effect
of intelligent order rejection and re-positioning of the couriers. Our
numerical experiments show that, by incorporating the geographical locations of
the restaurants, customers, and the depot, our model significantly improves the
overall service quality as characterized by the expected total reward and the
delivery times. Our results present valuable insights on both the courier
assignment process and the optimal number of couriers for different order
frequencies on a given day. The proposed model also shows a robust performance
under a variety of scenarios for real-world implementation.
Related papers
- CoPS: Empowering LLM Agents with Provable Cross-Task Experience Sharing [70.25689961697523]
We propose a generalizable algorithm that enhances sequential reasoning by cross-task experience sharing and selection.
Our work bridges the gap between existing sequential reasoning paradigms and validates the effectiveness of leveraging cross-task experiences.
arXiv Detail & Related papers (2024-10-22T03:59:53Z) - Dynamic Demand Management for Parcel Lockers [0.0]
We develop a solution framework that orchestrates algorithmic techniques rooted in Sequential Decision Analytics and Reinforcement Learning.
Our innovative approach to combine these techniques enables us to address the strong interrelations between the two decision types.
Our computational study shows that our method outperforms a myopic benchmark by 13.7% and an industry-inspired policy by 12.6%.
arXiv Detail & Related papers (2024-09-08T11:38:48Z) - The Restaurant Meal Delivery Problem with Ghost Kitchens [0.0]
"Ghost kitchens" proposes synchronized food preparation of several restaurants in a central complex.
We propose operational strategies for the effective operations of ghost kitchens.
We show that both integrated optimization of cook scheduling and dispatching vehicle, as well as anticipation of future demand and decisions, are essential for successful operations.
arXiv Detail & Related papers (2024-08-14T09:54:03Z) - Towards Fairness in Online Service with k Servers and its Application on
Fair Food Delivery [6.729646573556134]
We introduce a realistic generalization of k- without its assumptions - the k-FOOD problem.
The k-FOOD problem offers the versatility to model a variety of real-world use cases such as food delivery, ride sharing, and quick commerce.
Motivated by the need for fairness in online platforms, we introduce the FAIR k-FOOD problem with the max-min objective.
arXiv Detail & Related papers (2023-12-18T15:22:03Z) - Algorithmic Persuasion Through Simulation [51.23082754429737]
We study a Bayesian persuasion game where a sender wants to persuade a receiver to take a binary action, such as purchasing a product.
The sender is informed about the (binary) state of the world, such as whether the quality of the product is high or low, but only has limited information about the receiver's beliefs and utilities.
Motivated by customer surveys, user studies, and recent advances in AI, we allow the sender to learn more about the receiver by querying an oracle that simulates the receiver's behavior.
arXiv Detail & Related papers (2023-11-29T23:01:33Z) - Beyond Greedy Search: Tracking by Multi-Agent Reinforcement
Learning-based Beam Search [103.53249725360286]
Existing trackers usually select a location or proposal with the maximum score as tracking result for each frame.
We propose a novel multi-agent reinforcement learning based beam search strategy (termed BeamTracking) to address this issue.
arXiv Detail & Related papers (2022-05-19T16:35:36Z) - Approaching sales forecasting using recurrent neural networks and
transformers [57.43518732385863]
We develop three alternatives to tackle the problem of forecasting the customer sales at day/store/item level using deep learning techniques.
Our empirical results show how good performance can be achieved by using a simple sequence to sequence architecture with minimal data preprocessing effort.
The proposed solution achieves a RMSLE of around 0.54, which is competitive with other more specific solutions to the problem proposed in the Kaggle competition.
arXiv Detail & Related papers (2022-04-16T12:03:52Z) - Delivery Issues Identification from Customer Feedback Data [0.0]
This paper shows how to find these issues using customer feedback from the text comments and uploaded images.
We used transfer learning for both Text and Image models to minimize the demand for thousands of labeled examples.
arXiv Detail & Related papers (2021-12-26T12:41:10Z) - Information Directed Reward Learning for Reinforcement Learning [64.33774245655401]
We learn a model of the reward function that allows standard RL algorithms to achieve high expected return with as few expert queries as possible.
In contrast to prior active reward learning methods designed for specific types of queries, IDRL naturally accommodates different query types.
We support our findings with extensive evaluations in multiple environments and with different types of queries.
arXiv Detail & Related papers (2021-02-24T18:46:42Z) - Fully-Automated Packaging Structure Recognition in Logistics
Environments [60.56493342808093]
We propose a method for complete automation of packaging structure recognition.
Our algorithm is based on deep learning models, more precisely convolutional neural networks for instance segmentation in images.
We show that the solution is capable of correctly recognizing the packaging structure in approximately 85% of our test cases, and even more (91%) when focusing on most common package types.
arXiv Detail & Related papers (2020-08-11T10:57:23Z) - Same-Day Delivery with Fairness [5.904739807133708]
In 2016, certain minority neighborhoods were excluded from receiving Amazon's same-day delivery (SDD) service.
In this paper, we study the problem of offering fair SDD-service to customers.
We introduce a novel transformation of learning from rates to actual services, which creates a stable and efficient learning process.
arXiv Detail & Related papers (2020-07-19T00:25:09Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.