Related papers: A Deep Reinforcement Learning Approach for the Meal Delivery Problem

A Deep Reinforcement Learning Approach for the Meal Delivery Problem

URL: http://arxiv.org/abs/2104.12000v1
Date: Sat, 24 Apr 2021 19:01:59 GMT
Title: A Deep Reinforcement Learning Approach for the Meal Delivery Problem
Authors: Hadi Jahanshahi, Aysun Bozanta, Mucahit Cevik, Eray Mert Kavuk, Ay\c{s}e Tosun, Sibel B. Sonuc, Bilgin Kosucu, Ay\c{s}e Ba\c{s}ar
Abstract summary: We consider a meal delivery service fulfilling dynamic customer requests given a set of couriers over the course of a day. We model this service as a Markov decision process and use deep reinforcement learning as the solution approach. Our results present valuable insights on both the courier assignment process and the optimal number of couriers for different order frequencies on a given day.
Score: 1.5391321019692434
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: We consider a meal delivery service fulfilling dynamic customer requests given a set of couriers over the course of a day. A courier's duty is to pick-up an order from a restaurant and deliver it to a customer. We model this service as a Markov decision process and use deep reinforcement learning as the solution approach. We experiment with the resulting policies on synthetic and real-world datasets and compare those with the baseline policies. We also examine the courier utilization for different numbers of couriers. In our analysis, we specifically focus on the impact of the limited available resources in the meal delivery problem. Furthermore, we investigate the effect of intelligent order rejection and re-positioning of the couriers. Our numerical experiments show that, by incorporating the geographical locations of the restaurants, customers, and the depot, our model significantly improves the overall service quality as characterized by the expected total reward and the delivery times. Our results present valuable insights on both the courier assignment process and the optimal number of couriers for different order frequencies on a given day. The proposed model also shows a robust performance under a variety of scenarios for real-world implementation.

Related papers

Learning to Estimate Package Delivery Time in Mixed Imbalanced Delivery and Pickup Logistics Services [12.270567592483888]
We propose TransPDT, a Transformer-based multi-task package delivery time prediction model. A system based on TransPDT is deployed internally in JD Logistics to track more than 2000 couriers handling hundreds of thousands of packages per day in Beijing.
arXiv Detail & Related papers (2025-05-01T08:00:22Z)
Real-Time Integrated Dispatching and Idle Fleet Steering with Deep Reinforcement Learning for A Meal Delivery Platform [0.0]
This study sets out to solve the real-time order dispatching and idle courier steering problems for a meal delivery platform. We propose a reinforcement learning (RL)-based strategic dual-control framework. We find the delivery efficiency and fairness of workload distribution among couriers have been improved.
arXiv Detail & Related papers (2025-01-10T09:15:40Z)
CoPS: Empowering LLM Agents with Provable Cross-Task Experience Sharing [70.25689961697523]
We propose a generalizable algorithm that enhances sequential reasoning by cross-task experience sharing and selection. Our work bridges the gap between existing sequential reasoning paradigms and validates the effectiveness of leveraging cross-task experiences.
arXiv Detail & Related papers (2024-10-22T03:59:53Z)
Dynamic Demand Management for Parcel Lockers [0.0]
We develop a solution framework that orchestrates algorithmic techniques rooted in Sequential Decision Analytics and Reinforcement Learning. Our innovative approach to combine these techniques enables us to address the strong interrelations between the two decision types. Our computational study shows that our method outperforms a myopic benchmark by 13.7% and an industry-inspired policy by 12.6%.
arXiv Detail & Related papers (2024-09-08T11:38:48Z)
The Restaurant Meal Delivery Problem with Ghost Kitchens [0.0]
"Ghost kitchens" proposes synchronized food preparation of several restaurants in a central complex. We propose operational strategies for the effective operations of ghost kitchens. We show that both integrated optimization of cook scheduling and dispatching vehicle, as well as anticipation of future demand and decisions, are essential for successful operations.
arXiv Detail & Related papers (2024-08-14T09:54:03Z)
Towards Fairness in Online Service with k Servers and its Application on Fair Food Delivery [6.729646573556134]
We introduce a realistic generalization of k- without its assumptions - the k-FOOD problem. The k-FOOD problem offers the versatility to model a variety of real-world use cases such as food delivery, ride sharing, and quick commerce. Motivated by the need for fairness in online platforms, we introduce the FAIR k-FOOD problem with the max-min objective.
arXiv Detail & Related papers (2023-12-18T15:22:03Z)
Algorithmic Persuasion Through Simulation [51.23082754429737]
We study a Bayesian persuasion game where a sender wants to persuade a receiver to take a binary action, such as purchasing a product. The sender is informed about the (binary) state of the world, such as whether the quality of the product is high or low, but only has limited information about the receiver's beliefs and utilities. Motivated by customer surveys, user studies, and recent advances in AI, we allow the sender to learn more about the receiver by querying an oracle that simulates the receiver's behavior.
arXiv Detail & Related papers (2023-11-29T23:01:33Z)
Beyond Greedy Search: Tracking by Multi-Agent Reinforcement Learning-based Beam Search [103.53249725360286]
Existing trackers usually select a location or proposal with the maximum score as tracking result for each frame. We propose a novel multi-agent reinforcement learning based beam search strategy (termed BeamTracking) to address this issue.
arXiv Detail & Related papers (2022-05-19T16:35:36Z)
Approaching sales forecasting using recurrent neural networks and transformers [57.43518732385863]
We develop three alternatives to tackle the problem of forecasting the customer sales at day/store/item level using deep learning techniques. Our empirical results show how good performance can be achieved by using a simple sequence to sequence architecture with minimal data preprocessing effort. The proposed solution achieves a RMSLE of around 0.54, which is competitive with other more specific solutions to the problem proposed in the Kaggle competition.
arXiv Detail & Related papers (2022-04-16T12:03:52Z)
Delivery Issues Identification from Customer Feedback Data [0.0]
This paper shows how to find these issues using customer feedback from the text comments and uploaded images. We used transfer learning for both Text and Image models to minimize the demand for thousands of labeled examples.
arXiv Detail & Related papers (2021-12-26T12:41:10Z)
Information Directed Reward Learning for Reinforcement Learning [64.33774245655401]
We learn a model of the reward function that allows standard RL algorithms to achieve high expected return with as few expert queries as possible. In contrast to prior active reward learning methods designed for specific types of queries, IDRL naturally accommodates different query types. We support our findings with extensive evaluations in multiple environments and with different types of queries.
arXiv Detail & Related papers (2021-02-24T18:46:42Z)
Fully-Automated Packaging Structure Recognition in Logistics Environments [60.56493342808093]
We propose a method for complete automation of packaging structure recognition. Our algorithm is based on deep learning models, more precisely convolutional neural networks for instance segmentation in images. We show that the solution is capable of correctly recognizing the packaging structure in approximately 85% of our test cases, and even more (91%) when focusing on most common package types.
arXiv Detail & Related papers (2020-08-11T10:57:23Z)
Same-Day Delivery with Fairness [5.904739807133708]
In 2016, certain minority neighborhoods were excluded from receiving Amazon's same-day delivery (SDD) service. In this paper, we study the problem of offering fair SDD-service to customers. We introduce a novel transformation of learning from rates to actual services, which creates a stable and efficient learning process.
arXiv Detail & Related papers (2020-07-19T00:25:09Z)

This list is automatically generated from the titles and abstracts of the papers in this site.