Multi-Agent Learning of Efficient Fulfilment and Routing Strategies in
E-Commerce
- URL: http://arxiv.org/abs/2311.16171v1
- Date: Mon, 20 Nov 2023 10:32:28 GMT
- Title: Multi-Agent Learning of Efficient Fulfilment and Routing Strategies in
E-Commerce
- Authors: Omkar Shelke and Pranavi Pathakota and Anandsingh Chauhan and Harshad
Khadilkar and Hardik Meisheri and Balaraman Ravindran
- Abstract summary: paper presents an integrated algorithmic framework for minimising product delivery costs in e-commerce.
One of the major challenges in e-commerce is the large volume of-temporally diverse orders from multiple customers.
We propose an approach that combines graph neural networks and reinforcement learning to train the node selection and vehicle agents.
- Score: 11.421159751635667
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: This paper presents an integrated algorithmic framework for minimising
product delivery costs in e-commerce (known as the cost-to-serve or C2S). One
of the major challenges in e-commerce is the large volume of spatio-temporally
diverse orders from multiple customers, each of which has to be fulfilled from
one of several warehouses using a fleet of vehicles. This results in two levels
of decision-making: (i) selection of a fulfillment node for each order
(including the option of deferral to a future time), and then (ii) routing of
vehicles (each of which can carry multiple orders originating from the same
warehouse). We propose an approach that combines graph neural networks and
reinforcement learning to train the node selection and vehicle routing agents.
We include real-world constraints such as warehouse inventory capacity, vehicle
characteristics such as travel times, service times, carrying capacity, and
customer constraints including time windows for delivery. The complexity of
this problem arises from the fact that outcomes (rewards) are driven both by
the fulfillment node mapping as well as the routing algorithms, and are
spatio-temporally distributed. Our experiments show that this algorithmic
pipeline outperforms pure heuristic policies.
Related papers
- Deep Reinforcement Learning for Traveling Purchaser Problems [63.37136587778153]
The traveling purchaser problem (TPP) is an important optimization problem with broad applications.
We propose a novel approach based on deep reinforcement learning (DRL), which addresses route construction and purchase planning separately.
By introducing a meta-learning strategy, the policy network can be trained stably on large-sized TPP instances.
arXiv Detail & Related papers (2024-04-03T05:32:10Z) - AutoAct: Automatic Agent Learning from Scratch for QA via Self-Planning [54.47116888545878]
AutoAct is an automatic agent learning framework for QA.
It does not rely on large-scale annotated data and synthetic planning trajectories from closed-source models.
arXiv Detail & Related papers (2024-01-10T16:57:24Z) - Modeling routing problems in QUBO with application to ride-hailing [0.0]
We focus on one such routing problem, the Ride Pooling Problem (RPP), where multiple customers can request on-demand pickups and drop-offs from shared vehicles within a fleet.
The task is to optimally pool customer requests using the limited set of vehicles, akin to a small-scale flexible bus route.
arXiv Detail & Related papers (2022-12-09T14:55:34Z) - No-Regret Learning in Two-Echelon Supply Chain with Unknown Demand
Distribution [48.27759561064771]
We consider the two-echelon supply chain model introduced in [Cachon and Zipkin, 1999] under two different settings.
We design algorithms that achieve favorable guarantees for both regret and convergence to the optimal inventory decision in both settings.
Our algorithms are based on Online Gradient Descent and Online Newton Step, together with several new ingredients specifically designed for our problem.
arXiv Detail & Related papers (2022-10-23T08:45:39Z) - DC-MRTA: Decentralized Multi-Robot Task Allocation and Navigation in
Complex Environments [55.204450019073036]
We present a novel reinforcement learning based task allocation and decentralized navigation algorithm for mobile robots in warehouse environments.
We consider the problem of joint decentralized task allocation and navigation and present a two level approach to solve it.
We observe improvement up to 14% in terms of task completion time and up-to 40% improvement in terms of computing collision-free trajectories for the robots.
arXiv Detail & Related papers (2022-09-07T00:35:27Z) - Concepts and Algorithms for Agent-based Decentralized and Integrated
Scheduling of Production and Auxiliary Processes [78.120734120667]
This paper describes an agent-based decentralized and integrated scheduling approach.
Part of the requirements is to develop a linearly scaling communication architecture.
The approach is explained using an example based on industrial requirements.
arXiv Detail & Related papers (2022-05-06T18:44:29Z) - Learning to Minimize Cost-to-Serve for Multi-Node Multi-Product Order
Fulfilment in Electronic Commerce [3.3865605512957457]
We find that the cost of delivery of products from the most node in the supply chain is a key challenge.
The large scale, highproblemity, and large geographical spread of e-commerce supply chains make this setting ideal for a carefully designed data-driven decision-making algorithm.
We show that a reinforcement learning based algorithm is competitive with these policies, with the potential of efficient scale-up in the real world.
arXiv Detail & Related papers (2021-12-16T09:42:40Z) - DeepFreight: Integrating Deep Reinforcement Learning and Mixed Integer
Programming for Multi-transfer Truck Freight Delivery [38.04321619061474]
DeepFreight is a model-free deep-reinforcement-learning-based algorithm for multi-transfer freight delivery.
The proposed system is highly scalable and ensures a 100% delivery success while maintaining low delivery-time and fuel consumption.
arXiv Detail & Related papers (2021-03-05T03:06:48Z) - Formulating and solving integrated order batching and routing in
multi-depot AGV-assisted mixed-shelves warehouses [1.2117737635879038]
This paper proposes a mixed-shelves storage policy and AGV-assisted mixed-shelves picking systems.
We develop a variable neighborhood search algorithm to solve the integrated problem more efficiently.
We conclude that the mixed-shelves storage policy is more suitable than the usual storage policy in AGV-assisted mixed-shelves systems for both single-line and multiline orders.
arXiv Detail & Related papers (2021-01-27T15:04:05Z) - A Multi-Agent System for Solving the Dynamic Capacitated Vehicle Routing
Problem with Stochastic Customers using Trajectory Data Mining [0.0]
E-commerce has created new challenges for logistics companies, one of which is being able to deliver products quickly and at low cost.
Our work presents a multi-agent system that uses trajectory data mining techniques to extract territorial patterns and use them in the dynamic creation of last-mile routes.
arXiv Detail & Related papers (2020-09-26T21:36:35Z) - Dynamic Multi-Robot Task Allocation under Uncertainty and Temporal
Constraints [52.58352707495122]
We present a multi-robot allocation algorithm that decouples the key computational challenges of sequential decision-making under uncertainty and multi-agent coordination.
We validate our results over a wide range of simulations on two distinct domains: multi-arm conveyor belt pick-and-place and multi-drone delivery dispatch in a city.
arXiv Detail & Related papers (2020-05-27T01:10:41Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.