Reinforcement Learning for Multi-Product Multi-Node Inventory Management
in Supply Chains
- URL: http://arxiv.org/abs/2006.04037v1
- Date: Sun, 7 Jun 2020 04:02:59 GMT
- Title: Reinforcement Learning for Multi-Product Multi-Node Inventory Management
in Supply Chains
- Authors: Nazneen N Sultana, Hardik Meisheri, Vinita Baniwal, Somjit Nath,
Balaraman Ravindran, Harshad Khadilkar
- Abstract summary: This paper describes the application of reinforcement learning (RL) to multi-product inventory management in supply chains.
Experiments show that the proposed approach is able to handle a multi-objective reward comprised of maximising product sales and minimising wastage of perishable products.
- Score: 17.260459603456745
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: This paper describes the application of reinforcement learning (RL) to
multi-product inventory management in supply chains. The problem description
and solution are both adapted from a real-world business solution. The novelty
of this problem with respect to supply chain literature is (i) we consider
concurrent inventory management of a large number (50 to 1000) of products with
shared capacity, (ii) we consider a multi-node supply chain consisting of a
warehouse which supplies three stores, (iii) the warehouse, stores, and
transportation from warehouse to stores have finite capacities, (iv) warehouse
and store replenishment happen at different time scales and with realistic time
lags, and (v) demand for products at the stores is stochastic. We describe a
novel formulation in a multi-agent (hierarchical) reinforcement learning
framework that can be used for parallelised decision-making, and use the
advantage actor critic (A2C) algorithm with quantised action spaces to solve
the problem. Experiments show that the proposed approach is able to handle a
multi-objective reward comprised of maximising product sales and minimising
wastage of perishable products.
Related papers
- Enhancing Supply Chain Visibility with Knowledge Graphs and Large Language Models [49.898152180805454]
This paper presents a novel framework leveraging Knowledge Graphs (KGs) and Large Language Models (LLMs) to enhance supply chain visibility.
Our zero-shot, LLM-driven approach automates the extraction of supply chain information from diverse public sources.
With high accuracy in NER and RE tasks, it provides an effective tool for understanding complex, multi-tiered supply networks.
arXiv Detail & Related papers (2024-08-05T17:11:29Z) - InvAgent: A Large Language Model based Multi-Agent System for Inventory Management in Supply Chains [0.0]
This study introduces a novel approach using large language models (LLMs) to manage multi-agent inventory systems.
Our model, InvAgent, enhances resilience and improves efficiency across the supply chain network.
arXiv Detail & Related papers (2024-07-16T04:55:17Z) - MARLIM: Multi-Agent Reinforcement Learning for Inventory Management [1.1470070927586016]
This paper presents a novel reinforcement learning framework called MARLIM to address the inventory management problem.
Within this context, controllers are developed through single or multiple agents in a cooperative setting.
Numerical experiments on real data demonstrate the benefits of reinforcement learning methods over traditional baselines.
arXiv Detail & Related papers (2023-08-03T09:31:45Z) - Cooperative Multi-Agent Reinforcement Learning for Inventory Management [0.5276232626689566]
Reinforcement Learning (RL) for inventory management is a nascent field of research.
We present a system with a custom GPU-parallelized environment that consists of one warehouse and multiple stores.
We achieve a system that outperforms standard inventory control policies.
arXiv Detail & Related papers (2023-04-18T06:55:59Z) - Multi-Agent Reinforcement Learning with Shared Resources for Inventory
Management [62.23979094308932]
In our setting, the constraint on the shared resources (such as the inventory capacity) couples the otherwise independent control for each SKU.
We formulate the problem with this structure as Shared-Resource Game (SRSG)and propose an efficient algorithm called Context-aware Decentralized PPO (CD-PPO)
Through extensive experiments, we demonstrate that CD-PPO can accelerate the learning procedure compared with standard MARL algorithms.
arXiv Detail & Related papers (2022-12-15T09:35:54Z) - No-Regret Learning in Two-Echelon Supply Chain with Unknown Demand
Distribution [48.27759561064771]
We consider the two-echelon supply chain model introduced in [Cachon and Zipkin, 1999] under two different settings.
We design algorithms that achieve favorable guarantees for both regret and convergence to the optimal inventory decision in both settings.
Our algorithms are based on Online Gradient Descent and Online Newton Step, together with several new ingredients specifically designed for our problem.
arXiv Detail & Related papers (2022-10-23T08:45:39Z) - A Simulation Environment and Reinforcement Learning Method for Waste
Reduction [50.545552995521774]
We study the problem of restocking a grocery store's inventory with perishable items over time, from a distributional point of view.
The objective is to maximize sales while minimizing waste, with uncertainty about the actual consumption by costumers.
We frame inventory restocking as a new reinforcement learning task that exhibits behavior conditioned on the agent's actions.
arXiv Detail & Related papers (2022-05-30T22:48:57Z) - Concepts and Algorithms for Agent-based Decentralized and Integrated
Scheduling of Production and Auxiliary Processes [78.120734120667]
This paper describes an agent-based decentralized and integrated scheduling approach.
Part of the requirements is to develop a linearly scaling communication architecture.
The approach is explained using an example based on industrial requirements.
arXiv Detail & Related papers (2022-05-06T18:44:29Z) - Comparing Deep Reinforcement Learning Algorithms in Two-Echelon Supply
Chains [1.4685355149711299]
We analyze and compare the performance of state-of-the-art deep reinforcement learning algorithms for solving the supply chain inventory management problem.
This study provides detailed insight into the design and development of an open-source software library that provides a customizable environment for solving the supply chain inventory management problem.
arXiv Detail & Related papers (2022-04-20T16:33:01Z) - Learning to Minimize Cost-to-Serve for Multi-Node Multi-Product Order
Fulfilment in Electronic Commerce [3.3865605512957457]
We find that the cost of delivery of products from the most node in the supply chain is a key challenge.
The large scale, highproblemity, and large geographical spread of e-commerce supply chains make this setting ideal for a carefully designed data-driven decision-making algorithm.
We show that a reinforcement learning based algorithm is competitive with these policies, with the potential of efficient scale-up in the real world.
arXiv Detail & Related papers (2021-12-16T09:42:40Z) - Product1M: Towards Weakly Supervised Instance-Level Product Retrieval
via Cross-modal Pretraining [108.86502855439774]
We investigate a more realistic setting that aims to perform weakly-supervised multi-modal instance-level product retrieval.
We contribute Product1M, one of the largest multi-modal cosmetic datasets for real-world instance-level retrieval.
We propose a novel model named Cross-modal contrAstive Product Transformer for instance-level prodUct REtrieval (CAPTURE)
arXiv Detail & Related papers (2021-07-30T12:11:24Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.