Reinforcement Learning for Multi-Product Multi-Node Inventory Management in Supply Chains

Sultana, Nazneen N; Meisheri, Hardik; Baniwal, Vinita; Nath, Somjit; Ravindran, Balaraman; Khadilkar, Harshad

Computer Science > Machine Learning

(Submitted on 7 Jun 2020)

Abstract: This paper describes the application of reinforcement learning (RL) to multi-product inventory management in supply chains. Both the problem description and the solution are adapted from a real-world business solution. The problem is novel with respect to the supply chain literature in five ways: (i) we consider concurrent inventory management of a large number (50 to 1000) of products with shared capacity; (ii) we consider a multi-node supply chain consisting of a warehouse that supplies three stores; (iii) the warehouse, the stores, and transportation from the warehouse to the stores all have finite capacities; (iv) warehouse and store replenishment happen at different time scales and with realistic time lags; and (v) demand for products at the stores is stochastic. We describe a novel formulation in a multi-agent (hierarchical) reinforcement learning framework that can be used for parallelised decision-making, and use the advantage actor-critic (A2C) algorithm with quantised action spaces to solve the problem. Experiments show that the proposed approach can handle a multi-objective reward that combines maximising product sales and minimising wastage of perishable products.
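
The abstract names two ingredients without giving their details: a quantised action space and a multi-objective reward that trades sales against wastage. The sketch below illustrates one way these ideas could look for a single product at a single store. It is not the authors' implementation: the five quantisation levels, the function names `replenishment_quantity` and `step_reward`, and the linear weighting of sales against wastage are all assumptions made for illustration.

```python
# Illustrative sketch only; the levels, names, and weights are assumptions,
# not values taken from the paper.

# Hypothetical quantisation: each discrete action is a fraction of the
# product's maximum shelf capacity to request as replenishment.
ACTION_LEVELS = (0.0, 0.25, 0.5, 0.75, 1.0)


def replenishment_quantity(action_index: int, max_shelf_capacity: float) -> float:
    """Map a discrete action index to a replenishment quantity."""
    return ACTION_LEVELS[action_index] * max_shelf_capacity


def step_reward(
    units_sold: float,
    units_wasted: float,
    sales_weight: float = 1.0,
    waste_weight: float = 1.0,
) -> float:
    """Combine two objectives into one scalar: reward sales, penalise wastage."""
    return sales_weight * units_sold - waste_weight * units_wasted
```

With a discrete action set like this, a policy network only needs to output a categorical distribution over a handful of levels per product, which is one common reason for quantising a continuous quantity. For example, `replenishment_quantity(2, 40.0)` requests half of a 40-unit shelf capacity, and raising `waste_weight` makes the agent more conservative about overstocking perishable products.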

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Multiagent Systems (cs.MA); Machine Learning (stat.ML)
Cite as:	arXiv:2006.04037 [cs.LG]
	(or arXiv:2006.04037v1 [cs.LG] for this version)

Submission history

From: Hardik Meisheri [view email]
[v1] Sun, 7 Jun 2020 04:02:59 GMT (5250kb,D)
