IDEAS home Printed from https://ideas.repec.org/a/spr/cejnor/v32y2024i3d10.1007_s10100-023-00872-2.html

Multi-echelon inventory optimization using deep reinforcement learning

Authors

  • Kevin Geevers (ORTEC B.V.)
  • Lotte Hezewijk (ORTEC B.V.; Eindhoven University of Technology)
  • Martijn R. K. Mes (University of Twente)

Abstract

This paper studies the applicability of a deep reinforcement learning approach to three different multi-echelon inventory systems, with the objective of minimizing the holding and backorder costs. First, we conduct an extensive literature review to map the current applications of reinforcement learning in multi-echelon inventory systems. Next, we apply our deep reinforcement learning method to three cases with different network structures (linear, divergent, and general structures). The linear and divergent cases are derived from literature, whereas the general case is based on a real-life manufacturer. We apply the proximal policy optimization (PPO) algorithm, with a continuous action space, and show that it consistently outperforms the benchmark solution. It achieves an average improvement of 16.4% for the linear case, 11.3% for the divergent case, and 6.6% for the general case. We explain the limitations of our approach and propose avenues for future research.
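To make the two technical ingredients named in the abstract concrete, the following is a minimal sketch in Python. It is not the authors' implementation: the function names, the cost rates, and the clipping parameter `eps=0.2` are illustrative assumptions. The first function computes a one-period holding-plus-backorder cost, treating a negative inventory level as a backorder. The second evaluates the standard clipped surrogate objective that PPO maximizes.

```python
from typing import Sequence


def period_cost(
    inventory_levels: Sequence[float],
    holding_costs: Sequence[float],
    backorder_costs: Sequence[float],
) -> float:
    """One-period cost summed over stock points (hypothetical helper).

    A positive inventory level incurs holding cost; a negative level
    is interpreted as backorders and incurs backorder cost.
    """
    total = 0.0
    for level, h, b in zip(inventory_levels, holding_costs, backorder_costs):
        total += h * max(level, 0.0) + b * max(-level, 0.0)
    return total


def ppo_clip_objective(
    ratios: Sequence[float],
    advantages: Sequence[float],
    eps: float = 0.2,  # assumed clipping range, not taken from the paper
) -> float:
    """Mean clipped surrogate objective of PPO.

    ratios[i] is pi_new(a|s) / pi_old(a|s) for sample i and
    advantages[i] is the corresponding advantage estimate.
    """
    terms = []
    for r, adv in zip(ratios, advantages):
        clipped_r = min(max(r, 1.0 - eps), 1.0 + eps)
        terms.append(min(r * adv, clipped_r * adv))
    return sum(terms) / len(terms)


if __name__ == "__main__":
    # Three stock points: 5 units on hand, 2 units backordered, and empty.
    print(period_cost([5, -2, 0], [1.0, 1.0, 1.0], [10.0, 10.0, 10.0]))
    print(ppo_clip_objective([1.5, 0.5], [1.0, -1.0]))
```

In a reinforcement learning setup of this kind, the negative of such a period cost would typically serve as the reward, so that maximizing return corresponds to minimizing holding and backorder costs.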

Suggested Citation

  • Kevin Geevers & Lotte Hezewijk & Martijn R. K. Mes, 2024. "Multi-echelon inventory optimization using deep reinforcement learning," Central European Journal of Operations Research, Springer;Slovak Society for Operations Research;Hungarian Operational Research Society;Czech Society for Operations Research;Österr. Gesellschaft für Operations Research (ÖGOR);Slovenian Society Informatika - Section for Operational Research;Croatian Operational Research Society, vol. 32(3), pages 653-683, September.
  • Handle: RePEc:spr:cejnor:v:32:y:2024:i:3:d:10.1007_s10100-023-00872-2
    DOI: 10.1007/s10100-023-00872-2


