
Combining deep reinforcement learning and multi-stage stochastic programming to address the supply chain inventory management problem

Authors

  • Stranieri, Francesco
  • Fadda, Edoardo
  • Stella, Fabio

Abstract

We introduce a novel heuristic designed to address the supply chain inventory management problem in the context of a two-echelon divergent supply chain. The proposed heuristic advances the current state-of-the-art by combining deep reinforcement learning with multi-stage stochastic programming. In particular, deep reinforcement learning is employed to determine the number of batches to produce, while multi-stage stochastic programming is applied to make shipping decisions. To support further research, we release a publicly available software environment that simulates a wide range of two-echelon divergent supply chain settings, allowing the manipulation of various parameter values, including those associated with seasonal demands. We then present a comprehensive set of numerical experiments considering constraints on production and warehouse capacities under fixed and variable logistic costs. The results demonstrate that the proposed heuristic significantly and consistently outperforms pure deep reinforcement learning algorithms in minimizing total costs. Moreover, it overcomes several inherent limitations of multi-stage stochastic programming models, thus underscoring its potential advantages in addressing complex supply chain scenarios.
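
As a rough illustration of the setting described in the abstract, the sketch below simulates one period of a two-echelon divergent supply chain: a single factory that produces in batches and ships to several warehouses. It is not the authors' released environment or their heuristic. The names `State` and `step`, the cost coefficients, the capacities, the batch size, and the choice to backlog unmet demand are all assumptions made for illustration. In the paper, the number of batches comes from a deep reinforcement learning policy and the shipments from a multi-stage stochastic programming model; here both are simply passed in as arguments.

```python
from dataclasses import dataclass


@dataclass
class State:
    factory: int      # units stored at the factory
    warehouses: list  # units at each warehouse (negative means backlog)


def step(state, batches, shipments, demands,
         batch_size=10, factory_cap=100, warehouse_cap=50,
         prod_cost=1.0, ship_cost=0.5, hold_cost=0.1, backlog_cost=2.0):
    """Advance the (hypothetical) supply chain by one period.

    batches   -- production decision (chosen by DRL in the paper)
    shipments -- units to ship to each warehouse (chosen by multi-stage
                 stochastic programming in the paper)
    demands   -- realized demand at each warehouse
    Returns the next state and the total cost incurred in this period.
    """
    # Production is capped by the storage space left at the factory.
    produced = max(0, min(batches * batch_size, factory_cap - state.factory))
    factory = state.factory + produced
    cost = prod_cost * produced

    new_warehouses = []
    for stock, ship, demand in zip(state.warehouses, shipments, demands):
        # A shipment cannot exceed factory stock or free warehouse space.
        ship = max(0, min(ship, factory, warehouse_cap - stock))
        factory -= ship
        stock = stock + ship - demand  # assumption: unmet demand is backlogged
        cost += ship_cost * ship
        cost += hold_cost * stock if stock >= 0 else backlog_cost * -stock
        new_warehouses.append(stock)

    cost += hold_cost * factory  # holding cost on what remains at the factory
    return State(factory, new_warehouses), cost


# Example: produce 2 batches, ship to two warehouses, observe demands.
next_state, cost = step(State(20, [5, 0]), batches=2,
                        shipments=[10, 15], demands=[8, 20])
print(next_state, cost)
```

Keeping the two decisions as separate arguments mirrors the split described in the abstract: any production policy and any shipping model can be plugged in and compared on total cost.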

Suggested Citation

  • Stranieri, Francesco & Fadda, Edoardo & Stella, Fabio, 2024. "Combining deep reinforcement learning and multi-stage stochastic programming to address the supply chain inventory management problem," International Journal of Production Economics, Elsevier, vol. 268(C).
  • Handle: RePEc:eee:proeco:v:268:y:2024:i:c:s0925527323003316
    DOI: 10.1016/j.ijpe.2023.109099

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0925527323003316
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.ijpe.2023.109099?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item


    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. B. C. Giri & A. Chakraborty & T. Maiti, 2017. "Effectiveness of consignment stock policy in a three-level supply chain," Operational Research, Springer, vol. 17(1), pages 39-66, April.
    2. Palak, Gökçe & Ekşioğlu, Sandra Duni & Geunes, Joseph, 2014. "Analyzing the impacts of carbon regulatory mechanisms on supplier and mode selection decisions: An application to a biofuel supply chain," International Journal of Production Economics, Elsevier, vol. 154(C), pages 198-216.
    3. Battini, Daria & Persona, Alessandro & Sgarbossa, Fabio, 2014. "A sustainable EOQ model: Theoretical formulation and applications," International Journal of Production Economics, Elsevier, vol. 149(C), pages 145-153.
    4. Wolosewicz, Cathy & Dauzère-Pérès, Stéphane & Aggoune, Riad, 2015. "A Lagrangian heuristic for an integrated lot-sizing and fixed scheduling problem," European Journal of Operational Research, Elsevier, vol. 244(1), pages 3-12.
    5. Charles, Mehdi & Dauzère-Pérès, Stéphane & Kedad-Sidhoum, Safia & Mazhoud, Issam, 2022. "Motivations and analysis of the capacitated lot-sizing problem with setup times and minimum and maximum ending inventories," European Journal of Operational Research, Elsevier, vol. 302(1), pages 203-220.
    6. Liu, Tieming, 2008. "Economic lot sizing problem with inventory bounds," European Journal of Operational Research, Elsevier, vol. 185(1), pages 204-215, February.
    7. Kaijie Zhu & Ulrich W. Thonemann, 2009. "Coordination of pricing and inventory control across products," Naval Research Logistics (NRL), John Wiley & Sons, vol. 56(2), pages 175-190, March.
    8. Chakrabarti, T. & Chaudhuri, K. S., 1997. "An EOQ model for deteriorating items with a linear trend in demand and shortages in all cycles," International Journal of Production Economics, Elsevier, vol. 49(3), pages 205-213, May.
    9. Ba, Birome Holo & Prins, Christian & Prodhon, Caroline, 2016. "Models for optimization and performance evaluation of biomass supply chains: An Operations Research perspective," Renewable Energy, Elsevier, vol. 87(P2), pages 977-989.
    10. Qiu, Ruozhen & Sun, Minghe & Lim, Yun Fong, 2017. "Optimizing (s, S) policies for multi-period inventory models with demand distribution uncertainty: Robust dynamic programing approaches," European Journal of Operational Research, Elsevier, vol. 261(3), pages 880-892.
    11. van den Heuvel, Wilco & Gutiérrez, José Miguel & Hwang, Hark-Chin, 2011. "Note on "An efficient approach for solving the lot-sizing problem with time-varying storage capacities"," European Journal of Operational Research, Elsevier, vol. 213(2), pages 455-457, September.
    12. Stan van Hoesel & H. Edwin Romeijn & Dolores Romero Morales & Albert P. M. Wagelmans, 2005. "Integrated Lot Sizing in Serial Supply Chains with Production Capacities," Management Science, INFORMS, vol. 51(11), pages 1706-1719, November.
    13. Sana, S. & Goyal, S. K. & Chaudhuri, K. S., 2004. "A production-inventory model for a deteriorating item with trended demand and shortages," European Journal of Operational Research, Elsevier, vol. 157(2), pages 357-371, September.
    14. Toy, Ayhan Özgür & Berk, Emre, 2013. "Dynamic lot sizing for a warm/cold process: Heuristics and insights," International Journal of Production Economics, Elsevier, vol. 145(1), pages 53-66.
    15. Tang, Lianhua & Li, Yantong & Bai, Danyu & Liu, Tao & Coelho, Leandro C., 2022. "Bi-objective optimization for a multi-period COVID-19 vaccination planning problem," Omega, Elsevier, vol. 110(C).
    16. Alawneh, Fawzat & Zhang, Guoqing, 2018. "Dual-channel warehouse and inventory management with stochastic demand," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 112(C), pages 84-106.
    17. Bo Dai & Fenfen Li, 2021. "Joint Inventory Replenishment Planning of an E-Commerce Distribution System with Distribution Centers at Producers’ Locations," Logistics, MDPI, vol. 5(3), pages 1-14, July.
    18. Melega, Gislaine Mara & de Araujo, Silvio Alexandre & Jans, Raf, 2018. "Classification and literature review of integrated lot-sizing and cutting stock problems," European Journal of Operational Research, Elsevier, vol. 271(1), pages 1-19.
    19. Schwartz, Jay D. & Rivera, Daniel E., 2010. "A process control approach to tactical inventory management in production-inventory systems," International Journal of Production Economics, Elsevier, vol. 125(1), pages 111-124, May.
    20. Fleischmann, Moritz & Bloemhof-Ruwaard, Jacqueline M. & Dekker, Rommert & van der Laan, Erwin & van Nunen, Jo A. E. E. & Van Wassenhove, Luk N., 1997. "Quantitative models for reverse logistics: A review," European Journal of Operational Research, Elsevier, vol. 103(1), pages 1-17, November.
