IDEAS home Printed from https://ideas.repec.org/a/eee/proeco/v268y2024ics0925527323003316.html

Combining deep reinforcement learning and multi-stage stochastic programming to address the supply chain inventory management problem

Author

Listed:
  • Stranieri, Francesco
  • Fadda, Edoardo
  • Stella, Fabio

Abstract

We introduce a novel heuristic designed to address the supply chain inventory management problem in the context of a two-echelon divergent supply chain. The proposed heuristic advances the current state-of-the-art by combining deep reinforcement learning with multi-stage stochastic programming. In particular, deep reinforcement learning is employed to determine the number of batches to produce, while multi-stage stochastic programming is applied to make shipping decisions. To support further research, we release a publicly available software environment that simulates a wide range of two-echelon divergent supply chain settings, allowing the manipulation of various parameter values, including those associated with seasonal demands. We then present a comprehensive set of numerical experiments considering constraints on production and warehouse capacities under fixed and variable logistic costs. The results demonstrate that the proposed heuristic significantly and consistently outperforms pure deep reinforcement learning algorithms in minimizing total costs. Moreover, it overcomes several inherent limitations of multi-stage stochastic programming models, thus underscoring its potential advantages in addressing complex supply chain scenarios.
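The division of labor described in the abstract (one component chooses how many batches to produce, another chooses shipments under demand uncertainty) can be illustrated with a toy sketch. This is not the authors' implementation, and every name and parameter below is hypothetical. The `production_policy` function is a simple base-stock rule standing in for a trained deep reinforcement learning policy. `choose_shipments` enumerates shipments against sampled one-period demand scenarios, which is a deliberately simplified stand-in for a multi-stage stochastic program.

```python
import itertools
import random


def sample_demand_scenarios(n_scenarios, n_warehouses, low, high, seed=0):
    """Draw integer demand scenarios, one demand value per warehouse (assumed uniform)."""
    rng = random.Random(seed)
    return [[rng.randint(low, high) for _ in range(n_warehouses)]
            for _ in range(n_scenarios)]


def production_policy(factory_stock, stock, batch_size, max_batches, target):
    """Stand-in for a trained DRL policy: produce enough batches to reach a target level."""
    gap = max(target - (factory_stock + sum(stock)), 0)
    batches = -(-gap // batch_size)  # ceiling division on non-negative integers
    return min(batches, max_batches)


def expected_cost(shipment, stock, scenarios, holding_cost, shortage_cost, ship_cost):
    """Shipping cost plus the scenario-averaged holding and shortage cost."""
    total = 0.0
    for demand in scenarios:
        for shipped, inventory, d in zip(shipment, stock, demand):
            level = inventory + shipped - d
            total += holding_cost * max(level, 0) + shortage_cost * max(-level, 0)
    return ship_cost * sum(shipment) + total / len(scenarios)


def choose_shipments(factory_stock, stock, capacity, scenarios,
                     holding_cost, shortage_cost, ship_cost):
    """Enumerate feasible integer shipments and keep the one with the lowest expected cost."""
    ranges = [range(0, max(0, min(factory_stock, cap - inv)) + 1)
              for inv, cap in zip(stock, capacity)]
    best, best_cost = None, float("inf")
    for shipment in itertools.product(*ranges):
        if sum(shipment) > factory_stock:  # cannot ship more than the factory holds
            continue
        cost = expected_cost(shipment, stock, scenarios,
                             holding_cost, shortage_cost, ship_cost)
        if cost < best_cost:
            best, best_cost = shipment, cost
    return best, best_cost


if __name__ == "__main__":
    warehouses = [2, 1]          # current stock at two warehouses
    factory = 4                  # current stock at the factory
    batches = production_policy(factory, warehouses, batch_size=5,
                                max_batches=3, target=20)
    factory += batches * 5       # production arrives before shipping in this toy model
    scenarios = sample_demand_scenarios(50, len(warehouses), low=0, high=8)
    plan, cost = choose_shipments(factory, warehouses, [10, 10], scenarios,
                                  holding_cost=1.0, shortage_cost=10.0, ship_cost=1.0)
    print(batches, plan, round(cost, 2))
```

Brute-force enumeration is only workable for very small instances; a realistic model would hand the shipping step to a mathematical programming solver and would look several periods ahead.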

Suggested Citation

  • Stranieri, Francesco & Fadda, Edoardo & Stella, Fabio, 2024. "Combining deep reinforcement learning and multi-stage stochastic programming to address the supply chain inventory management problem," International Journal of Production Economics, Elsevier, vol. 268(C).
  • Handle: RePEc:eee:proeco:v:268:y:2024:i:c:s0925527323003316
    DOI: 10.1016/j.ijpe.2023.109099

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0925527323003316
    Download Restriction: Full text for ScienceDirect subscribers only



    Citations

    Cited by:

    1. Winkelmann, Jonas & Spinler, Stefan & Neukirchen, Thomas, 2024. "Green transport fleet renewal using approximate dynamic programming: A case study in German heavy-duty road transportation," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 186(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. B. C. Giri & A. Chakraborty & T. Maiti, 2017. "Effectiveness of consignment stock policy in a three-level supply chain," Operational Research, Springer, vol. 17(1), pages 39-66, April.
    2. Palak, Gökçe & Ekşioğlu, Sandra Duni & Geunes, Joseph, 2014. "Analyzing the impacts of carbon regulatory mechanisms on supplier and mode selection decisions: An application to a biofuel supply chain," International Journal of Production Economics, Elsevier, vol. 154(C), pages 198-216.
    3. Battini, Daria & Persona, Alessandro & Sgarbossa, Fabio, 2014. "A sustainable EOQ model: Theoretical formulation and applications," International Journal of Production Economics, Elsevier, vol. 149(C), pages 145-153.
    4. Siao-Leu Phouratsamay & Safia Kedad-Sidhoum & Fanny Pascual, 2021. "Coordination of a two-level supply chain with contracts," 4OR, Springer, vol. 19(2), pages 235-264, June.
    5. Wolosewicz, Cathy & Dauzère-Pérès, Stéphane & Aggoune, Riad, 2015. "A Lagrangian heuristic for an integrated lot-sizing and fixed scheduling problem," European Journal of Operational Research, Elsevier, vol. 244(1), pages 3-12.
    6. Charles, Mehdi & Dauzère-Pérès, Stéphane & Kedad-Sidhoum, Safia & Mazhoud, Issam, 2022. "Motivations and analysis of the capacitated lot-sizing problem with setup times and minimum and maximum ending inventories," European Journal of Operational Research, Elsevier, vol. 302(1), pages 203-220.
    7. Minjiao Zhang & Simge Küçükyavuz & Saumya Goel, 2014. "A Branch-and-Cut Method for Dynamic Decision Making Under Joint Chance Constraints," Management Science, INFORMS, vol. 60(5), pages 1317-1333, May.
    8. Liu, Tieming, 2008. "Economic lot sizing problem with inventory bounds," European Journal of Operational Research, Elsevier, vol. 185(1), pages 204-215, February.
    9. Lee, Jinkyu & Bae, Sanghyeon & Kim, Woo Chang & Lee, Yongjae, 2023. "Value function gradient learning for large-scale multistage stochastic programming problems," European Journal of Operational Research, Elsevier, vol. 308(1), pages 321-335.
    10. Kaijie Zhu & Ulrich W. Thonemann, 2009. "Coordination of pricing and inventory control across products," Naval Research Logistics (NRL), John Wiley & Sons, vol. 56(2), pages 175-190, March.
    11. Chakrabarti, T. & Chaudhuri, K. S., 1997. "An EOQ model for deteriorating items with a linear trend in demand and shortages in all cycles," International Journal of Production Economics, Elsevier, vol. 49(3), pages 205-213, May.
    12. Rossi, Tommaso & Pozzi, Rossella & Testa, Mariapaola, 2017. "EOQ-based inventory management in single-machine multi-item systems," Omega, Elsevier, vol. 71(C), pages 106-113.
    13. Ba, Birome Holo & Prins, Christian & Prodhon, Caroline, 2016. "Models for optimization and performance evaluation of biomass supply chains: An Operations Research perspective," Renewable Energy, Elsevier, vol. 87(P2), pages 977-989.
    14. Timo Hilger & Florian Sahling & Horst Tempelmeier, 2016. "Capacitated dynamic production and remanufacturing planning under demand and return uncertainty," OR Spectrum: Quantitative Approaches in Management, Springer;Gesellschaft für Operations Research e.V., vol. 38(4), pages 849-876, October.
    15. Guerrero, W.J. & Prodhon, C. & Velasco, N. & Amaya, C.A., 2013. "Hybrid heuristic for the inventory location-routing problem with deterministic demand," International Journal of Production Economics, Elsevier, vol. 146(1), pages 359-370.
    16. Ming Zhao & Minjiao Zhang, 2020. "Multiechelon Lot Sizing: New Complexities and Inequalities," Operations Research, INFORMS, vol. 68(2), pages 534-551, March.
    17. Bouchery, Yann & Hezarkhani, Behzad & Stauffer, Gautier, 2022. "Coalition formation and cost sharing for truck platooning," Transportation Research Part B: Methodological, Elsevier, vol. 165(C), pages 15-34.
    18. Jenny Carolina Saldana Cortés, 2011. "Programación semidefinida aplicada a problemas de cantidad económica de pedido," Documentos CEDE 8735, Universidad de los Andes, Facultad de Economía, CEDE.
    19. Qiu, Ruozhen & Sun, Minghe & Lim, Yun Fong, 2017. "Optimizing (s, S) policies for multi-period inventory models with demand distribution uncertainty: Robust dynamic programing approaches," European Journal of Operational Research, Elsevier, vol. 261(3), pages 880-892.
    20. van den Heuvel, Wilco & Gutiérrez, José Miguel & Hwang, Hark-Chin, 2011. "Note on "An efficient approach for solving the lot-sizing problem with time-varying storage capacities"," European Journal of Operational Research, Elsevier, vol. 213(2), pages 455-457, September.
