IDEAS home Printed from https://ideas.repec.org/a/spr/joinma/v34y2023i1d10.1007_s10845-022-01982-5.html
   My bibliography  Save this article

Hybrid algorithm based on reinforcement learning for smart inventory management

Author

Listed:
  • Carlos Cuartas

    (Universidad EAFIT)

  • Jose Aguilar

    (Universidad EAFIT
    Universidad de Alcalá
    Universidad de Los Andes)

Abstract

This article proposes a hybrid algorithm based on reinforcement learning and the inventory management methodology called DDMRP (Demand Driven Material Requirement Planning) to determine the optimal time to buy a certain product, and how much quantity should be requested. For this, the inventory management problem is formulated as a Markov Decision Process where the environment with which the system interacts is designed from the concepts raised in the DDMRP methodology, and through the reinforcement learning algorithm—specifically, Q-Learning. The optimal policy is determined for making decisions about when and how much to buy. To determine the optimal policy, three approaches are proposed for the reward function: the first one is based on inventory levels; the second is an optimization function based on the distance of the inventory to its optimal level, and the third is a shaping function based on levels and distances to the optimal inventory. The results show that the proposed algorithm has promising results in scenarios with different characteristics, performing adequately in difficult case studies, with a diversity of situations such as scenarios with discontinuous or continuous demand, seasonal and non-seasonal behavior, and with high demand peaks, among others.

Suggested Citation

  • Carlos Cuartas & Jose Aguilar, 2023. "Hybrid algorithm based on reinforcement learning for smart inventory management," Journal of Intelligent Manufacturing, Springer, vol. 34(1), pages 123-149, January.
  • Handle: RePEc:spr:joinma:v:34:y:2023:i:1:d:10.1007_s10845-022-01982-5
    DOI: 10.1007/s10845-022-01982-5
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10845-022-01982-5
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s10845-022-01982-5?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Dhahri, Issam & Chabchoub, Habib, 2007. "Nonlinear goal programming models quantifying the bullwhip effect in supply chain based on ARIMA parameters," European Journal of Operational Research, Elsevier, vol. 177(3), pages 1800-1810, March.
    2. Giannoccaro, Ilaria & Pontrandolfo, Pierpaolo, 2002. "Inventory management in supply chains: a reinforcement learning approach," International Journal of Production Economics, Elsevier, vol. 78(2), pages 153-161, July.
    3. Olumide Emmanuel Oluyisola & Swapnil Bhalla & Fabio Sgarbossa & Jan Ola Strandhagen, 2022. "Designing and developing smart production planning and control systems in the industry 4.0 era: a methodology and case study," Journal of Intelligent Manufacturing, Springer, vol. 33(1), pages 311-332, January.
    4. Angela Patricia Velasco Acosta & Christian Mascle & Pierre Baptiste, 2020. "Applicability of Demand-Driven MRP in a complex manufacturing environment," International Journal of Production Research, Taylor & Francis Journals, vol. 58(14), pages 4233-4245, July.
    5. Thomy Eko Saputro & Gonçalo Figueira & Bernardo Almada-Lobo, 2021. "Integrating supplier selection with inventory management under supply disruptions," International Journal of Production Research, Taylor & Francis Journals, vol. 59(11), pages 3304-3322, June.
    6. Haiying Che & Zixing Bai & Rong Zuo & Honglei Li, 2020. "A Deep Reinforcement Learning Approach to the Optimization of Data Center Task Scheduling," Complexity, Hindawi, vol. 2020, pages 1-12, August.
    7. Sushil Punia & Konstantinos Nikolopoulos & Surya Prakash Singh & Jitendra K. Madaan & Konstantia Litsiou, 2020. "Deep learning with long short-term memory networks and random forests for demand forecasting in multi-channel retail," International Journal of Production Research, Taylor & Francis Journals, vol. 58(16), pages 4964-4979, July.
    8. Fei Zhang & Zailin Guan & Li Zhang & Yanyan Cui & Pengxing Yi & Saif Ullah, 2019. "Inventory management for a remanufacture-to-order production with multi-components (parts)," Journal of Intelligent Manufacturing, Springer, vol. 30(1), pages 59-78, January.
    9. Matthias Thürer & Nuno O. Fernandes & Mark Stevenson, 2022. "Production planning and control in multi-stage assembly systems: an assessment of Kanban, MRP, OPT (DBR) and DDMRP by simulation," International Journal of Production Research, Taylor & Francis Journals, vol. 60(3), pages 1036-1050, February.
    10. Edward A. Silver, 1981. "Operations Research in Inventory Management: A Review and Critique," Operations Research, INFORMS, vol. 29(4), pages 628-645, August.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Nuno Octávio Fernandes & Matthias Thürer & Sílvio Carmo Silva, 2024. "DDMRP relative priority for production execution: an assessment by simulation," Operations Management Research, Springer, vol. 17(4), pages 1244-1256, December.
    2. Mahmut Parlar & David Perry, 1996. "Inventory models of future supply uncertainty with single and multiple suppliers," Naval Research Logistics (NRL), John Wiley & Sons, vol. 43(2), pages 191-210, March.
    3. Lixiang Zhang & Yan Yan & Yaoguang Hu, 2024. "Deep reinforcement learning for dynamic scheduling of energy-efficient automated guided vehicles," Journal of Intelligent Manufacturing, Springer, vol. 35(8), pages 3875-3888, December.
    4. Schmid, Verena & Doerner, Karl F. & Laporte, Gilbert, 2013. "Rich routing problems arising in supply chain management," European Journal of Operational Research, Elsevier, vol. 224(3), pages 435-448.
    5. Visentin, Andrea & Prestwich, Steven & Rossi, Roberto & Tarim, S. Armagan, 2021. "Computing optimal (R,s,S) policy parameters by a hybrid of branch-and-bound and stochastic dynamic programming," European Journal of Operational Research, Elsevier, vol. 294(1), pages 91-99.
    6. Gurkan, M. Edib & Tunc, Huseyin & Tarim, S. Armagan, 2022. "The joint stochastic lot sizing and pricing problem," Omega, Elsevier, vol. 108(C).
    7. Broccardo, Laura & Tenucci, Andrea & Agarwal, Reeti & Alshibani, Safiya Mukhtar, 2024. "Steering digitalization and management control maturity in small and medium enterprises (SMEs)," Technological Forecasting and Social Change, Elsevier, vol. 204(C).
    8. Durdu Hakan Utku, 2023. "The Evaluation and Improvement of the Production Processes of an Automotive Industry Company via Simulation and Optimization," Sustainability, MDPI, vol. 15(3), pages 1-17, January.
    9. Naragain Phumchusri & Nichakan Phupaichitkun, 2024. "Sales prediction hybrid models for retails using promotional pricing strategy as a key demand driver," Journal of Revenue and Pricing Management, Palgrave Macmillan, vol. 23(5), pages 461-480, October.
    10. Fotuhi, Fateme & Huynh, Nathan & Vidal, Jose M. & Xie, Yuanchang, 2013. "Modeling yard crane operators as reinforcement learning agents," Research in Transportation Economics, Elsevier, vol. 42(1), pages 3-12.
    11. Kumar, Anupam & Evers, Philip T., 2015. "Setting safety stock based on imprecise records," International Journal of Production Economics, Elsevier, vol. 169(C), pages 68-75.
    12. Keun Hee Lee & Mali Abdollahian & Sergei Schreider & Sona Taheri, 2023. "Supply Chain Demand Forecasting and Price Optimisation Models with Substitution Effect," Mathematics, MDPI, vol. 11(11), pages 1-28, May.
    13. Xiaoling Zhang & Ke Zhang & Shibo Li & Damien Koenitz, 2023. "Effects of store fixture shape at retail checkout: Evidence from field and online studies," Production and Operations Management, Production and Operations Management Society, vol. 32(10), pages 3158-3173, October.
    14. Azadi, Majid & Yousefi, Saeed & Farzipoor Saen, Reza & Shabanpour, Hadi & Jabeen, Fauzia, 2023. "Forecasting sustainability of healthcare supply chains using deep learning and network data envelopment analysis," Journal of Business Research, Elsevier, vol. 154(C).
    15. Gel, Esma S. & Salman, F. Sibel, 2022. "Dynamic ordering decisions with approximate learning of supply yield uncertainty," International Journal of Production Economics, Elsevier, vol. 243(C).
    16. Hadar Amrani & Eugene Khmelnitsky, 2017. "Optimal division of inventory between depot and bases," Naval Research Logistics (NRL), John Wiley & Sons, vol. 64(1), pages 3-18, February.
    17. Dana Marsetiya Utama & Fakhrina Fahma Abdullah & Ikhlasul Amallynda & Teguh Baroto, 2024. "Integrated production-inventory model for multi-item raw materials with exponential quality degradation: a real case study," OPSEARCH, Springer;Operational Research Society of India, vol. 61(4), pages 1862-1887, December.
    18. P K Tripathy & W-M Wee & P R Majhi, 2003. "An EOQ model with process reliability considerations," Journal of the Operational Research Society, Palgrave Macmillan;The OR Society, vol. 54(5), pages 549-554, May.
    19. Parlar, Mahmut & Perry, David, 1995. "Analysis of a (Q, r, T) inventory policy with deterministic and random yields when future supply is uncertain," European Journal of Operational Research, Elsevier, vol. 84(2), pages 431-443, July.
    20. Bellantuono, Nicola & Giannoccaro, Ilaria & Pontrandolfo, Pierpaolo & Tang, Christopher S., 2009. "The implications of joint adoption of revenue sharing and advance booking discount programs," International Journal of Production Economics, Elsevier, vol. 121(2), pages 383-394, October.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:joinma:v:34:y:2023:i:1:d:10.1007_s10845-022-01982-5. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.