IDEAS home Printed from https://ideas.repec.org/a/spr/joinma/v34y2023i1d10.1007_s10845-022-01982-5.html
   My bibliography  Save this article

Hybrid algorithm based on reinforcement learning for smart inventory management

Author

Listed:
  • Carlos Cuartas

    (Universidad EAFIT)

  • Jose Aguilar

    (Universidad EAFIT
    Universidad de Alcalá
    Universidad de Los Andes)

Abstract

This article proposes a hybrid algorithm based on reinforcement learning and the inventory management methodology called DDMRP (Demand Driven Material Requirement Planning) to determine the optimal time to buy a certain product, and how much quantity should be requested. For this, the inventory management problem is formulated as a Markov Decision Process where the environment with which the system interacts is designed from the concepts raised in the DDMRP methodology, and through the reinforcement learning algorithm—specifically, Q-Learning. The optimal policy is determined for making decisions about when and how much to buy. To determine the optimal policy, three approaches are proposed for the reward function: the first one is based on inventory levels; the second is an optimization function based on the distance of the inventory to its optimal level, and the third is a shaping function based on levels and distances to the optimal inventory. The results show that the proposed algorithm has promising results in scenarios with different characteristics, performing adequately in difficult case studies, with a diversity of situations such as scenarios with discontinuous or continuous demand, seasonal and non-seasonal behavior, and with high demand peaks, among others.

Suggested Citation

  • Carlos Cuartas & Jose Aguilar, 2023. "Hybrid algorithm based on reinforcement learning for smart inventory management," Journal of Intelligent Manufacturing, Springer, vol. 34(1), pages 123-149, January.
  • Handle: RePEc:spr:joinma:v:34:y:2023:i:1:d:10.1007_s10845-022-01982-5
    DOI: 10.1007/s10845-022-01982-5
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10845-022-01982-5
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s10845-022-01982-5?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Dhahri, Issam & Chabchoub, Habib, 2007. "Nonlinear goal programming models quantifying the bullwhip effect in supply chain based on ARIMA parameters," European Journal of Operational Research, Elsevier, vol. 177(3), pages 1800-1810, March.
    2. Giannoccaro, Ilaria & Pontrandolfo, Pierpaolo, 2002. "Inventory management in supply chains: a reinforcement learning approach," International Journal of Production Economics, Elsevier, vol. 78(2), pages 153-161, July.
    3. Olumide Emmanuel Oluyisola & Swapnil Bhalla & Fabio Sgarbossa & Jan Ola Strandhagen, 2022. "Designing and developing smart production planning and control systems in the industry 4.0 era: a methodology and case study," Journal of Intelligent Manufacturing, Springer, vol. 33(1), pages 311-332, January.
    4. Angela Patricia Velasco Acosta & Christian Mascle & Pierre Baptiste, 2020. "Applicability of Demand-Driven MRP in a complex manufacturing environment," International Journal of Production Research, Taylor & Francis Journals, vol. 58(14), pages 4233-4245, July.
    5. Thomy Eko Saputro & Gonçalo Figueira & Bernardo Almada-Lobo, 2021. "Integrating supplier selection with inventory management under supply disruptions," International Journal of Production Research, Taylor & Francis Journals, vol. 59(11), pages 3304-3322, June.
    6. Haiying Che & Zixing Bai & Rong Zuo & Honglei Li, 2020. "A Deep Reinforcement Learning Approach to the Optimization of Data Center Task Scheduling," Complexity, Hindawi, vol. 2020, pages 1-12, August.
    7. Sushil Punia & Konstantinos Nikolopoulos & Surya Prakash Singh & Jitendra K. Madaan & Konstantia Litsiou, 2020. "Deep learning with long short-term memory networks and random forests for demand forecasting in multi-channel retail," International Journal of Production Research, Taylor & Francis Journals, vol. 58(16), pages 4964-4979, July.
    8. Fei Zhang & Zailin Guan & Li Zhang & Yanyan Cui & Pengxing Yi & Saif Ullah, 2019. "Inventory management for a remanufacture-to-order production with multi-components (parts)," Journal of Intelligent Manufacturing, Springer, vol. 30(1), pages 59-78, January.
    9. Matthias Thürer & Nuno O. Fernandes & Mark Stevenson, 2022. "Production planning and control in multi-stage assembly systems: an assessment of Kanban, MRP, OPT (DBR) and DDMRP by simulation," International Journal of Production Research, Taylor & Francis Journals, vol. 60(3), pages 1036-1050, February.
    10. Edward A. Silver, 1981. "Operations Research in Inventory Management: A Review and Critique," Operations Research, INFORMS, vol. 29(4), pages 628-645, August.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Schmid, Verena & Doerner, Karl F. & Laporte, Gilbert, 2013. "Rich routing problems arising in supply chain management," European Journal of Operational Research, Elsevier, vol. 224(3), pages 435-448.
    2. Broccardo, Laura & Tenucci, Andrea & Agarwal, Reeti & Alshibani, Safiya Mukhtar, 2024. "Steering digitalization and management control maturity in small and medium enterprises (SMEs)," Technological Forecasting and Social Change, Elsevier, vol. 204(C).
    3. Naragain Phumchusri & Nichakan Phupaichitkun, 2024. "Sales prediction hybrid models for retails using promotional pricing strategy as a key demand driver," Journal of Revenue and Pricing Management, Palgrave Macmillan, vol. 23(5), pages 461-480, October.
    4. Kumar, Anupam & Evers, Philip T., 2015. "Setting safety stock based on imprecise records," International Journal of Production Economics, Elsevier, vol. 169(C), pages 68-75.
    5. Gel, Esma S. & Salman, F. Sibel, 2022. "Dynamic ordering decisions with approximate learning of supply yield uncertainty," International Journal of Production Economics, Elsevier, vol. 243(C).
    6. Parlar, Mahmut & Perry, David, 1995. "Analysis of a (Q, r, T) inventory policy with deterministic and random yields when future supply is uncertain," European Journal of Operational Research, Elsevier, vol. 84(2), pages 431-443, July.
    7. Monthatipkul, Chumpol & Yenradee, Pisal, 2008. "Inventory/distribution control system in a one-warehouse/multi-retailer supply chain," International Journal of Production Economics, Elsevier, vol. 114(1), pages 119-133, July.
    8. Sandun C. Perera & Suresh P. Sethi, 2023. "A survey of stochastic inventory models with fixed costs: Optimality of (s, S) and (s, S)‐type policies—Discrete‐time case," Production and Operations Management, Production and Operations Management Society, vol. 32(1), pages 131-153, January.
    9. Patanjal Kumar & Sachin Kumar Mangla & Yigit Kazancoglu & Ali Emrouznejad, 2023. "A decision framework for incorporating the coordination and behavioural issues in sustainable supply chains in digital economy," Annals of Operations Research, Springer, vol. 326(2), pages 721-749, July.
    10. Hsing-Chun Hung & Yuh-Wen Chen, 2023. "Striving to Achieve United Nations Sustainable Development Goals of Taiwanese SMEs by Adopting Industry 4.0," Sustainability, MDPI, vol. 15(3), pages 1-18, January.
    11. David Yao & Morton Klein, 1989. "Lot sizes under continuous demand: The backorder case," Naval Research Logistics (NRL), John Wiley & Sons, vol. 36(5), pages 615-624, October.
    12. Z S Hua & B Zhang & J Yang & D S Tan, 2007. "A new approach of forecasting intermittent demand for spare parts inventories in the process industries," Journal of the Operational Research Society, Palgrave Macmillan;The OR Society, vol. 58(1), pages 52-61, January.
    13. Deniz Preil & Michael Krapp, 2022. "Artificial intelligence-based inventory management: a Monte Carlo tree search approach," Annals of Operations Research, Springer, vol. 308(1), pages 415-439, January.
    14. Demeter, Krisztina & Szász, Levente & Kő, Andrea, 2019. "A text mining based overview of inventory research in the ISIR special issues 1994–2016," International Journal of Production Economics, Elsevier, vol. 209(C), pages 134-146.
    15. Mahmut Parlar & Defne Berkin, 1991. "Future supply uncertainty in EOQ models," Naval Research Logistics (NRL), John Wiley & Sons, vol. 38(1), pages 107-121, February.
    16. Sandun C. Perera & Suresh P. Sethi, 2023. "A survey of stochastic inventory models with fixed costs: Optimality of (s, S) and (s, S)‐type policies—Continuous‐time case," Production and Operations Management, Production and Operations Management Society, vol. 32(1), pages 154-169, January.
    17. Truong Ngoc Cuong & Le Ngoc Bao Long & Hwan-Seong Kim & Sam-Sang You, 2023. "Data analytics and throughput forecasting in port management systems against disruptions: a case study of Busan Port," Maritime Economics & Logistics, Palgrave Macmillan;International Association of Maritime Economists (IAME), vol. 25(1), pages 61-89, March.
    18. Eugenia BABILONI & Ester GUIJARRO & Manuel CARDÓS & Sofía ESTELLÉS, 2012. "Exact Fill Rates for the (R, S) Inventory Control with Discrete Distributed Demands for the Backordering Case," Informatica Economica, Academy of Economic Studies - Bucharest, Romania, vol. 16(3), pages 19-26.
    19. Wolfgang Seiringer & Balwin Bokor & Klaus Altendorfer, 2024. "Evaluating Production Planning and Control Systems in Different Environments: A Comparative Simulation Study," Papers 2405.02015, arXiv.org, revised Nov 2024.
    20. Jung, Seung Hwan & Yang, Yunsi, 2023. "On the value of operational flexibility in the trailer shipment and assignment problem: Data-driven approaches and reinforcement learning," International Journal of Production Economics, Elsevier, vol. 264(C).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:joinma:v:34:y:2023:i:1:d:10.1007_s10845-022-01982-5. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.