IDEAS home Printed from https://ideas.repec.org/a/eee/reensy/v251y2024ics0951832024004733.html
   My bibliography  Save this article

Reinforcement learning in reliability and maintenance optimization: A tutorial

Author

Listed:
  • Zhang, Qin
  • Liu, Yu
  • Xiang, Yisha
  • Xiahou, Tangfan

Abstract

The increasing complexity of engineering systems presents significant challenges in addressing intricate reliability and maintenance optimization problems. Advanced computational techniques have become imperative. Reinforcement learning, with its strong capability of solving complicated sequential decision-making problems under uncertainty, has emerged as a powerful tool to reliability and maintainability community. This paper offers a step-by-step guideline on the utilization of reinforcement learning algorithms for resolving reliability optimization and maintenance optimization problems. We first introduce the Markov decision process modeling framework for these problems and elucidate the design and implementation of solution algorithms, including dynamic programming, reinforcement learning, and deep reinforcement learning. Case studies, including a pipeline system mission abort optimization and a manufacturing system condition-based maintenance decision-making, are included to demonstrate the utility of reinforcement learning in reliability and maintenance applications. This tutorial will assist researchers in the reliability and maintainability community by summarizing the state-of-the-art reinforcement learning algorithms and providing the hand-on implementations in reliability and maintenance optimization problems.

Suggested Citation

  • Zhang, Qin & Liu, Yu & Xiang, Yisha & Xiahou, Tangfan, 2024. "Reinforcement learning in reliability and maintenance optimization: A tutorial," Reliability Engineering and System Safety, Elsevier, vol. 251(C).
  • Handle: RePEc:eee:reensy:v:251:y:2024:i:c:s0951832024004733
    DOI: 10.1016/j.ress.2024.110401
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0951832024004733
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.ress.2024.110401?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Najafi, Seyedvahid & Lee, Chi-Guhn, 2023. "A deep reinforcement learning approach for repair-based maintenance of multi-unit systems using proportional hazards model," Reliability Engineering and System Safety, Elsevier, vol. 234(C).
    2. Liu, Lujie & Yang, Jun & Yan, Bingxin, 2024. "A dynamic mission abort policy for transportation systems with stochastic dependence by deep reinforcement learning," Reliability Engineering and System Safety, Elsevier, vol. 241(C).
    3. Zhang, Nan & Deng, Yingjun & Liu, Bin & Zhang, Jun, 2023. "Condition-based maintenance for a multi-component system in a dynamic operating environment," Reliability Engineering and System Safety, Elsevier, vol. 231(C).
    4. Liu, Bin & Pandey, Mahesh D. & Wang, Xiaolin & Zhao, Xiujie, 2021. "A finite-horizon condition-based maintenance policy for a two-unit system with dependent degradation processes," European Journal of Operational Research, Elsevier, vol. 295(2), pages 705-717.
    5. Guo, Chunhui & Liang, Zhenglin, 2022. "A predictive Markov decision process for optimizing inspection and maintenance strategies of partially observable multi-state systems," Reliability Engineering and System Safety, Elsevier, vol. 226(C).
    6. Alaswad, Suzan & Xiang, Yisha, 2017. "A review on condition-based maintenance optimization models for stochastically deteriorating system," Reliability Engineering and System Safety, Elsevier, vol. 157(C), pages 54-63.
    7. Pinciroli, Luca & Baraldi, Piero & Ballabio, Guido & Compare, Michele & Zio, Enrico, 2022. "Optimization of the Operation and Maintenance of renewable energy systems by Deep Reinforcement Learning," Renewable Energy, Elsevier, vol. 183(C), pages 752-763.
    8. Abhijit Gosavi, 2009. "Reinforcement Learning: A Tutorial Survey and Recent Advances," INFORMS Journal on Computing, INFORMS, vol. 21(2), pages 178-192, May.
    9. Ouyang, Zhiyuan & Liu, Yu & Ruan, Sheng-Jia & Jiang, Tao, 2019. "An improved particle swarm optimization algorithm for reliability-redundancy allocation problem with mixed redundancy strategy and heterogeneous components," Reliability Engineering and System Safety, Elsevier, vol. 181(C), pages 62-74.
    10. Luca Pinciroli & Piero Baraldi & Guido Ballabio & Michele Compare & Enrico Zio, 2021. "Deep Reinforcement Learning Based on Proximal Policy Optimization for the Maintenance of a Wind Farm with Multiple Crews," Energies, MDPI, vol. 14(20), pages 1-17, October.
    11. Shi, Yue & Zhu, Weihang & Xiang, Yisha & Feng, Qianmei, 2020. "Condition-based maintenance optimization for multi-component systems subject to a system reliability requirement," Reliability Engineering and System Safety, Elsevier, vol. 202(C).
    12. Liu, Lujie & Yang, Jun, 2023. "A dynamic mission abort policy for the swarm executing missions and its solution method by tailored deep reinforcement learning," Reliability Engineering and System Safety, Elsevier, vol. 234(C).
    13. Xu, Jianyu & Liu, Bin & Zhao, Xiujie & Wang, Xiao-Lin, 2024. "Online reinforcement learning for condition-based group maintenance using factored Markov decision processes," European Journal of Operational Research, Elsevier, vol. 315(1), pages 176-190.
    14. Liu, Yu & Chen, Yiming & Jiang, Tao, 2020. "Dynamic selective maintenance optimization for multi-state systems over a finite horizon: A deep reinforcement learning approach," European Journal of Operational Research, Elsevier, vol. 283(1), pages 166-181.
    15. Zhang, Nailong & Si, Wujun, 2020. "Deep reinforcement learning for condition-based maintenance planning of multi-component systems under dependent competing risks," Reliability Engineering and System Safety, Elsevier, vol. 203(C).
    16. Volodymyr Mnih & Koray Kavukcuoglu & David Silver & Andrei A. Rusu & Joel Veness & Marc G. Bellemare & Alex Graves & Martin Riedmiller & Andreas K. Fidjeland & Georg Ostrovski & Stig Petersen & Charle, 2015. "Human-level control through deep reinforcement learning," Nature, Nature, vol. 518(7540), pages 529-533, February.
    17. Andriotis, C.P. & Papakonstantinou, K.G., 2019. "Managing engineering systems with large state and action spaces through deep reinforcement learning," Reliability Engineering and System Safety, Elsevier, vol. 191(C).
    18. Zhu, Zhicheng & Xiang, Yisha & Zhao, Ming & Shi, Yue, 2023. "Data-driven remanufacturing planning with parameter uncertainty," European Journal of Operational Research, Elsevier, vol. 309(1), pages 102-116.
    19. Oriol Vinyals & Igor Babuschkin & Wojciech M. Czarnecki & Michaël Mathieu & Andrew Dudzik & Junyoung Chung & David H. Choi & Richard Powell & Timo Ewalds & Petko Georgiev & Junhyuk Oh & Dan Horgan & M, 2019. "Grandmaster level in StarCraft II using multi-agent reinforcement learning," Nature, Nature, vol. 575(7782), pages 350-354, November.
    20. Ma, Chenyang & Wang, Qiyu & Cai, Zhiqiang & Si, Shubin & Zhao, Jiangbin, 2021. "Component reassignment for reliability optimization of reconfigurable systems considering component degradation," Reliability Engineering and System Safety, Elsevier, vol. 215(C).
    21. Levitin, Gregory & Finkelstein, Maxim & Xiang, Yanping, 2021. "Optimal mission abort policies for repairable multistate systems performing multi-attempt mission," Reliability Engineering and System Safety, Elsevier, vol. 209(C).
    22. Zhao, Xian & Sun, Jinglei & Qiu, Qingan & Chen, Ke, 2021. "Optimal inspection and mission abort policies for systems subject to degradation," European Journal of Operational Research, Elsevier, vol. 292(2), pages 610-621.
    23. Nan Zhang & Sen Tian & Kaiquan Cai & Jun Zhang, 2023. "Condition-based maintenance assessment for a deteriorating system considering stochastic failure dependence," IISE Transactions, Taylor & Francis Journals, vol. 55(7), pages 687-697, July.
    24. Zhou, Yifan & Li, Bangcheng & Lin, Tian Ran, 2022. "Maintenance optimisation of multicomponent systems using hierarchical coordinated reinforcement learning," Reliability Engineering and System Safety, Elsevier, vol. 217(C).
    25. Zhao, Yunfei & Smidts, Carol, 2022. "Reinforcement learning for adaptive maintenance policy optimization under imperfect knowledge of the system degradation model and partial observability of system states," Reliability Engineering and System Safety, Elsevier, vol. 224(C).
    26. de Jonge, Bram & Scarf, Philip A., 2020. "A review on maintenance optimization," European Journal of Operational Research, Elsevier, vol. 285(3), pages 805-824.
    27. David Silver & Aja Huang & Chris J. Maddison & Arthur Guez & Laurent Sifre & George van den Driessche & Julian Schrittwieser & Ioannis Antonoglou & Veda Panneershelvam & Marc Lanctot & Sander Dieleman, 2016. "Mastering the game of Go with deep neural networks and tree search," Nature, Nature, vol. 529(7587), pages 484-489, January.
    28. Uit Het Broek, Michiel A.J. & Teunter, Ruud H. & de Jonge, Bram & Veldman, Jasper, 2021. "Joint condition-based maintenance and load-sharing optimization for two-unit systems with economic dependency," European Journal of Operational Research, Elsevier, vol. 295(3), pages 1119-1131.
    29. David Silver & Julian Schrittwieser & Karen Simonyan & Ioannis Antonoglou & Aja Huang & Arthur Guez & Thomas Hubert & Lucas Baker & Matthew Lai & Adrian Bolton & Yutian Chen & Timothy Lillicrap & Fan , 2017. "Mastering the game of Go without human knowledge," Nature, Nature, vol. 550(7676), pages 354-359, October.
    30. Qiu, Qingan & Cui, Lirong & Wu, Bei, 2020. "Dynamic mission abort policy for systems operating in a controllable environment with self-healing mechanism," Reliability Engineering and System Safety, Elsevier, vol. 203(C).
    31. Coit, David W. & Zio, Enrico, 2019. "The evolution of system reliability optimization," Reliability Engineering and System Safety, Elsevier, vol. 192(C).
    32. Andriotis, C.P. & Papakonstantinou, K.G., 2021. "Deep reinforcement learning driven inspection and maintenance planning under incomplete information and constraints," Reliability Engineering and System Safety, Elsevier, vol. 212(C).
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Mikhail, Mina & Ouali, Mohamed-Salah & Yacout, Soumaya, 2024. "A data-driven methodology with a nonparametric reliability method for optimal condition-based maintenance strategies," Reliability Engineering and System Safety, Elsevier, vol. 241(C).
    2. Karabağ, Oktay & Bulut, Önder & Toy, Ayhan Özgür & Fadıloğlu, Mehmet Murat, 2024. "An efficient procedure for optimal maintenance intervention in partially observable multi-component systems," Reliability Engineering and System Safety, Elsevier, vol. 244(C).
    3. Pinciroli, Luca & Baraldi, Piero & Zio, Enrico, 2023. "Maintenance optimization in industry 4.0," Reliability Engineering and System Safety, Elsevier, vol. 234(C).
    4. Lee, Dongkyu & Song, Junho, 2023. "Risk-informed operation and maintenance of complex lifeline systems using parallelized multi-agent deep Q-network," Reliability Engineering and System Safety, Elsevier, vol. 239(C).
    5. Morato, P.G. & Andriotis, C.P. & Papakonstantinou, K.G. & Rigo, P., 2023. "Inference and dynamic decision-making for deteriorating systems with probabilistic dependencies through Bayesian networks and deep reinforcement learning," Reliability Engineering and System Safety, Elsevier, vol. 235(C).
    6. Najafi, Seyedvahid & Lee, Chi-Guhn, 2023. "A deep reinforcement learning approach for repair-based maintenance of multi-unit systems using proportional hazards model," Reliability Engineering and System Safety, Elsevier, vol. 234(C).
    7. Mohammadi, Reza & He, Qing, 2022. "A deep reinforcement learning approach for rail renewal and maintenance planning," Reliability Engineering and System Safety, Elsevier, vol. 225(C).
    8. Azizi, Fariba & Salari, Nooshin, 2023. "A novel condition-based maintenance framework for parallel manufacturing systems based on bivariate birth/birth–death processes," Reliability Engineering and System Safety, Elsevier, vol. 229(C).
    9. Liu, Lujie & Yang, Jun, 2023. "A dynamic mission abort policy for the swarm executing missions and its solution method by tailored deep reinforcement learning," Reliability Engineering and System Safety, Elsevier, vol. 234(C).
    10. Anwar, Ghazanfar Ali & Zhang, Xiaoge, 2024. "Deep reinforcement learning for intelligent risk optimization of buildings under hazard," Reliability Engineering and System Safety, Elsevier, vol. 247(C).
    11. Tseremoglou, Iordanis & Santos, Bruno F., 2024. "Condition-Based Maintenance scheduling of an aircraft fleet under partial observability: A Deep Reinforcement Learning approach," Reliability Engineering and System Safety, Elsevier, vol. 241(C).
    12. Lee, Juseong & Mitici, Mihaela, 2023. "Deep reinforcement learning for predictive aircraft maintenance using probabilistic Remaining-Useful-Life prognostics," Reliability Engineering and System Safety, Elsevier, vol. 230(C).
    13. Guan, Xiaoshu & Sun, Huabin & Hou, Rongrong & Xu, Yang & Bao, Yuequan & Li, Hui, 2023. "A deep reinforcement learning method for structural dominant failure modes searching based on self-play strategy," Reliability Engineering and System Safety, Elsevier, vol. 233(C).
    14. Zheng, Meimei & Su, Zhiyun & Wang, Dong & Pan, Ershun, 2024. "Joint maintenance and spare part ordering from multiple suppliers for multicomponent systems using a deep reinforcement learning algorithm," Reliability Engineering and System Safety, Elsevier, vol. 241(C).
    15. Levitin, Gregory & Xing, Liudong & Dai, Yuanshun, 2024. "Optimal system loading and aborting in additive multi-attempt missions," Reliability Engineering and System Safety, Elsevier, vol. 251(C).
    16. da Costa, Paulo & Verleijsdonk, Peter & Voorberg, Simon & Akcay, Alp & Kapodistria, Stella & van Jaarsveld, Willem & Zhang, Yingqian, 2023. "Policies for the dynamic traveling maintainer problem with alerts," European Journal of Operational Research, Elsevier, vol. 305(3), pages 1141-1152.
    17. Huy Chau & Duy Nguyen & Thai Nguyen, 2024. "Continuous-time optimal investment with portfolio constraints: a reinforcement learning approach," Papers 2412.10692, arXiv.org.
    18. Weifan Long & Taixian Hou & Xiaoyi Wei & Shichao Yan & Peng Zhai & Lihua Zhang, 2023. "A Survey on Population-Based Deep Reinforcement Learning," Mathematics, MDPI, vol. 11(10), pages 1-17, May.
    19. Zhao, Xian & Lv, Zuheng & Qiu, Qingan & Wu, Yaguang, 2023. "Designing two-level rescue depot location and dynamic rescue policies for unmanned vehicles," Reliability Engineering and System Safety, Elsevier, vol. 233(C).
    20. Li, Wenqing & Ni, Shaoquan, 2022. "Train timetabling with the general learning environment and multi-agent deep reinforcement learning," Transportation Research Part B: Methodological, Elsevier, vol. 157(C), pages 230-251.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:reensy:v:251:y:2024:i:c:s0951832024004733. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: https://www.journals.elsevier.com/reliability-engineering-and-system-safety .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.