IDEAS home Printed from https://ideas.repec.org/a/eee/reensy/v239y2023ics095183202300426x.html
   My bibliography  Save this article

Risk-informed operation and maintenance of complex lifeline systems using parallelized multi-agent deep Q-network

Author

Listed:
  • Lee, Dongkyu
  • Song, Junho

Abstract

Lifeline systems such as transportation and water distribution networks may deteriorate with age, raising the risk of system failure or degradation. Thus, system-level sequential decision-making is essential to address the problem cost-effectively while minimizing the potential loss. Researchers have proposed to assess the risk of lifeline systems using Markov decision processes (MDPs) to identify a risk-informed operation and maintenance (O&M) policy. In complex systems with many components, however, it is potentially intractable to find MDP solutions because the numbers of states and action spaces increase exponentially. This paper proposes a multi-agent deep reinforcement learning framework, termed parallelized multi-agent deep Q-network (PM-DQN), to overcome the curse of dimensionality. The proposed method takes a divide-and-conquer strategy, in which multiple subsystems are identified by community detection, and each agent learns to achieve the O&M policy of the corresponding subsystem. The agents establish policies to minimize the decentralized cost of the cluster unit, including the factorized cost. Such learning processes occur simultaneously in several parallel units, and the trained policies are periodically synchronized with the best ones, thereby improving the master policy. Numerical examples demonstrate that the proposed method outperforms baseline policies, including conventional maintenance schemes and the subsystem-level optimal policy.

Suggested Citation

  • Lee, Dongkyu & Song, Junho, 2023. "Risk-informed operation and maintenance of complex lifeline systems using parallelized multi-agent deep Q-network," Reliability Engineering and System Safety, Elsevier, vol. 239(C).
  • Handle: RePEc:eee:reensy:v:239:y:2023:i:c:s095183202300426x
    DOI: 10.1016/j.ress.2023.109512
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S095183202300426X
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.ress.2023.109512?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Niu, Yi-Feng, 2021. "Performance measure of a multi-state flow network under reliability and maintenance cost considerations," Reliability Engineering and System Safety, Elsevier, vol. 215(C).
    2. Mohammadi, Reza & He, Qing, 2022. "A deep reinforcement learning approach for rail renewal and maintenance planning," Reliability Engineering and System Safety, Elsevier, vol. 225(C).
    3. Stern, R.E. & Song, J. & Work, D.B., 2017. "Accelerated Monte Carlo system reliability analysis through machine-learning-based surrogate models of network connectivity," Reliability Engineering and System Safety, Elsevier, vol. 164(C), pages 1-9.
    4. Ouyang, Yanfeng & Madanat, Samer, 2006. "An analytical solution for the finite-horizon pavement resurfacing planning problem," Transportation Research Part B: Methodological, Elsevier, vol. 40(9), pages 767-778, November.
    5. Aryai, Vahid & Baji, Hassan & Mahmoodian, Mojtaba & Li, Chun-Qing, 2020. "Time-dependent finite element reliability assessment of cast-iron water pipes subjected to spatio-temporal correlated corrosion process," Reliability Engineering and System Safety, Elsevier, vol. 197(C).
    6. Nguyen, Van-Thai & Do, Phuc & Vosin, Alexandre & Iung, Benoit, 2022. "Artificial-intelligence-based maintenance decision-making and optimization for multi-state component systems," Reliability Engineering and System Safety, Elsevier, vol. 228(C).
    7. de Jonge, Bram & Teunter, Ruud & Tinga, Tiedo, 2017. "The influence of practical factors on the benefits of condition-based maintenance over time-based maintenance," Reliability Engineering and System Safety, Elsevier, vol. 158(C), pages 21-30.
    8. NESTEROV, Yurii, 2012. "Efficiency of coordinate descent methods on huge-scale optimization problems," LIDAM Reprints CORE 2511, Université catholique de Louvain, Center for Operations Research and Econometrics (CORE).
    9. Morales-Torres, Adrián & Escuder-Bueno, Ignacio & Serrano-Lombillo, Armando & Castillo Rodríguez, Jesica T., 2019. "Dealing with epistemic uncertainty in risk-informed decision making for dam safety management," Reliability Engineering and System Safety, Elsevier, vol. 191(C).
    10. Papakonstantinou, K.G. & Shinozuka, M., 2014. "Planning structural inspection and maintenance policies via dynamic programming and Markov processes. Part II: POMDP implementation," Reliability Engineering and System Safety, Elsevier, vol. 130(C), pages 214-224.
    11. Martínez-Galán Fernández, Pablo & Guillén López, Antonio J. & Márquez, Adolfo Crespo & Gomez Fernández, Juan Fco. & Marcos, Jose Antonio, 2022. "Dynamic Risk Assessment for CBM-based adaptation of maintenance planning," Reliability Engineering and System Safety, Elsevier, vol. 223(C).
    12. Jannie Sønderkær Nielsen & John Dalsgaard Sørensen, 2014. "Methods for Risk-Based Planning of O&M of Wind Turbines," Energies, MDPI, vol. 7(10), pages 1-20, October.
    13. Yang, Ao & Qiu, Qingan & Zhu, Mingren & Cui, Lirong & Chen, Weilin & Chen, Jianhui, 2022. "Condition-based maintenance strategy for redundant systems with arbitrary structures using improved reinforcement learning," Reliability Engineering and System Safety, Elsevier, vol. 225(C).
    14. Ahuja, Ravindra K. & Kodialam, Murali & Mishra, Ajay K. & Orlin, James B., 1997. "Computational investigations of maximum flow algorithms," European Journal of Operational Research, Elsevier, vol. 97(3), pages 509-542, March.
    15. Zhou, Yifan & Li, Bangcheng & Lin, Tian Ran, 2022. "Maintenance optimisation of multicomponent systems using hierarchical coordinated reinforcement learning," Reliability Engineering and System Safety, Elsevier, vol. 217(C).
    16. Mosayebi Omshi, E. & Grall, A., 2021. "Replacement and imperfect repair of deteriorating system: Study of a CBM policy and impact of repair efficiency," Reliability Engineering and System Safety, Elsevier, vol. 215(C).
    17. Zhang, Nailong & Si, Wujun, 2020. "Deep reinforcement learning for condition-based maintenance planning of multi-component systems under dependent competing risks," Reliability Engineering and System Safety, Elsevier, vol. 203(C).
    18. Der Kiureghian, Armen & Ditlevsen, Ove D. & Song, Junho, 2007. "Availability, reliability and downtime of systems with repairable components," Reliability Engineering and System Safety, Elsevier, vol. 92(2), pages 231-242.
    19. David Silver & Aja Huang & Chris J. Maddison & Arthur Guez & Laurent Sifre & George van den Driessche & Julian Schrittwieser & Ioannis Antonoglou & Veda Panneershelvam & Marc Lanctot & Sander Dieleman, 2016. "Mastering the game of Go with deep neural networks and tree search," Nature, Nature, vol. 529(7587), pages 484-489, January.
    20. Volodymyr Mnih & Koray Kavukcuoglu & David Silver & Andrei A. Rusu & Joel Veness & Marc G. Bellemare & Alex Graves & Martin Riedmiller & Andreas K. Fidjeland & Georg Ostrovski & Stig Petersen & Charle, 2015. "Human-level control through deep reinforcement learning," Nature, Nature, vol. 518(7540), pages 529-533, February.
    21. Andriotis, C.P. & Papakonstantinou, K.G., 2019. "Managing engineering systems with large state and action spaces through deep reinforcement learning," Reliability Engineering and System Safety, Elsevier, vol. 191(C).
    22. Ohlmann, Jeffrey W. & Bean, James C., 2009. "Resource-constrained management of heterogeneous assets with stochastic deterioration," European Journal of Operational Research, Elsevier, vol. 199(1), pages 198-208, November.
    23. Andriotis, C.P. & Papakonstantinou, K.G., 2021. "Deep reinforcement learning driven inspection and maintenance planning under incomplete information and constraints," Reliability Engineering and System Safety, Elsevier, vol. 212(C).
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Jiang, Fengyuan & Dong, Sheng, 2024. "Probabilistic-based burst failure mechanism analysis and risk assessment of pipelines with random non-uniform corrosion defects, considering the interacting effects," Reliability Engineering and System Safety, Elsevier, vol. 242(C).
    2. Yang, Sen & Zhang, Yi & Lu, Xinzheng & Guo, Wei & Miao, Huiquan, 2024. "Multi-agent deep reinforcement learning based decision support model for resilient community post-hazard recovery," Reliability Engineering and System Safety, Elsevier, vol. 242(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Morato, P.G. & Andriotis, C.P. & Papakonstantinou, K.G. & Rigo, P., 2023. "Inference and dynamic decision-making for deteriorating systems with probabilistic dependencies through Bayesian networks and deep reinforcement learning," Reliability Engineering and System Safety, Elsevier, vol. 235(C).
    2. Xu, Zhaoyi & Saleh, Joseph Homer, 2021. "Machine learning for reliability engineering and safety applications: Review of current status and future opportunities," Reliability Engineering and System Safety, Elsevier, vol. 211(C).
    3. Tseremoglou, Iordanis & Santos, Bruno F., 2024. "Condition-Based Maintenance scheduling of an aircraft fleet under partial observability: A Deep Reinforcement Learning approach," Reliability Engineering and System Safety, Elsevier, vol. 241(C).
    4. Najafi, Seyedvahid & Lee, Chi-Guhn, 2023. "A deep reinforcement learning approach for repair-based maintenance of multi-unit systems using proportional hazards model," Reliability Engineering and System Safety, Elsevier, vol. 234(C).
    5. Mohammadi, Reza & He, Qing, 2022. "A deep reinforcement learning approach for rail renewal and maintenance planning," Reliability Engineering and System Safety, Elsevier, vol. 225(C).
    6. Liu, Hengchang & Li, Bo & Yao, Fengming & Hu, Gexi & Xie, Lei, 2024. "Maintenance optimization of multi-unit balanced systems using deep reinforcement learning," Reliability Engineering and System Safety, Elsevier, vol. 244(C).
    7. Lee, Juseong & Mitici, Mihaela, 2023. "Deep reinforcement learning for predictive aircraft maintenance using probabilistic Remaining-Useful-Life prognostics," Reliability Engineering and System Safety, Elsevier, vol. 230(C).
    8. Cheng, Jianda & Cheng, Minghui & Liu, Yan & Wu, Jun & Li, Wei & Frangopol, Dan M., 2024. "Knowledge transfer for adaptive maintenance policy optimization in engineering fleets based on meta-reinforcement learning," Reliability Engineering and System Safety, Elsevier, vol. 247(C).
    9. Hamida, Zachary & Goulet, James-A., 2023. "Hierarchical reinforcement learning for transportation infrastructure maintenance planning," Reliability Engineering and System Safety, Elsevier, vol. 235(C).
    10. Luo, Yi & Zhao, Xiujie & Liu, Bin & He, Shuguang, 2024. "Condition-based maintenance policy for systems under dynamic environment," Reliability Engineering and System Safety, Elsevier, vol. 246(C).
    11. Guan, Xiaoshu & Sun, Huabin & Hou, Rongrong & Xu, Yang & Bao, Yuequan & Li, Hui, 2023. "A deep reinforcement learning method for structural dominant failure modes searching based on self-play strategy," Reliability Engineering and System Safety, Elsevier, vol. 233(C).
    12. Lee, Jun S. & Yeo, In-Ho & Bae, Younghoon, 2024. "A stochastic track maintenance scheduling model based on deep reinforcement learning approaches," Reliability Engineering and System Safety, Elsevier, vol. 241(C).
    13. Zheng, Meimei & Su, Zhiyun & Wang, Dong & Pan, Ershun, 2024. "Joint maintenance and spare part ordering from multiple suppliers for multicomponent systems using a deep reinforcement learning algorithm," Reliability Engineering and System Safety, Elsevier, vol. 241(C).
    14. Anwar, Ghazanfar Ali & Zhang, Xiaoge, 2024. "Deep reinforcement learning for intelligent risk optimization of buildings under hazard," Reliability Engineering and System Safety, Elsevier, vol. 247(C).
    15. Mikhail, Mina & Ouali, Mohamed-Salah & Yacout, Soumaya, 2024. "A data-driven methodology with a nonparametric reliability method for optimal condition-based maintenance strategies," Reliability Engineering and System Safety, Elsevier, vol. 241(C).
    16. Pinciroli, Luca & Baraldi, Piero & Zio, Enrico, 2023. "Maintenance optimization in industry 4.0," Reliability Engineering and System Safety, Elsevier, vol. 234(C).
    17. Guan, Xiaoshu & Xiang, Zhengliang & Bao, Yuequan & Li, Hui, 2022. "Structural dominant failure modes searching method based on deep reinforcement learning," Reliability Engineering and System Safety, Elsevier, vol. 219(C).
    18. Bismut, Elizabeth & Straub, Daniel, 2021. "Optimal adaptive inspection and maintenance planning for deteriorating structural systems," Reliability Engineering and System Safety, Elsevier, vol. 215(C).
    19. Yan, Dongyang & Li, Keping & Zhu, Qiaozhen & Liu, Yanyan, 2023. "A railway accident prevention method based on reinforcement learning – Active preventive strategy by multi-modal data," Reliability Engineering and System Safety, Elsevier, vol. 234(C).
    20. Ye, Zhenggeng & Cai, Zhiqiang & Yang, Hui & Si, Shubin & Zhou, Fuli, 2023. "Joint optimization of maintenance and quality inspection for manufacturing networks based on deep reinforcement learning," Reliability Engineering and System Safety, Elsevier, vol. 236(C).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:reensy:v:239:y:2023:i:c:s095183202300426x. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: https://www.journals.elsevier.com/reliability-engineering-and-system-safety .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.