Risk-informed operation and maintenance of complex lifeline systems using parallelized multi-agent deep Q-network

Author

Listed:

Lee, Dongkyu
Song, Junho

Abstract

Lifeline systems such as transportation and water distribution networks may deteriorate with age, raising the risk of system failure or degraded performance. System-level sequential decision-making is therefore essential to address deterioration cost-effectively while minimizing potential losses. Researchers have proposed assessing the risk of lifeline systems with Markov decision processes (MDPs) to identify a risk-informed operation and maintenance (O&M) policy. In complex systems with many components, however, finding MDP solutions can become intractable because the state and action spaces grow exponentially. This paper proposes a multi-agent deep reinforcement learning framework, termed parallelized multi-agent deep Q-network (PM-DQN), to overcome this curse of dimensionality. The proposed method takes a divide-and-conquer strategy: multiple subsystems are identified by community detection, and each agent learns the O&M policy of its corresponding subsystem. The agents establish policies to minimize the decentralized cost of the cluster unit, including the factorized cost. These learning processes run simultaneously in several parallel units, and the trained policies are periodically synchronized with the best ones, thereby improving the master policy. Numerical examples demonstrate that the proposed method outperforms baseline policies, including conventional maintenance schemes and the subsystem-level optimal policy.
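
The dimensionality argument in the abstract can be illustrated with a back-of-the-envelope count. The sketch below is not taken from the paper: it assumes every component has the same number of condition states and maintenance actions, and the function names, component counts, and cluster sizes are hypothetical values chosen only to show the scaling.

```python
def joint_space_sizes(n_components, n_states, n_actions):
    """Sizes of the joint state and action spaces, assuming each component
    independently takes one of n_states conditions and n_actions actions."""
    return n_states ** n_components, n_actions ** n_components


def factorized_space_sizes(cluster_sizes, n_states, n_actions):
    """Per-agent (state, action) space sizes when the system is split into
    clusters and each agent only considers its own subsystem."""
    return [joint_space_sizes(size, n_states, n_actions) for size in cluster_sizes]


if __name__ == "__main__":
    # Hypothetical numbers, for illustration only.
    n_states, n_actions = 4, 3
    clusters = [5, 5, 5, 5]  # 20 components split into 4 subsystems

    joint = joint_space_sizes(sum(clusters), n_states, n_actions)
    per_agent = factorized_space_sizes(clusters, n_states, n_actions)

    print("joint (states, actions):", joint)
    print("largest per-agent (states, actions):", max(per_agent))
```

Under these assumed numbers, a single agent would face 3^20 joint actions, whereas each of the four subsystem agents faces 3^5. This only counts space sizes; it does not model the coupling between subsystems that a factorized cost has to account for.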

Suggested Citation

Lee, Dongkyu & Song, Junho, 2023. "Risk-informed operation and maintenance of complex lifeline systems using parallelized multi-agent deep Q-network," Reliability Engineering and System Safety, Elsevier, vol. 239(C).

Handle: RePEc:eee:reensy:v:239:y:2023:i:c:s095183202300426x
DOI: 10.1016/j.ress.2023.109512

References listed on IDEAS

Ouyang, Yanfeng & Madanat, Samer, 2006. "An analytical solution for the finite-horizon pavement resurfacing planning problem," Transportation Research Part B: Methodological, Elsevier, vol. 40(9), pages 767-778, November.
de Jonge, Bram & Teunter, Ruud & Tinga, Tiedo, 2017. "The influence of practical factors on the benefits of condition-based maintenance over time-based maintenance," Reliability Engineering and System Safety, Elsevier, vol. 158(C), pages 21-30.
NESTEROV, Yurii, 2012. "Efficiency of coordinate descent methods on huge-scale optimization problems," LIDAM Reprints CORE 2511, Université catholique de Louvain, Center for Operations Research and Econometrics (CORE).
Morales-Torres, Adrián & Escuder-Bueno, Ignacio & Serrano-Lombillo, Armando & Castillo Rodríguez, Jesica T., 2019. "Dealing with epistemic uncertainty in risk-informed decision making for dam safety management," Reliability Engineering and System Safety, Elsevier, vol. 191(C).
Martínez-Galán Fernández, Pablo & Guillén López, Antonio J. & Márquez, Adolfo Crespo & Gomez Fernández, Juan Fco. & Marcos, Jose Antonio, 2022. "Dynamic Risk Assessment for CBM-based adaptation of maintenance planning," Reliability Engineering and System Safety, Elsevier, vol. 223(C).
Jannie Sønderkær Nielsen & John Dalsgaard Sørensen, 2014. "Methods for Risk-Based Planning of O&M of Wind Turbines," Energies, MDPI, vol. 7(10), pages 1-20, October.
Yang, Ao & Qiu, Qingan & Zhu, Mingren & Cui, Lirong & Chen, Weilin & Chen, Jianhui, 2022. "Condition-based maintenance strategy for redundant systems with arbitrary structures using improved reinforcement learning," Reliability Engineering and System Safety, Elsevier, vol. 225(C).
Zhang, Nailong & Si, Wujun, 2020. "Deep reinforcement learning for condition-based maintenance planning of multi-component systems under dependent competing risks," Reliability Engineering and System Safety, Elsevier, vol. 203(C).
Volodymyr Mnih & Koray Kavukcuoglu & David Silver & Andrei A. Rusu & Joel Veness & Marc G. Bellemare & Alex Graves & Martin Riedmiller & Andreas K. Fidjeland & Georg Ostrovski & Stig Petersen & Charle, 2015. "Human-level control through deep reinforcement learning," Nature, Nature, vol. 518(7540), pages 529-533, February.
Andriotis, C.P. & Papakonstantinou, K.G., 2019. "Managing engineering systems with large state and action spaces through deep reinforcement learning," Reliability Engineering and System Safety, Elsevier, vol. 191(C).
Ohlmann, Jeffrey W. & Bean, James C., 2009. "Resource-constrained management of heterogeneous assets with stochastic deterioration," European Journal of Operational Research, Elsevier, vol. 199(1), pages 198-208, November.
Niu, Yi-Feng, 2021. "Performance measure of a multi-state flow network under reliability and maintenance cost considerations," Reliability Engineering and System Safety, Elsevier, vol. 215(C).
Mohammadi, Reza & He, Qing, 2022. "A deep reinforcement learning approach for rail renewal and maintenance planning," Reliability Engineering and System Safety, Elsevier, vol. 225(C).
Stern, R.E. & Song, J. & Work, D.B., 2017. "Accelerated Monte Carlo system reliability analysis through machine-learning-based surrogate models of network connectivity," Reliability Engineering and System Safety, Elsevier, vol. 164(C), pages 1-9.
Aryai, Vahid & Baji, Hassan & Mahmoodian, Mojtaba & Li, Chun-Qing, 2020. "Time-dependent finite element reliability assessment of cast-iron water pipes subjected to spatio-temporal correlated corrosion process," Reliability Engineering and System Safety, Elsevier, vol. 197(C).
Nguyen, Van-Thai & Do, Phuc & Vosin, Alexandre & Iung, Benoit, 2022. "Artificial-intelligence-based maintenance decision-making and optimization for multi-state component systems," Reliability Engineering and System Safety, Elsevier, vol. 228(C).
Papakonstantinou, K.G. & Shinozuka, M., 2014. "Planning structural inspection and maintenance policies via dynamic programming and Markov processes. Part II: POMDP implementation," Reliability Engineering and System Safety, Elsevier, vol. 130(C), pages 214-224.
Ahuja, Ravindra K. & Kodialam, Murali & Mishra, Ajay K. & Orlin, James B., 1997. "Computational investigations of maximum flow algorithms," European Journal of Operational Research, Elsevier, vol. 97(3), pages 509-542, March.
Zhou, Yifan & Li, Bangcheng & Lin, Tian Ran, 2022. "Maintenance optimisation of multicomponent systems using hierarchical coordinated reinforcement learning," Reliability Engineering and System Safety, Elsevier, vol. 217(C).
Mosayebi Omshi, E. & Grall, A., 2021. "Replacement and imperfect repair of deteriorating system: Study of a CBM policy and impact of repair efficiency," Reliability Engineering and System Safety, Elsevier, vol. 215(C).
Der Kiureghian, Armen & Ditlevsen, Ove D. & Song, Junho, 2007. "Availability, reliability and downtime of systems with repairable components," Reliability Engineering and System Safety, Elsevier, vol. 92(2), pages 231-242.
David Silver & Aja Huang & Chris J. Maddison & Arthur Guez & Laurent Sifre & George van den Driessche & Julian Schrittwieser & Ioannis Antonoglou & Veda Panneershelvam & Marc Lanctot & Sander Dieleman, 2016. "Mastering the game of Go with deep neural networks and tree search," Nature, Nature, vol. 529(7587), pages 484-489, January.
Andriotis, C.P. & Papakonstantinou, K.G., 2021. "Deep reinforcement learning driven inspection and maintenance planning under incomplete information and constraints," Reliability Engineering and System Safety, Elsevier, vol. 212(C).

Citations

Cited by:

Jiang, Fengyuan & Dong, Sheng, 2024. "Probabilistic-based burst failure mechanism analysis and risk assessment of pipelines with random non-uniform corrosion defects, considering the interacting effects," Reliability Engineering and System Safety, Elsevier, vol. 242(C).
Kere, Kiswendsida J. & Huang, Qindan, 2024. "An analytical approach to evaluate life-cycle cost of deteriorating pipelines," Reliability Engineering and System Safety, Elsevier, vol. 250(C).
Yang, Sen & Zhang, Yi & Lu, Xinzheng & Guo, Wei & Miao, Huiquan, 2024. "Multi-agent deep reinforcement learning based decision support model for resilient community post-hazard recovery," Reliability Engineering and System Safety, Elsevier, vol. 242(C).

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Morato, P.G. & Andriotis, C.P. & Papakonstantinou, K.G. & Rigo, P., 2023. "Inference and dynamic decision-making for deteriorating systems with probabilistic dependencies through Bayesian networks and deep reinforcement learning," Reliability Engineering and System Safety, Elsevier, vol. 235(C).
Zhang, Qin & Liu, Yu & Xiang, Yisha & Xiahou, Tangfan, 2024. "Reinforcement learning in reliability and maintenance optimization: A tutorial," Reliability Engineering and System Safety, Elsevier, vol. 251(C).
Tseremoglou, Iordanis & Santos, Bruno F., 2024. "Condition-Based Maintenance scheduling of an aircraft fleet under partial observability: A Deep Reinforcement Learning approach," Reliability Engineering and System Safety, Elsevier, vol. 241(C).
Ferreira Neto, Waldomiro Alves & Virgínio Cavalcante, Cristiano Alexandre & Do, Phuc, 2024. "Deep reinforcement learning for maintenance optimization of a scrap-based steel production line," Reliability Engineering and System Safety, Elsevier, vol. 249(C).
Xu, Zhaoyi & Saleh, Joseph Homer, 2021. "Machine learning for reliability engineering and safety applications: Review of current status and future opportunities," Reliability Engineering and System Safety, Elsevier, vol. 211(C).
Najafi, Seyedvahid & Lee, Chi-Guhn, 2023. "A deep reinforcement learning approach for repair-based maintenance of multi-unit systems using proportional hazards model," Reliability Engineering and System Safety, Elsevier, vol. 234(C).
Mohammadi, Reza & He, Qing, 2022. "A deep reinforcement learning approach for rail renewal and maintenance planning," Reliability Engineering and System Safety, Elsevier, vol. 225(C).
Lee, Juseong & Mitici, Mihaela, 2023. "Deep reinforcement learning for predictive aircraft maintenance using probabilistic Remaining-Useful-Life prognostics," Reliability Engineering and System Safety, Elsevier, vol. 230(C).
Cheng, Jianda & Cheng, Minghui & Liu, Yan & Wu, Jun & Li, Wei & Frangopol, Dan M., 2024. "Knowledge transfer for adaptive maintenance policy optimization in engineering fleets based on meta-reinforcement learning," Reliability Engineering and System Safety, Elsevier, vol. 247(C).
Hamida, Zachary & Goulet, James-A., 2023. "Hierarchical reinforcement learning for transportation infrastructure maintenance planning," Reliability Engineering and System Safety, Elsevier, vol. 235(C).
Luo, Yi & Zhao, Xiujie & Liu, Bin & He, Shuguang, 2024. "Condition-based maintenance policy for systems under dynamic environment," Reliability Engineering and System Safety, Elsevier, vol. 246(C).
Lee, Jun S. & Yeo, In-Ho & Bae, Younghoon, 2024. "A stochastic track maintenance scheduling model based on deep reinforcement learning approaches," Reliability Engineering and System Safety, Elsevier, vol. 241(C).
Zheng, Meimei & Su, Zhiyun & Wang, Dong & Pan, Ershun, 2024. "Joint maintenance and spare part ordering from multiple suppliers for multicomponent systems using a deep reinforcement learning algorithm," Reliability Engineering and System Safety, Elsevier, vol. 241(C).
Liu, Hengchang & Li, Bo & Yao, Fengming & Hu, Gexi & Xie, Lei, 2024. "Maintenance optimization of multi-unit balanced systems using deep reinforcement learning," Reliability Engineering and System Safety, Elsevier, vol. 244(C).
Guan, Xiaoshu & Sun, Huabin & Hou, Rongrong & Xu, Yang & Bao, Yuequan & Li, Hui, 2023. "A deep reinforcement learning method for structural dominant failure modes searching based on self-play strategy," Reliability Engineering and System Safety, Elsevier, vol. 233(C).
Anwar, Ghazanfar Ali & Zhang, Xiaoge, 2024. "Deep reinforcement learning for intelligent risk optimization of buildings under hazard," Reliability Engineering and System Safety, Elsevier, vol. 247(C).
Bismut, Elizabeth & Straub, Daniel, 2021. "Optimal adaptive inspection and maintenance planning for deteriorating structural systems," Reliability Engineering and System Safety, Elsevier, vol. 215(C).
Pliego Marugán, Alberto & Pinar-Pérez, Jesús M. & García Márquez, Fausto Pedro, 2024. "A reinforcement learning agent for maintenance of deteriorating systems with increasingly imperfect repairs," Reliability Engineering and System Safety, Elsevier, vol. 252(C).
Saleh, Ali & Chiachío, Manuel & Salas, Juan Fernández & Kolios, Athanasios, 2023. "Self-adaptive optimized maintenance of offshore wind turbines by intelligent Petri nets," Reliability Engineering and System Safety, Elsevier, vol. 231(C).
Zhou, Yifan & Li, Bangcheng & Lin, Tian Ran, 2022. "Maintenance optimisation of multicomponent systems using hierarchical coordinated reinforcement learning," Reliability Engineering and System Safety, Elsevier, vol. 217(C).

More about this item

Keywords

Deep reinforcement learning; Lifeline systems; Life-cycle cost; Markov decision process; Operation & maintenance; Parallel processing
