Deep reinforcement learning driven inspection and maintenance planning under incomplete information and constraints

Author

Listed:

Andriotis, C.P.
Papakonstantinou, K.G.

Abstract

Determination of inspection and maintenance policies for minimizing long-term risks and costs in deteriorating engineering environments constitutes a complex optimization problem. Major computational challenges include the (i) curse of dimensionality, due to exponential scaling of state/action set cardinalities with the number of components; (ii) curse of history, related to exponentially growing decision-trees with the number of decision-steps; (iii) presence of state uncertainties, induced by inherent environment stochasticity and variability of inspection/monitoring measurements; (iv) presence of constraints, pertaining to stochastic long-term limitations, due to resource scarcity and other infeasible/undesirable system responses. In this work, these challenges are addressed within a joint framework of constrained Partially Observable Markov Decision Processes (POMDP) and multi-agent Deep Reinforcement Learning (DRL). POMDPs optimally tackle (ii)-(iii), combining stochastic dynamic programming with Bayesian inference principles. Multi-agent DRL addresses (i), through deep function parametrizations and decentralized control assumptions. Challenge (iv) is herein handled through proper state augmentation and Lagrangian relaxation, with emphasis on life-cycle risk-based constraints and budget limitations. The underlying algorithmic steps are provided, and the proposed framework is found to outperform well-established policy baselines and facilitate adept prescription of inspection and intervention actions, in cases where decisions must be made in the most resource- and risk-aware manner.
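As a rough illustration of two ingredients named in the abstract, the sketch below shows a discrete Bayesian belief update (the inference step a POMDP relies on when the true state is hidden) and a Lagrangian-relaxed step cost with a projected dual-ascent multiplier update. It is a minimal sketch under stated assumptions, not the authors' implementation: the function names, the three-state deterioration model, and all numerical values are hypothetical, and the multiplier update is a generic subgradient rule rather than the scheme used in the paper.

```python
from typing import List, Sequence


def belief_update(
    belief: Sequence[float],
    transition: Sequence[Sequence[float]],
    obs_likelihood: Sequence[float],
) -> List[float]:
    """One Bayesian filtering step over a discrete hidden state.

    belief[i]          : P(state = i) before the step
    transition[i][j]   : P(next state = j | state = i) under the chosen action
    obs_likelihood[j]  : P(received observation | next state = j)
    """
    n = len(belief)
    # Prediction: propagate the belief through the deterioration model.
    predicted = [sum(belief[i] * transition[i][j] for i in range(n)) for j in range(n)]
    # Correction: weight each state by how well it explains the observation.
    unnormalized = [obs_likelihood[j] * predicted[j] for j in range(n)]
    evidence = sum(unnormalized)
    if evidence == 0.0:
        raise ValueError("observation has zero probability under the model")
    return [u / evidence for u in unnormalized]


def relaxed_cost(cost: float, constraint_cost: float, multiplier: float) -> float:
    """Step cost after Lagrangian relaxation: the constrained quantity
    (e.g. risk or spending) is priced by the multiplier."""
    return cost + multiplier * constraint_cost


def update_multiplier(
    multiplier: float, expected_constraint: float, limit: float, step: float
) -> float:
    """Projected dual ascent (an assumed, generic rule): raise the multiplier
    when the constraint is violated on average, lower it otherwise, and keep
    it non-negative."""
    return max(0.0, multiplier + step * (expected_constraint - limit))


if __name__ == "__main__":
    # Hypothetical component with states (intact, damaged, failed), no repair.
    T = [[0.9, 0.1, 0.0], [0.0, 0.8, 0.2], [0.0, 0.0, 1.0]]
    # Hypothetical likelihood of an "indication found" inspection outcome.
    likelihood = [0.1, 0.8, 1.0]
    print(belief_update([1.0, 0.0, 0.0], T, likelihood))
    print(relaxed_cost(cost=3.0, constraint_cost=2.0, multiplier=0.5))
    print(update_multiplier(0.5, expected_constraint=12.0, limit=10.0, step=0.1))
```

In a belief-based formulation of this kind, the updated belief (possibly augmented with a running budget or risk variable) would serve as the input to the decision policy, while the multiplier trades off the life-cycle cost against the constrained quantity.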

Suggested Citation

Andriotis, C.P. & Papakonstantinou, K.G., 2021. "Deep reinforcement learning driven inspection and maintenance planning under incomplete information and constraints," Reliability Engineering and System Safety, Elsevier, vol. 212(C).

Handle: RePEc:eee:reensy:v:212:y:2021:i:c:s095183202100106x
DOI: 10.1016/j.ress.2021.107551


Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Andriotis, C.P. & Papakonstantinou, K.G., 2019. "Managing engineering systems with large state and action spaces through deep reinforcement learning," Reliability Engineering and System Safety, Elsevier, vol. 191(C).
Memarzadeh, Milad & Pozzi, Matteo, 2016. "Value of information in sequential decision making: Component inspection, permanent monitoring and system-level scheduling," Reliability Engineering and System Safety, Elsevier, vol. 154(C), pages 137-151.
Morato, P.G. & Andriotis, C.P. & Papakonstantinou, K.G. & Rigo, P., 2023. "Inference and dynamic decision-making for deteriorating systems with probabilistic dependencies through Bayesian networks and deep reinforcement learning," Reliability Engineering and System Safety, Elsevier, vol. 235(C).
Mancuso, A. & Compare, M. & Salo, A. & Zio, E., 2021. "Optimal Prognostics and Health Management-driven inspection and maintenance strategies for industrial systems," Reliability Engineering and System Safety, Elsevier, vol. 210(C).
Karabağ, Oktay & Eruguz, Ayse Sena & Basten, Rob, 2020. "Integrated optimization of maintenance interventions and spare part selection for a partially observable multi-component system," Reliability Engineering and System Safety, Elsevier, vol. 200(C).
de Pater, Ingeborg & Mitici, Mihaela, 2021. "Predictive maintenance for multi-component systems of repairables with Remaining-Useful-Life prognostics and a limited stock of spare components," Reliability Engineering and System Safety, Elsevier, vol. 214(C).
Xu, Zhaoyi & Saleh, Joseph Homer, 2021. "Machine learning for reliability engineering and safety applications: Review of current status and future opportunities," Reliability Engineering and System Safety, Elsevier, vol. 211(C).
Pinciroli, Luca & Baraldi, Piero & Zio, Enrico, 2023. "Maintenance optimization in industry 4.0," Reliability Engineering and System Safety, Elsevier, vol. 234(C).
Najafi, Seyedvahid & Lee, Chi-Guhn, 2023. "A deep reinforcement learning approach for repair-based maintenance of multi-unit systems using proportional hazards model," Reliability Engineering and System Safety, Elsevier, vol. 234(C).
Mohammadi, Reza & He, Qing, 2022. "A deep reinforcement learning approach for rail renewal and maintenance planning," Reliability Engineering and System Safety, Elsevier, vol. 225(C).
Arcieri, Giacomo & Hoelzl, Cyprien & Schwery, Oliver & Straub, Daniel & Papakonstantinou, Konstantinos G. & Chatzi, Eleni, 2023. "Bridging POMDPs and Bayesian decision making for robust maintenance planning under model uncertainty: An application to railway systems," Reliability Engineering and System Safety, Elsevier, vol. 239(C).
Özgür-Ünlüakın, Demet & Türkali, Busenur, 2021. "Evaluation of proactive maintenance policies on a stochastically dependent hidden multi-component system using DBNs," Reliability Engineering and System Safety, Elsevier, vol. 211(C).
Nguyen, Van-Thai & Do, Phuc & Vosin, Alexandre & Iung, Benoit, 2022. "Artificial-intelligence-based maintenance decision-making and optimization for multi-state component systems," Reliability Engineering and System Safety, Elsevier, vol. 228(C).
Anwar, Ghazanfar Ali & Zhang, Xiaoge, 2024. "Deep reinforcement learning for intelligent risk optimization of buildings under hazard," Reliability Engineering and System Safety, Elsevier, vol. 247(C).
Tseremoglou, Iordanis & Santos, Bruno F., 2024. "Condition-Based Maintenance scheduling of an aircraft fleet under partial observability: A Deep Reinforcement Learning approach," Reliability Engineering and System Safety, Elsevier, vol. 241(C).
Lv, Y. & Yan, X.D. & Sun, W. & Gao, Z.Y., 2015. "A risk-based method for planning of bus–subway corridor evacuation under hybrid uncertainties," Reliability Engineering and System Safety, Elsevier, vol. 139(C), pages 188-199.
Hao, Zhaojun & Di Maio, Francesco & Zio, Enrico, 2023. "A sequential decision problem formulation and deep reinforcement learning solution of the optimization of O&M of cyber-physical energy systems (CPESs) for reliable and safe power production and supply," Reliability Engineering and System Safety, Elsevier, vol. 235(C).
Bismut, Elizabeth & Straub, Daniel, 2021. "Optimal adaptive inspection and maintenance planning for deteriorating structural systems," Reliability Engineering and System Safety, Elsevier, vol. 215(C).
Nguyen, Khanh T.P. & Medjaher, Kamal, 2019. "A new dynamic predictive maintenance framework using deep learning for failure prognostics," Reliability Engineering and System Safety, Elsevier, vol. 188(C), pages 251-262.
Özgür-Ünlüakın, Demet & Bilgiç, Taner, 2017. "Performance analysis of an aggregation and disaggregation solution procedure to obtain a maintenance plan for a partially observable multi-component system," Reliability Engineering and System Safety, Elsevier, vol. 167(C), pages 652-662.

More about this item

Keywords

Inspection and maintenance planning; System risk and reliability; Constrained stochastic optimization; Partially observable Markov decision processes; Deep reinforcement learning; Decentralized multi-agent control;