IDEAS home Printed from https://ideas.repec.org/a/eee/reensy/v224y2022ics0951832022001922.html
   My bibliography  Save this article

Reinforcement learning for adaptive maintenance policy optimization under imperfect knowledge of the system degradation model and partial observability of system states

Author

Listed:
  • Zhao, Yunfei
  • Smidts, Carol

Abstract

Maintenance policy optimization usually is faced with challenges that arise from an imperfect knowledge of system degradation models and from the partial observability of system degradation states. This paper proposes a reinforcement learning method to address these two challenges for a class of maintenance problems with Markov degradation processes. The reinforcement learning approach consists of a learning component and a planning component. Using sequentially collected observations, at each step of decision-making the learning component improves the knowledge of system degradation in terms of the probability distributions of the transition rates based on sequential Bayesian inference. Using the updated transition rates, at each step of decision-making the maintenance policy optimization problem is then formulated as a partially observable Markov decision problem, and the planning component computes the optimal maintenance policy that maximizes the expected cumulative reward. The proposed method is illustrated using a numerical example with repair and inspection maintenance actions. The result shows that as more observations are collected, the learning component progressively learns the true system degradation process, and the planning component adjusts the optimal maintenance policy accordingly as well, which leads to increased reward.

Suggested Citation

  • Zhao, Yunfei & Smidts, Carol, 2022. "Reinforcement learning for adaptive maintenance policy optimization under imperfect knowledge of the system degradation model and partial observability of system states," Reliability Engineering and System Safety, Elsevier, vol. 224(C).
  • Handle: RePEc:eee:reensy:v:224:y:2022:i:c:s0951832022001922
    DOI: 10.1016/j.ress.2022.108541
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0951832022001922
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.ress.2022.108541?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Chen, Zhen & Li, Yaping & Xia, Tangbin & Pan, Ershun, 2019. "Hidden Markov model with auto-correlated observations for remaining useful life prediction and optimal maintenance policy," Reliability Engineering and System Safety, Elsevier, vol. 184(C), pages 123-136.
    2. Insua, David Rios & Ruggeri, Fabrizio & Soyer, Refik & Wilson, Simon, 2020. "Advances in Bayesian decision making in reliability," European Journal of Operational Research, Elsevier, vol. 282(1), pages 1-18.
    3. Cavalcante, C.A.V. & Lopes, R.S. & Scarf, P.A., 2018. "A general inspection and opportunistic replacement policy for one-component systems of variable quality," European Journal of Operational Research, Elsevier, vol. 266(3), pages 911-919.
    4. Ma, Xiaoyang & Liu, Bin & Yang, Li & Peng, Rui & Zhang, Xiaodong, 2020. "Reliability analysis and condition-based maintenance optimization for a warm standby cooling system," Reliability Engineering and System Safety, Elsevier, vol. 193(C).
    5. Alaswad, Suzan & Xiang, Yisha, 2017. "A review on condition-based maintenance optimization models for stochastically deteriorating system," Reliability Engineering and System Safety, Elsevier, vol. 157(C), pages 54-63.
    6. Liu, Bin & Wu, Shaomin & Xie, Min & Kuo, Way, 2017. "A condition-based maintenance policy for degrading systems with age- and state-dependent operating cost," European Journal of Operational Research, Elsevier, vol. 263(3), pages 879-887.
    7. Fauriat, William & Zio, Enrico, 2020. "Optimization of an aperiodic sequential inspection and condition-based maintenance policy driven by value of information," Reliability Engineering and System Safety, Elsevier, vol. 204(C).
    8. Michael Jong Kim & Viliam Makis, 2013. "Joint Optimization of Sampling and Control of Partially Observable Failing Systems," Operations Research, INFORMS, vol. 61(3), pages 777-790, June.
    9. Kıvanç, İpek & Özgür-Ünlüakın, Demet & Bilgiç, Taner, 2022. "Maintenance policy analysis of the regenerative air heater system using factored POMDPs," Reliability Engineering and System Safety, Elsevier, vol. 219(C).
    10. Belyi, Dmitriy & Popova, Elmira & Morton, David P. & Damien, Paul, 2017. "Bayesian failure-rate modeling and preventive maintenance optimization," European Journal of Operational Research, Elsevier, vol. 262(3), pages 1085-1093.
    11. Song, Chaolin & Zhang, Chi & Shafieezadeh, Abdollah & Xiao, Rucheng, 2022. "Value of information analysis in non-stationary stochastic decision environments: A reliability-assisted POMDP approach," Reliability Engineering and System Safety, Elsevier, vol. 217(C).
    12. Yuan, Xian-Xun & Higo, Eishiro & Pandey, Mahesh D., 2021. "Estimation of the value of an inspection and maintenance program: A Bayesian gamma process model," Reliability Engineering and System Safety, Elsevier, vol. 216(C).
    13. Bismut, Elizabeth & Straub, Daniel, 2021. "Optimal adaptive inspection and maintenance planning for deteriorating structural systems," Reliability Engineering and System Safety, Elsevier, vol. 215(C).
    14. Mosayebi Omshi, E. & Grall, A. & Shemehsavar, S., 2020. "A dynamic auto-adaptive predictive maintenance policy for degradation with unknown parameters," European Journal of Operational Research, Elsevier, vol. 282(1), pages 81-92.
    15. Walter, Gero & Flapper, Simme Douwe, 2017. "Condition-based maintenance for complex systems based on current component status and Bayesian updating of component reliability," Reliability Engineering and System Safety, Elsevier, vol. 168(C), pages 227-239.
    16. de Jonge, Bram & Scarf, Philip A., 2020. "A review on maintenance optimization," European Journal of Operational Research, Elsevier, vol. 285(3), pages 805-824.
    17. Shahraki, Ameneh Forouzandeh & Yadav, Om Prakash & Vogiatzis, Chrysafis, 2020. "Selective maintenance optimization for multi-state systems considering stochastically dependent components and stochastic imperfect maintenance actions," Reliability Engineering and System Safety, Elsevier, vol. 196(C).
    18. Hazra, Indranil & Pandey, Mahesh D. & Manzana, Noldainerick, 2020. "Approximate Bayesian computation (ABC) method for estimating parameters of the gamma process using noisy data," Reliability Engineering and System Safety, Elsevier, vol. 198(C).
    19. Lam, Ji Ye Janet & Banjevic, Dragan, 2015. "A myopic policy for optimal inspection scheduling for condition based maintenance," Reliability Engineering and System Safety, Elsevier, vol. 144(C), pages 1-11.
    20. Flage, Roger & Coit, David W. & Luxhøj, James T. & Aven, Terje, 2012. "Safety constraints applied to an adaptive Bayesian condition-based maintenance optimization model," Reliability Engineering and System Safety, Elsevier, vol. 102(C), pages 16-26.
    21. Zhao, Yunfei & Gao, Wei & Smidts, Carol, 2021. "Sequential Bayesian inference of transition rates in the hidden Markov model for multi-state system degradation," Reliability Engineering and System Safety, Elsevier, vol. 214(C).
    22. Liu, Xingchen & Sun, Qiuzhuang & Ye, Zhi-Sheng & Yildirim, Murat, 2021. "Optimal multi-type inspection policy for systems with imperfect online monitoring," Reliability Engineering and System Safety, Elsevier, vol. 207(C).
    23. Jiang, Tao & Liu, Yu, 2017. "Parameter inference for non-repairable multi-state system reliability models by multi-level observation sequences," Reliability Engineering and System Safety, Elsevier, vol. 166(C), pages 3-15.
    24. Guo, Chiming & Wang, Wenbin & Guo, Bo & Si, Xiaosheng, 2013. "A maintenance optimization model for mission-oriented systems based on Wiener degradation," Reliability Engineering and System Safety, Elsevier, vol. 111(C), pages 183-194.
    25. Juang, Muh-Guey & Anderson, Gary, 2004. "A Bayesian method on adaptive preventive maintenance problem," European Journal of Operational Research, Elsevier, vol. 155(2), pages 455-473, June.
    26. Andriotis, C.P. & Papakonstantinou, K.G., 2021. "Deep reinforcement learning driven inspection and maintenance planning under incomplete information and constraints," Reliability Engineering and System Safety, Elsevier, vol. 212(C).
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Tseremoglou, Iordanis & Santos, Bruno F., 2024. "Condition-Based Maintenance scheduling of an aircraft fleet under partial observability: A Deep Reinforcement Learning approach," Reliability Engineering and System Safety, Elsevier, vol. 241(C).
    2. Lee, Juseong & Mitici, Mihaela, 2023. "Deep reinforcement learning for predictive aircraft maintenance using probabilistic Remaining-Useful-Life prognostics," Reliability Engineering and System Safety, Elsevier, vol. 230(C).
    3. Sánchez, Luciano & Costa, Nahuel & Couso, Inés, 2023. "Simplified models of remaining useful life based on stochastic orderings," Reliability Engineering and System Safety, Elsevier, vol. 237(C).
    4. Ye, Zhenggeng & Cai, Zhiqiang & Yang, Hui & Si, Shubin & Zhou, Fuli, 2023. "Joint optimization of maintenance and quality inspection for manufacturing networks based on deep reinforcement learning," Reliability Engineering and System Safety, Elsevier, vol. 236(C).
    5. Mikhail, Mina & Ouali, Mohamed-Salah & Yacout, Soumaya, 2024. "A data-driven methodology with a nonparametric reliability method for optimal condition-based maintenance strategies," Reliability Engineering and System Safety, Elsevier, vol. 241(C).
    6. Pinciroli, Luca & Baraldi, Piero & Zio, Enrico, 2023. "Maintenance optimization in industry 4.0," Reliability Engineering and System Safety, Elsevier, vol. 234(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. de Jonge, Bram & Scarf, Philip A., 2020. "A review on maintenance optimization," European Journal of Operational Research, Elsevier, vol. 285(3), pages 805-824.
    2. Bismut, Elizabeth & Pandey, Mahesh D. & Straub, Daniel, 2022. "Reliability-based inspection and maintenance planning of a nuclear feeder piping system," Reliability Engineering and System Safety, Elsevier, vol. 224(C).
    3. Kampitsis, Dimitris & Panagiotidou, Sofia, 2022. "A Bayesian condition-based maintenance and monitoring policy with variable sampling intervals," Reliability Engineering and System Safety, Elsevier, vol. 218(PA).
    4. Peng, Rui & He, Xiaofeng & Zhong, Chao & Kou, Gang & Xiao, Hui, 2022. "Preventive maintenance for heterogeneous parallel systems with two failure modes," Reliability Engineering and System Safety, Elsevier, vol. 220(C).
    5. Azizi, Fariba & Salari, Nooshin, 2023. "A novel condition-based maintenance framework for parallel manufacturing systems based on bivariate birth/birth–death processes," Reliability Engineering and System Safety, Elsevier, vol. 229(C).
    6. Tseremoglou, Iordanis & Santos, Bruno F., 2024. "Condition-Based Maintenance scheduling of an aircraft fleet under partial observability: A Deep Reinforcement Learning approach," Reliability Engineering and System Safety, Elsevier, vol. 241(C).
    7. Truong-Ba, Huy & Cholette, Michael E. & Borghesani, Pietro & Ma, Lin & Kent, Geoff, 2021. "Condition-based inspection policies for boiler heat exchangers," European Journal of Operational Research, Elsevier, vol. 291(1), pages 232-243.
    8. Zhao, Xiujie & Liu, Bin & Xu, Jianyu & Wang, Xiao-Lin, 2023. "Imperfect maintenance policies for warranted products under stochastic performance degradation," European Journal of Operational Research, Elsevier, vol. 308(1), pages 150-165.
    9. Giorgio, Massimiliano & Pulcini, Gianpaolo, 2024. "The effect of model misspecification of the bounded transformed gamma process on maintenance optimization," Reliability Engineering and System Safety, Elsevier, vol. 241(C).
    10. Liu, Xingchen & Sun, Qiuzhuang & Ye, Zhi-Sheng & Yildirim, Murat, 2021. "Optimal multi-type inspection policy for systems with imperfect online monitoring," Reliability Engineering and System Safety, Elsevier, vol. 207(C).
    11. Mosayebi Omshi, E. & Grall, A. & Shemehsavar, S., 2020. "A dynamic auto-adaptive predictive maintenance policy for degradation with unknown parameters," European Journal of Operational Research, Elsevier, vol. 282(1), pages 81-92.
    12. Cai, Yue & Teunter, Ruud H. & de Jonge, Bram, 2023. "A data-driven approach for condition-based maintenance optimization," European Journal of Operational Research, Elsevier, vol. 311(2), pages 730-738.
    13. Dursun, İpek & Akçay, Alp & van Houtum, Geert-Jan, 2022. "Age-based maintenance under population heterogeneity: Optimal exploration and exploitation," European Journal of Operational Research, Elsevier, vol. 301(3), pages 1007-1020.
    14. Kim, Seokgoo & Choi, Joo-Ho & Kim, Nam Ho, 2022. "Inspection schedule for prognostics with uncertainty management," Reliability Engineering and System Safety, Elsevier, vol. 222(C).
    15. Mosayebi Omshi, E. & Grall, A., 2021. "Replacement and imperfect repair of deteriorating system: Study of a CBM policy and impact of repair efficiency," Reliability Engineering and System Safety, Elsevier, vol. 215(C).
    16. Alaswad, Suzan & Xiang, Yisha, 2017. "A review on condition-based maintenance optimization models for stochastically deteriorating system," Reliability Engineering and System Safety, Elsevier, vol. 157(C), pages 54-63.
    17. Huynh, K.T., 2021. "An adaptive predictive maintenance model for repairable deteriorating systems using inverse Gaussian degradation process," Reliability Engineering and System Safety, Elsevier, vol. 213(C).
    18. Akcay, Alp, 2022. "An alert-assisted inspection policy for a production process with imperfect condition signals," European Journal of Operational Research, Elsevier, vol. 298(2), pages 510-525.
    19. Liu, Qiannan & Ma, Lin & Wang, Naichao & Chen, Ankang & Jiang, Qihang, 2022. "A condition-based maintenance model considering multiple maintenance effects on the dependent failure processes," Reliability Engineering and System Safety, Elsevier, vol. 220(C).
    20. Esposito, Nicola & Mele, Agostino & Castanier, Bruno & GIORGIO, Massimiliano, 2023. "A hybrid maintenance policy for a deteriorating unit in the presence of three forms of variability," Reliability Engineering and System Safety, Elsevier, vol. 237(C).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:reensy:v:224:y:2022:i:c:s0951832022001922. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: https://www.journals.elsevier.com/reliability-engineering-and-system-safety .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.