IDEAS home Printed from https://ideas.repec.org/a/eee/reensy/v165y2017icp336-344.html
   My bibliography  Save this article

Optimal backup in heterogeneous standby systems exposed to shocks

Author

Listed:
  • Levitin, Gregory
  • Finkelstein, Maxim

Abstract

The paper considers non-repairable 1-out-of-N heterogeneous warm standby computing systems with components exposed to internal failures and external shocks. To provide the data recovery in the case of operating component failure, the backup procedures are performed during the computational mission. The backups enable an activated standby component to take over the mission task from the point where the last backup has been completed without redoing the entire task from scratch. Both data backup and retrieval times depend on the amount of work performed. The system components are characterized by a different performance level, replacement time, time-to-internal failure distribution, and shocks survival probability. The shock processes also have different characteristics for different components. A numerical method is proposed to evaluate mission success probability for a given allowed mission time and expected mission completion time. The optimal backup scheduling problem is then formulated and solved for different optimization objectives and constraints.

Suggested Citation

  • Levitin, Gregory & Finkelstein, Maxim, 2017. "Optimal backup in heterogeneous standby systems exposed to shocks," Reliability Engineering and System Safety, Elsevier, vol. 165(C), pages 336-344.
  • Handle: RePEc:eee:reensy:v:165:y:2017:i:c:p:336-344
    DOI: 10.1016/j.ress.2017.04.022
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0951832016300801
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.ress.2017.04.022?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Khac Tuan Huynh & Inma T. Castro & Anne Barros & Christophe Bérenguer, 2012. "Modeling age-based maintenance strategies with minimal repairs for systems subject to competing failure modes due to degradation and shocks," Post-Print hal-00790729, HAL.
    2. Montoro-Cazorla, Delia & Pérez-Ocón, Rafael, 2011. "Two shock and wear systems under repair standing a finite number of shocks," European Journal of Operational Research, Elsevier, vol. 214(2), pages 298-307, October.
    3. Chakravarthy, Srinivas R., 2012. "Maintenance of a deteriorating single server system with Markovian arrivals and random shocks," European Journal of Operational Research, Elsevier, vol. 222(3), pages 508-522.
    4. Papageorgiou, Effie & Kokolakis, George, 2010. "Reliability analysis of a two-unit general parallel system with (n-2) warm standbys," European Journal of Operational Research, Elsevier, vol. 201(3), pages 821-827, March.
    5. Huynh, K.T. & Castro, I.T. & Barros, A. & Bérenguer, C., 2012. "Modeling age-based maintenance strategies with minimal repairs for systems subject to competing failure modes due to degradation and shocks," European Journal of Operational Research, Elsevier, vol. 218(1), pages 140-151.
    6. Levitin, Gregory & Xing, Liudong & Dai, Yuanshun, 2015. "Optimal backup frequency in system with random repair time," Reliability Engineering and System Safety, Elsevier, vol. 144(C), pages 12-22.
    7. Durga Rao, K. & Gopika, V. & Sanyasi Rao, V.V.S. & Kushwaha, H.S. & Verma, A.K. & Srividya, A., 2009. "Dynamic fault tree analysis using Monte Carlo simulation in probabilistic safety assessment," Reliability Engineering and System Safety, Elsevier, vol. 94(4), pages 872-883.
    8. Levitin, Gregory & Xing, Liudong & Dai, Yuanshun, 2013. "Cold-standby sequencing optimization considering mission cost," Reliability Engineering and System Safety, Elsevier, vol. 118(C), pages 28-34.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Eryilmaz, Serkan & Devrim, Yilser, 2019. "Reliability and optimal replacement policy for a k-out-of-n system subject to shocks," Reliability Engineering and System Safety, Elsevier, vol. 188(C), pages 393-397.
    2. Gregory Levitin & Maxim Finkelstein, 2018. "Optimal mission abort policy with multiple shock number thresholds," Journal of Risk and Reliability, , vol. 232(6), pages 607-615, December.
    3. Jia, Heping & Ding, Yi & Peng, Rui & Liu, Hanlin & Song, Yonghua, 2020. "Reliability assessment and activation sequence optimization of non-repairable multi-state generation systems considering warm standby," Reliability Engineering and System Safety, Elsevier, vol. 195(C).
    4. Maxim Finkelstein & Gregory Levitin, 2018. "Optimal mission duration for systems subject to shocks and internal failures," Journal of Risk and Reliability, , vol. 232(1), pages 82-91, February.
    5. Levitin, Gregory & Finkelstein, Maxim & Dai, Yuanshun, 2018. "Optimizing availability of heterogeneous standby systems exposed to shocks," Reliability Engineering and System Safety, Elsevier, vol. 170(C), pages 137-145.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Cha, Ji Hwan & Finkelstein, Maxim, 2016. "New shock models based on the generalized Polya process," European Journal of Operational Research, Elsevier, vol. 251(1), pages 135-141.
    2. Levitin, Gregory & Finkelstein, Maxim, 2019. "Optimal loading of elements in series systems exposed to external shocks," Reliability Engineering and System Safety, Elsevier, vol. 192(C).
    3. Levitin, Gregory & Finkelstein, Maxim, 2017. "Effect of element separation in series-parallel systems exposed to random shocks," European Journal of Operational Research, Elsevier, vol. 260(1), pages 305-315.
    4. Ji Hwan Cha & Maxim Finkelstein, 2019. "Optimal preventive maintenance for systems having a continuous output and operating in a random environment," TOP: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 27(2), pages 327-350, July.
    5. Ji Hwan Cha & Maxim Finkelstein, 2018. "On a New Shot Noise Process and the Induced Survival Model," Methodology and Computing in Applied Probability, Springer, vol. 20(3), pages 897-917, September.
    6. Cha, Ji Hwan & Finkelstein, Maxim & Levitin, Gregory, 2018. "Optimal mission abort policy for partially repairable heterogeneous systems," European Journal of Operational Research, Elsevier, vol. 271(3), pages 818-825.
    7. Ji Hwan Cha & Maxim Finkelstein, 2019. "On some characteristics of quality for systems operating in a random environment," Journal of Risk and Reliability, , vol. 233(2), pages 257-267, April.
    8. Hazra, Nil Kamal & Finkelstein, Maxim & Cha, Ji Hwan, 2022. "On a hazard (failure) rate process with delays after shocks," Statistics & Probability Letters, Elsevier, vol. 181(C).
    9. Cha, Ji Hwan & Finkelstein, Maxim, 2018. "On information-based residual lifetime in survival models with delayed failures," Statistics & Probability Letters, Elsevier, vol. 137(C), pages 209-216.
    10. Levitin, Gregory & Xing, Liudong & Dai, Yuanshun, 2018. "Heterogeneous 1-out-of-N warm standby systems with online checkpointing," Reliability Engineering and System Safety, Elsevier, vol. 169(C), pages 127-136.
    11. Wang, Xiaolin & Liu, Bin & Zhao, Xiujie, 2021. "A performance-based warranty for products subject to competing hard and soft failures," International Journal of Production Economics, Elsevier, vol. 233(C).
    12. Tsai, Hsin-Nan & Sheu, Shey-Huei & Zhang, Zhe George, 2017. "A trivariate optimal replacement policy for a deteriorating system based on cumulative damage and inspections," Reliability Engineering and System Safety, Elsevier, vol. 160(C), pages 74-88.
    13. Giorgio, Massimiliano & Pulcini, Gianpaolo, 2018. "A new state-dependent degradation process and related model misidentification problems," European Journal of Operational Research, Elsevier, vol. 267(3), pages 1027-1038.
    14. Hyunju Lee & Ji Hwan Cha, 2021. "A general multivariate new better than used (MNBU) distribution and its properties," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 84(1), pages 27-46, January.
    15. Alaswad, Suzan & Xiang, Yisha, 2017. "A review on condition-based maintenance optimization models for stochastically deteriorating system," Reliability Engineering and System Safety, Elsevier, vol. 157(C), pages 54-63.
    16. Levitin, Gregory & Xing, Liudong & Haim, Hanoch Ben & Dai, Yuanshun, 2019. "Optimal structure of series system with 1-out-of-n warm standby subsystems performing operation and rescue functions," Reliability Engineering and System Safety, Elsevier, vol. 188(C), pages 523-531.
    17. Tanwar, Monika & Rai, Rajiv N. & Bolia, Nomesh, 2014. "Imperfect repair modeling using Kijima type generalized renewal process," Reliability Engineering and System Safety, Elsevier, vol. 124(C), pages 24-31.
    18. Dahal, Keshav & Al-Arfaj, Khalid & Paudyal, Krishna, 2015. "Modelling generator maintenance scheduling costs in deregulated power markets," European Journal of Operational Research, Elsevier, vol. 240(2), pages 551-561.
    19. Hai-Kun Wang & Yan-Feng Li & Yu Liu & Yuan-Jian Yang & Hong-Zhong Huang, 2015. "Remaining useful life estimation under degradation and shock damage," Journal of Risk and Reliability, , vol. 229(3), pages 200-208, June.
    20. Wang, Guan Jun & Zhang, Yuan Lin, 2013. "Optimal repair–replacement policies for a system with two types of failures," European Journal of Operational Research, Elsevier, vol. 226(3), pages 500-506.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:reensy:v:165:y:2017:i:c:p:336-344. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: https://www.journals.elsevier.com/reliability-engineering-and-system-safety .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.