
Cost optimization and reliability analysis of fault tolerant system with service interruption and reboot

Authors

Listed:
  • Jain, Madhu
  • Kumar, Pankaj
  • Singh, Mayank
  • Gupta, Ritu

Abstract

Owing to their widespread use in many real-time systems, reliability modeling and cost optimization of fault-tolerant systems have drawn the attention of practitioners. Fault tolerance in these systems is provided by maintenance support and redundant components, which keep the system operating smoothly despite the failure of some active components. This investigation deals with the performance modeling of a fault-tolerant system consisting of a finite number of active (online) and standby components. During the switch-over from a failed active component to a standby, a recovery procedure is performed, which may be imperfect. In case of imperfect recovery, a system reboot takes place. The maintenance of all components is managed by a repairman (server) that is itself subject to failure. When the server's service is interrupted, the system does not stop functioning, because it switches over from perfect working mode to working breakdown mode. The system keeps working even when the server is on working vacation, and the server continues to perform repair jobs on the failed components. A machine repair model based on a Markovian process is developed to derive the transient probabilities and other performance indices of the fault-tolerant system using Laplace transforms and the matrix analytic method. A cost-benefit analysis is carried out using a direct search strategy and particle swarm optimization. The optimal design of the control parameters for the fault-tolerant system is presented by framing a cost-effective ratio function. The model is examined computationally through numerical simulation and cost optimization.
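
To make the idea of transient probabilities concrete, the following is a minimal sketch of a much simpler machine repair model than the one in the article. It assumes M active components, S warm standbys, and a single reliable repairman, and it leaves out imperfect recovery, reboot, server breakdown, and working vacation. The transient solution is computed by uniformization, which is swapped in here for the Laplace transform and matrix analytic approach named in the abstract. All function names, parameter names, and rate values are hypothetical and are not taken from the article.

```python
import math


def build_generator(M, S, lam, alpha, mu):
    """Generator matrix Q of a simplified machine repair model (illustrative only).

    State n is the number of failed components, n = 0..M+S.
    lam   : failure rate of an active component (assumed)
    alpha : failure rate of a warm standby component (assumed)
    mu    : repair rate of the single repairman (assumed)
    State M+S (all components failed) is absorbing, so that
    1 - P[M+S](t) can be read as the system reliability.
    """
    K = M + S
    Q = [[0.0] * (K + 1) for _ in range(K + 1)]
    for n in range(K):  # row K stays all zero: absorbing state
        active = min(M, K - n)        # components currently online
        standby_left = max(S - n, 0)  # standbys not yet used up
        fail = active * lam + standby_left * alpha
        repair = mu if n > 0 else 0.0
        Q[n][n + 1] = fail
        if n > 0:
            Q[n][n - 1] = repair
        Q[n][n] = -(fail + repair)
    return Q


def transient_probabilities(Q, p0, t, tol=1e-12, max_terms=100000):
    """State probabilities at time t via uniformization.

    P(t) = sum_k Poisson(k; q*t) * p0 * U^k, with U = I + Q/q.
    Suitable for moderate q*t only; exp(-q*t) underflows for very large q*t.
    """
    size = len(Q)
    q = max(-Q[i][i] for i in range(size))
    if q == 0.0 or t == 0.0:
        return list(p0)
    U = [[(1.0 if i == j else 0.0) + Q[i][j] / q for j in range(size)]
         for i in range(size)]
    weight = math.exp(-q * t)  # Poisson weight for k = 0
    vec = list(p0)             # p0 * U^k, starting with k = 0
    result = [weight * v for v in vec]
    covered = weight           # Poisson mass accumulated so far
    k = 0
    while 1.0 - covered > tol and k < max_terms:
        k += 1
        vec = [sum(vec[i] * U[i][j] for i in range(size)) for j in range(size)]
        weight *= q * t / k
        result = [r + weight * v for r, v in zip(result, vec)]
        covered += weight
    return result


def reliability(M, S, lam, alpha, mu, t):
    """Probability that not all M+S components have failed by time t."""
    Q = build_generator(M, S, lam, alpha, mu)
    p0 = [1.0] + [0.0] * (M + S)  # start with every component working
    return 1.0 - transient_probabilities(Q, p0, t)[-1]


if __name__ == "__main__":
    # Hypothetical rates, chosen only to exercise the code.
    for t in (1.0, 5.0, 10.0):
        print(t, reliability(M=3, S=2, lam=0.1, alpha=0.05, mu=1.0, t=t))
```

With one active component and no standby, the sketch reduces to a single exponential lifetime, so `reliability(1, 0, lam, 0.0, mu, t)` should agree with `exp(-lam * t)`; this is a convenient sanity check. A cost function built on such indices could then be handed to a direct search or particle swarm routine, but the article's cost structure is not given in the abstract, so none is sketched here.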

Suggested Citation

  • Jain, Madhu & Kumar, Pankaj & Singh, Mayank & Gupta, Ritu, 2024. "Cost optimization and reliability analysis of fault tolerant system with service interruption and reboot," Reliability Engineering and System Safety, Elsevier, vol. 249(C).
  • Handle: RePEc:eee:reensy:v:249:y:2024:i:c:s0951832024003028
    DOI: 10.1016/j.ress.2024.110229

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0951832024003028
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.ress.2024.110229?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item

    As the access to this document is restricted, you may want to search for a different version of it.
