IDEAS home Printed from https://ideas.repec.org/a/eee/reensy/v217y2022ics0951832021005895.html
   My bibliography  Save this article

Reliability analysis of ensemble fault tolerance for soft error mitigation against complex radiation effect

Author

Listed:
  • Yang, Shunkun
  • Shao, Qi
  • Bian, Chong

Abstract

With the progressive miniaturization of integrated circuits and the increasing complexity of logic functions, the possibility of software system failure triggered by soft errors in the context of intensive space radiation is also increasing. The quantitative analysis of the reliability of different mitigation strategies against superposition affections in the context of a dynamic space radiation environment is still challenging. To solve these problems, a mixed extreme run degraded shock model is proposed to comprehensively describe the superposition of the space radiation effect, and a degradation mechanism is introduced to address the cumulative effect. Different ensemble mechanisms of structure fault tolerance, information fault tolerance, and rejuvenation strategy are modeled for quantitative analysis. Subsequently, through parameter transfer and state synchronization control technology, an interaction mechanism is established between the environment and design mode models to realize the dynamic reliability evolution analysis via probabilistic model checking. Experimental results demonstrate that the ensemble of structural fault-tolerance and information fault-tolerance has the best resistance to the space radiation effect with the reliability improvement by 6.17–20.17% under the given experimental conditions.

Suggested Citation

  • Yang, Shunkun & Shao, Qi & Bian, Chong, 2022. "Reliability analysis of ensemble fault tolerance for soft error mitigation against complex radiation effect," Reliability Engineering and System Safety, Elsevier, vol. 217(C).
  • Handle: RePEc:eee:reensy:v:217:y:2022:i:c:s0951832021005895
    DOI: 10.1016/j.ress.2021.108092
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0951832021005895
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.ress.2021.108092?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Rafiee, Koosha & Feng, Qianmei & Coit, David W., 2017. "Reliability assessment of competing risks with generalized mixed shock models," Reliability Engineering and System Safety, Elsevier, vol. 159(C), pages 1-11.
    2. Ranjkesh, Somayeh Hamed & Hamadani, Ali Zeinal & Mahmoodi, Safieh, 2019. "A new cumulative shock model with damage and inter-arrival time dependency," Reliability Engineering and System Safety, Elsevier, vol. 192(C).
    3. Eryilmaz, Serkan, 2017. "δ-shock model based on Polya process and its optimal replacement policy," European Journal of Operational Research, Elsevier, vol. 263(2), pages 690-697.
    4. Cirillo, Pasquale & Hüsler, Jürg, 2011. "Extreme shock models: An alternative perspective," Statistics & Probability Letters, Elsevier, vol. 81(1), pages 25-30, January.
    5. Hoque, Khaza Anuarul & Ait Mohamed, Otmane & Savaria, Yvon, 2019. "Dependability modeling and optimization of triple modular redundancy partitioning for SRAM-based FPGAs," Reliability Engineering and System Safety, Elsevier, vol. 182(C), pages 107-119.
    6. Jung, Seunghwa & Choi, Jihwan P., 2019. "Predicting system failure rates of SRAM-based FPGA on-board processors in space radiation environments," Reliability Engineering and System Safety, Elsevier, vol. 183(C), pages 374-386.
    7. Kretzschmar, U. & Gomez-Cornejo, J. & Astarloa, A. & Bidarte, U. & Ser, J. Del, 2016. "Synchronization of faulty processors in coarse-grained TMR protected partially reconfigurable FPGA designs," Reliability Engineering and System Safety, Elsevier, vol. 151(C), pages 1-9.
    8. Ramezani, Reza & Clemente, Juan Antonio & Franco, Francisco J., 2020. "Analytical reliability estimation of SRAM-based FPGA designs against single-bit and multiple-cell upsets," Reliability Engineering and System Safety, Elsevier, vol. 202(C).
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Zeng, Ying & Huang, Tudi & Li, Yan-Feng & Huang, Hong-Zhong, 2023. "Reliability modeling for power converter in satellite considering periodic phased mission," Reliability Engineering and System Safety, Elsevier, vol. 232(C).
    2. Wang, Rongxi & Li, Yufan & Xu, Jinjin & Wang, Zhen & Gao, Jianmin, 2022. "F2G: A hybrid fault-function graphical model for reliability analysis of complex equipment with coupled faults," Reliability Engineering and System Safety, Elsevier, vol. 226(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Ramezani, Reza & Ghavidel, Abolfazl & Sedaghat, Yasser, 2021. "Exact and efficient reliability and performance optimization of synchronous task graphs," Reliability Engineering and System Safety, Elsevier, vol. 205(C).
    2. Wang, Xiaoyue & Zhao, Xian & Wang, Siqi & Sun, Leping, 2020. "Reliability and maintenance for performance-balanced systems operating in a shock environment," Reliability Engineering and System Safety, Elsevier, vol. 195(C).
    3. Zhao, Xian & Wang, Siqi & Wang, Xiaoyue & Cai, Kui, 2018. "A multi-state shock model with mutative failure patterns," Reliability Engineering and System Safety, Elsevier, vol. 178(C), pages 1-11.
    4. Jung, Sejin & Yoo, Junbeom & Lee, Young-Jun, 2020. "A practical application of NUREG/CR-6430 software safety hazard analysis to FPGA software," Reliability Engineering and System Safety, Elsevier, vol. 202(C).
    5. Lyu, Hao & Qu, Hongchen & Yang, Zaiyou & Ma, Li & Lu, Bing & Pecht, Michael, 2023. "Reliability analysis of dependent competing failure processes with time-varying δ shock model," Reliability Engineering and System Safety, Elsevier, vol. 229(C).
    6. Ramezani, Reza & Clemente, Juan Antonio & Franco, Francisco J., 2020. "Analytical reliability estimation of SRAM-based FPGA designs against single-bit and multiple-cell upsets," Reliability Engineering and System Safety, Elsevier, vol. 202(C).
    7. Wang, Jia & Han, Xu & Zhang, Yun-an & Bai, Guanghan, 2021. "Modeling the varying effects of shocks for a multi-stage degradation process," Reliability Engineering and System Safety, Elsevier, vol. 215(C).
    8. Chadjiconstantinidis, Stathis & Eryilmaz, Serkan, 2023. "Reliability of a mixed δ-shock model with a random change point in shock magnitude distribution and an optimal replacement policy," Reliability Engineering and System Safety, Elsevier, vol. 232(C).
    9. Eryilmaz, Serkan & Kan, Cihangir, 2019. "Reliability and optimal replacement policy for an extreme shock model with a change point," Reliability Engineering and System Safety, Elsevier, vol. 190(C), pages 1-1.
    10. Meango, Toualith Jean-Marc & Ouali, Mohamed-Salah, 2020. "Failure interaction model based on extreme shock and Markov processes," Reliability Engineering and System Safety, Elsevier, vol. 197(C).
    11. Zhang, Jianchun & Zhao, Yu & Ma, Xiaobing, 2020. "Reliability modeling methods for load-sharing k-out-of-n system subject to discrete external load," Reliability Engineering and System Safety, Elsevier, vol. 193(C).
    12. Zhao, Xian & He, Zongda & Wu, Yaguang & Qiu, Qingan, 2022. "Joint optimization of condition-based performance control and maintenance policies for mission-critical systems," Reliability Engineering and System Safety, Elsevier, vol. 226(C).
    13. Jung, Seunghwa & Choi, Jihwan P., 2019. "Predicting system failure rates of SRAM-based FPGA on-board processors in space radiation environments," Reliability Engineering and System Safety, Elsevier, vol. 183(C), pages 374-386.
    14. Levitin, Gregory & Finkelstein, Maxim & Dai, Yuanshun, 2018. "Optimizing availability of heterogeneous standby systems exposed to shocks," Reliability Engineering and System Safety, Elsevier, vol. 170(C), pages 137-145.
    15. Zhiyuan Zuo & Liang Wang & Yuhlong Lio, 2022. "Reliability Estimation for Dependent Left-Truncated and Right-Censored Competing Risks Data with Illustrations," Energies, MDPI, vol. 16(1), pages 1-25, December.
    16. Zhao, Xian & Guo, Xiaoxin & Wang, Xiaoyue, 2018. "Reliability and maintenance policies for a two-stage shock model with self-healing mechanism," Reliability Engineering and System Safety, Elsevier, vol. 172(C), pages 185-194.
    17. Granig, Wolfgang & Faller, Lisa-Marie & Hammerschmidt, Dirk & Zangl, Hubert, 2019. "Dependability considerations of redundant sensor systems," Reliability Engineering and System Safety, Elsevier, vol. 190(C), pages 1-1.
    18. Hoque, Khaza Anuarul & Ait Mohamed, Otmane & Savaria, Yvon, 2019. "Dependability modeling and optimization of triple modular redundancy partitioning for SRAM-based FPGAs," Reliability Engineering and System Safety, Elsevier, vol. 182(C), pages 107-119.
    19. Sun, Fuqiang & Li, Hao & Cheng, Yuanyuan & Liao, Haitao, 2021. "Reliability analysis for a system experiencing dependent degradation processes and random shocks based on a nonlinear Wiener process model," Reliability Engineering and System Safety, Elsevier, vol. 215(C).
    20. Zhao, Xian & Dong, Bingbing & Wang, Xiaoyue, 2023. "Reliability analysis of a two-dimensional voting system equipped with protective devices considering triggering failures," Reliability Engineering and System Safety, Elsevier, vol. 232(C).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:reensy:v:217:y:2022:i:c:s0951832021005895. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: https://www.journals.elsevier.com/reliability-engineering-and-system-safety .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.