IDEAS home Printed from https://ideas.repec.org/a/gam/jmathe/v12y2024i13p2113-d1429708.html

Software Fault Localization Based on Weighted Association Rule Mining and Complex Networks

Author

Listed:
  • Wentao Wu

    (School of Reliability and Systems Engineering, Beihang University, Beijing 100191, China
    Science and Technology on Reliability and Environmental Engineering Laboratory, Beijing 100191, China)

  • Shihai Wang

    (School of Reliability and Systems Engineering, Beihang University, Beijing 100191, China
    Science and Technology on Reliability and Environmental Engineering Laboratory, Beijing 100191, China
    State Key Laboratory of Software Development Environment, Beijing 100191, China)

  • Bin Liu

    (School of Reliability and Systems Engineering, Beihang University, Beijing 100191, China
    Science and Technology on Reliability and Environmental Engineering Laboratory, Beijing 100191, China
    State Key Laboratory of Software Development Environment, Beijing 100191, China)

Abstract

Software fault localization aims to identify the suspicious statements that cause software failures and is crucial for ensuring software quality. Spectrum-based fault localization (SBFL) calculates the suspiciousness of each statement by analyzing the correlation between statement coverage information and the execution results of test cases. SBFL has attracted increasing attention because of its high efficiency and scalability. However, existing SBFL studies have shown that a large number of statements often share the same suspiciousness score, which hinders debuggers from quickly locating faulty statements. To address this challenge, we propose FL-WARMCN, an SBFL model based on weighted association rule mining and complex networks. FL-WARMCN first uses the Jaccard distance between passing and failing test cases as the weight of each passing test case. Next, it calculates the initial suspiciousness of each statement from the program spectrum data. It then applies a weighted association rule mining algorithm to obtain the correlations between statements and builds a network from them, in which the suspiciousness of a statement serves as its node weight and the correlation between two statements serves as the edge weight. We use eigenvector centrality, which accounts for both a statement's degree centrality and the importance of its neighboring statements, to calculate the importance of each statement, and incorporate that importance as a weight into the statement's weighted suspiciousness. Finally, we validate FL-WARMCN experimentally on the Defects4J dataset. The results show that the model significantly outperforms the baselines. In addition, we analyze the impact of different node and edge weights on model performance.
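
The pipeline described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation, because the abstract gives no formulas. Every concrete choice below is an assumption made for illustration: averaging the Jaccard distance over all failing tests, using an Ochiai-style formula for the initial suspiciousness, replacing weighted association rule mining with a simple co-coverage support measure for the edge weights, and multiplying suspiciousness by centrality for the final score. All function names are hypothetical, and at least one failing test is assumed.

```python
import math
from itertools import combinations


def jaccard_distance(a, b):
    """Jaccard distance between two sets of covered statement ids."""
    union = a | b
    return 1.0 - len(a & b) / len(union) if union else 0.0


def passing_weights(passing, failing):
    """Assumption: a passing test's weight is its mean distance to the failing tests."""
    return [sum(jaccard_distance(p, f) for f in failing) / len(failing)
            for p in passing]


def initial_suspiciousness(statements, passing, failing, weights):
    """Assumption: Ochiai-style score with weighted passing coverage."""
    total_fail = len(failing)
    scores = {}
    for s in statements:
        ef = sum(1 for f in failing if s in f)                   # failing tests covering s
        ep = sum(w for p, w in zip(passing, weights) if s in p)  # weighted passing coverage
        denom = math.sqrt(total_fail * (ef + ep))
        scores[s] = ef / denom if denom else 0.0
    return scores


def edge_weights(statements, failing):
    """Stand-in for rule mining: share of failing tests that cover both statements."""
    edges = {}
    for s, t in combinations(sorted(statements), 2):
        both = sum(1 for f in failing if s in f and t in f)
        if both:
            edges[(s, t)] = both / len(failing)
    return edges


def eigenvector_centrality(statements, edges, iterations=100, tol=1e-9):
    """Power iteration on the weighted, undirected graph (shifted by the identity)."""
    x = {s: 1.0 for s in statements}
    for _ in range(iterations):
        nxt = dict(x)  # identity shift keeps the iteration from oscillating
        for (s, t), w in edges.items():
            nxt[s] += w * x[t]
            nxt[t] += w * x[s]
        norm = math.sqrt(sum(v * v for v in nxt.values()))
        nxt = {s: v / norm for s, v in nxt.items()}
        if sum(abs(nxt[s] - x[s]) for s in statements) < tol:
            return nxt
        x = nxt
    return x


def rank_statements(statements, passing, failing):
    """Assumption: final score = initial suspiciousness * eigenvector centrality."""
    weights = passing_weights(passing, failing)
    susp = initial_suspiciousness(statements, passing, failing, weights)
    cent = eigenvector_centrality(statements, edge_weights(statements, failing))
    final = {s: susp[s] * cent[s] for s in statements}
    return sorted(statements, key=lambda s: final[s], reverse=True)


# Toy spectrum: each test is the set of statement ids it executes.
statements = [1, 2, 3, 4]
failing = [{1, 2, 3}]
passing = [{1, 2}, {1, 4}]
print(rank_statements(statements, passing, failing))
```

In this toy spectrum, statement 3 is executed only by the failing test, so it ranks first. Statement 4 is never executed by a failing test, so its suspiciousness is zero and it ranks last.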

Suggested Citation

  • Wentao Wu & Shihai Wang & Bin Liu, 2024. "Software Fault Localization Based on Weighted Association Rule Mining and Complex Networks," Mathematics, MDPI, vol. 12(13), pages 1-21, July.
  • Handle: RePEc:gam:jmathe:v:12:y:2024:i:13:p:2113-:d:1429708

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/12/13/2113/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/12/13/2113/
    Download Restriction: no

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Fu, Lipeng & Wang, Xueqing & Zhao, Heng & Li, Mengnan, 2022. "Interactions among safety risks in metro deep foundation pit projects: An association rule mining-based modeling framework," Reliability Engineering and System Safety, Elsevier, vol. 221(C).
    2. Ya-Chun Gao & Zong-Wen Wei & Bing-Hong Wang, 2013. "Dynamic Evolution Of Financial Network And Its Relation To Economic Crises," International Journal of Modern Physics C (IJMPC), World Scientific Publishing Co. Pte. Ltd., vol. 24(02), pages 1-10.
    3. Zhou, Wei-Xing & Jiang, Zhi-Qiang & Sornette, Didier, 2007. "Exploring self-similarity of complex cellular networks: The edge-covering method with simulated annealing and log-periodic sampling," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 375(2), pages 741-752.
    4. Bezsudnov, I.V. & Snarskii, A.A., 2014. "From the time series to the complex networks: The parametric natural visibility graph," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 414(C), pages 53-60.
    5. Mark S. Handcock & Adrian E. Raftery & Jeremy M. Tantrum, 2007. "Model‐based clustering for social networks," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 170(2), pages 301-354, March.
    6. Wang, Qingyun & Duan, Zhisheng & Chen, Guanrong & Feng, Zhaosheng, 2008. "Synchronization in a class of weighted complex networks with coupling delays," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 387(22), pages 5616-5622.
    7. F. W. S. Lima, 2015. "Evolution of egoism on semi-directed and undirected Barabási-Albert networks," International Journal of Modern Physics C (IJMPC), World Scientific Publishing Co. Pte. Ltd., vol. 26(12), pages 1-9.
    8. G. Ghoshal & M. E.J. Newman, 2007. "Growing distributed networks with arbitrary degree distributions," The European Physical Journal B: Condensed Matter and Complex Systems, Springer;EDP Sciences, vol. 58(2), pages 175-184, July.
    9. Chang, Y.F. & Han, S.K. & Wang, X.D., 2018. "The way to uncover community structure with core and diversity," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 501(C), pages 111-119.
    10. Chakrabarti, Anindya S., 2015. "Stochastic Lotka-Volterra equations: A model of lagged diffusion of technology in an interconnected world," IIMA Working Papers WP2015-08-05, Indian Institute of Management Ahmedabad, Research and Publication Department.
    11. Kurmankhojayev, Daniyar & Li, Guoyuan & Chen, Anthony, 2024. "Link criticality index: Refinement, framework extension, and a case study," Reliability Engineering and System Safety, Elsevier, vol. 243(C).
    12. Roth, Camille, 2007. "Empiricism for descriptive social network models," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 378(1), pages 53-58.
    13. Douglas R. White & Jason Owen-Smith & James Moody & Walter W. Powell, 2004. "Networks, Fields and Organizations: Micro-Dynamics, Scale and Cohesive Embeddings," Computational and Mathematical Organization Theory, Springer, vol. 10(1), pages 95-117, May.
    14. L. da F. Costa & L. E.C. da Rocha, 2006. "A generalized approach to complex networks," The European Physical Journal B: Condensed Matter and Complex Systems, Springer;EDP Sciences, vol. 50(1), pages 237-242, March.
    15. Perc, Matjaž, 2010. "Zipf’s law and log-normal distributions in measures of scientific output across fields and institutions: 40 years of Slovenia’s research as an example," Journal of Informetrics, Elsevier, vol. 4(3), pages 358-364.
    16. Florian Blöchl & Fabian J. Theis & Fernando Vega-Redondo & Eric O'N. Fisher, 2010. "Which Sectors of a Modern Economy are most Central?," CESifo Working Paper Series 3175, CESifo.
    17. M. C. González & A. O. Sousa & H. J. Herrmann, 2004. "Opinion Formation On A Deterministic Pseudo-Fractal Network," International Journal of Modern Physics C (IJMPC), World Scientific Publishing Co. Pte. Ltd., vol. 15(01), pages 45-57.
    18. A. Chatterjee, 2009. "Kinetic models for wealth exchange on directed networks," The European Physical Journal B: Condensed Matter and Complex Systems, Springer;EDP Sciences, vol. 67(4), pages 593-598, February.
    19. Z.-Q. Jiang & L. Guo & W.-X. Zhou, 2007. "Endogenous and exogenous dynamics in the fluctuations of capital fluxes," The European Physical Journal B: Condensed Matter and Complex Systems, Springer;EDP Sciences, vol. 57(3), pages 347-355, June.
    20. D Dylan Johnson Restrepo & Neil F Johnson, 2017. "Unraveling the Collective Dynamics of Complex Adaptive Biomedical Systems," Current Trends in Biomedical Engineering & Biosciences, Juniper Publishers Inc., vol. 8(5), pages 118-132, September.
