IDEAS home Printed from https://ideas.repec.org/a/hin/jnlmpe/6038619.html
   My bibliography  Save this article

An Approach to Semantic and Structural Features Learning for Software Defect Prediction

Author

Listed:
  • Shi Meilong
  • Peng He
  • Haitao Xiao
  • Huixin Li
  • Cheng Zeng

Abstract

Research on software defect prediction has achieved great success at modeling predictors. To build more accurate predictors, a number of hand-crafted features are proposed, such as static code features, process features, and social network features. Few models, however, consider the semantic and structural features of programs. Understanding the context information of source code files could explain a lot about the cause of defects in software. In this paper, we leverage representation learning for semantic and structural features generation. Specifically, we first extract token vectors of code files based on the Abstract Syntax Trees (ASTs) and then feed the token vectors into Convolutional Neural Network (CNN) to automatically learn semantic features. Meanwhile, we also construct a complex network model based on the dependencies between code files, namely, software network (SN). After that, to learn the structural features, we apply the network embedding method to the resulting SN. Finally, we build a novel software defect prediction model based on the learned semantic and structural features (SDP-S2S). We evaluated our method on 6 projects collected from public PROMISE repositories. The results suggest that the contribution of structural features extracted from software network is prominent, and when combined with semantic features, the results seem to be better. In addition, compared with the traditional hand-crafted features, the F -measure values of SDP-S2S are generally increased, with a maximum growth rate of 99.5%. We also explore the parameter sensitivity in the learning process of semantic and structural features and provide guidance for the optimization of predictors.

Suggested Citation

  • Shi Meilong & Peng He & Haitao Xiao & Huixin Li & Cheng Zeng, 2020. "An Approach to Semantic and Structural Features Learning for Software Defect Prediction," Mathematical Problems in Engineering, Hindawi, vol. 2020, pages 1-13, April.
  • Handle: RePEc:hin:jnlmpe:6038619
    DOI: 10.1155/2020/6038619
    as

    Download full text from publisher

    File URL: http://downloads.hindawi.com/journals/MPE/2020/6038619.pdf
    Download Restriction: no

    File URL: http://downloads.hindawi.com/journals/MPE/2020/6038619.xml
    Download Restriction: no

    File URL: https://libkey.io/10.1155/2020/6038619?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Ahmed Abdu & Zhengjun Zhai & Redhwan Algabri & Hakim A. Abdo & Kotiba Hamad & Mugahed A. Al-antari, 2022. "Deep Learning-Based Software Defect Prediction via Semantic Key Features of Source Code—Systematic Survey," Mathematics, MDPI, vol. 10(17), pages 1-26, August.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:hin:jnlmpe:6038619. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Mohamed Abdelhakeem (email available below). General contact details of provider: https://www.hindawi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.