IDEAS home Printed from https://ideas.repec.org/a/igg/jisp00/v18y2024i1p1-23.html
   My bibliography  Save this article

Application of Machine Learning Models for Malware Classification With Real and Synthetic Datasets

Author

Listed:
  • Santosh Joshi

    (Florida International University, USA)

  • Alexander Perez Pons

    (Florida International University, USA)

  • Shrirang Ambaji Kulkarni

    (Manipal Institute of Technology, Bengaluru, India)

  • Himanshu Upadhyay

    (Florida International University, USA)

Abstract

Stacking of multiple Machine Learning (ML) classifiers have gained popularity in addressing anomalous data classification along with Deep Learning (DL) algorithms. This study compares traditional ML classifiers, multi-layer stacking ML classifiers, and DL classifiers using an open-source malware dataset-containing equal numbers of benign and malware samples. The results on the realistic dataset indicate that the DL classifier, utilizing a Bidirectional Long Short-Term Memory (BiLSTM) model, outperformed the stacked classifiers with Logistic Regression (LR) and Support Vector Machine (SVM) as Meta learners by 36.78% and 39.69%, respectively, in terms of classification accuracy and performance. The research work was extended to study the impact of Generative Adversarial Network (GAN) based synthetic dataset of relatively smaller size on deep learning models. It was observed that the Deep Learning Multi-Layer Perceptron (DLMLP) Model had relatively superior performance as compared to complex deep learning models like Long Short-Term Memory LSTM and BiLSTM

Suggested Citation

  • Santosh Joshi & Alexander Perez Pons & Shrirang Ambaji Kulkarni & Himanshu Upadhyay, 2024. "Application of Machine Learning Models for Malware Classification With Real and Synthetic Datasets," International Journal of Information Security and Privacy (IJISP), IGI Global, vol. 18(1), pages 1-23, January.
  • Handle: RePEc:igg:jisp00:v:18:y:2024:i:1:p:1-23
    as

    Download full text from publisher

    File URL: http://services.igi-global.com/resolvedoi/resolve.aspx?doi=10.4018/IJISP.356513
    Download Restriction: no
    ---><---

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:igg:jisp00:v:18:y:2024:i:1:p:1-23. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Journal Editor (email available below). General contact details of provider: https://www.igi-global.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.