IDEAS home Printed from https://ideas.repec.org/a/gam/jftint/v13y2021i12p318-d705271.html
   My bibliography  Save this article

Models versus Datasets: Reducing Bias through Building a Comprehensive IDS Benchmark

Author

Listed:
  • Rasheed Ahmad

    (Department of Computer Information Sciences, University of the Cumberlands, 6178 College Station Drive, Williamsburg, KY 40769, USA)

  • Izzat Alsmadi

    (Department of computing and cyber security, University of Texas A&M San Antonio, One University Way, San Antonio, TX 78224, USA)

  • Wasim Alhamdani

    (Department of Computer Information Sciences, University of the Cumberlands, 6178 College Station Drive, Williamsburg, KY 40769, USA)

  • Lo’ai Tawalbeh

    (Department of computing and cyber security, University of Texas A&M San Antonio, One University Way, San Antonio, TX 78224, USA)

Abstract

Today, deep learning approaches are widely used to build Intrusion Detection Systems for securing IoT environments. However, the models’ hidden and complex nature raises various concerns, such as trusting the model output and understanding why the model made certain decisions. Researchers generally publish their proposed model’s settings and performance results based on a specific dataset and a classification model but do not report the proposed model’s output and findings. Similarly, many researchers suggest an IDS solution by focusing only on a single benchmark dataset and classifier. Such solutions are prone to generating inaccurate and biased results. This paper overcomes these limitations in previous work by analyzing various benchmark datasets and various individual and hybrid deep learning classifiers towards finding the best IDS solution for IoT that is efficient, lightweight, and comprehensive in detecting network anomalies. We also showed the model’s localized predictions and analyzed the top contributing features impacting the global performance of deep learning models. This paper aims to extract the aggregate knowledge from various datasets and classifiers and analyze the commonalities to avoid any possible bias in results and increase the trust and transparency of deep learning models. We believe this paper’s findings will help future researchers build a comprehensive IDS based on well-performing classifiers and utilize the aggregated knowledge and the minimum set of significantly contributing features.

Suggested Citation

  • Rasheed Ahmad & Izzat Alsmadi & Wasim Alhamdani & Lo’ai Tawalbeh, 2021. "Models versus Datasets: Reducing Bias through Building a Comprehensive IDS Benchmark," Future Internet, MDPI, vol. 13(12), pages 1-22, December.
  • Handle: RePEc:gam:jftint:v:13:y:2021:i:12:p:318-:d:705271
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/1999-5903/13/12/318/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/1999-5903/13/12/318/
    Download Restriction: no
    ---><---

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jftint:v:13:y:2021:i:12:p:318-:d:705271. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.