IDEAS home Printed from https://ideas.repec.org/a/sae/intdis/v18y2022i3p15501477211049910.html
   My bibliography  Save this article

A novel and highly efficient botnet detection algorithm based on network traffic analysis of smart systems

Author

Listed:
  • Li Duan
  • Jingxian Zhou
  • You Wu
  • Wenyao Xu

Abstract

In smart systems, attackers can use botnets to launch different cyber attack activities against the Internet of Things. The traditional methods of detecting botnets commonly used machine learning algorithms, and it is difficult to detect and control botnets in a network because of unbalanced traffic data. In this article, we present a novel and highly efficient botnet detection method based on an autoencoder neural network in cooperation with decision trees on a given network. The deep flow inspection method and statistical analysis are first applied as a feature selection technique to select relevant features, which are used to characterize the communication-related behavior between network nodes. Then, the autoencoder neural network for feature selection is used to improve the efficiency of model construction. Finally, Tomek-Recursion Borderline Synthetic Minority Oversampling Technique generates additional minority samples to achieve class balance, and an improved gradient boosting decision tree algorithm is used to train and establish an abnormal traffic detection model to improve the detection of unbalanced botnet data. The results of experiments on the ISCX-botnet traffic dataset show that the proposed method achieved better botnet detection performance with 99.10% recall, 99.20% accuracy, 99.1% F1 score, and 99.0% area under the curve.

Suggested Citation

  • Li Duan & Jingxian Zhou & You Wu & Wenyao Xu, 2022. "A novel and highly efficient botnet detection algorithm based on network traffic analysis of smart systems," International Journal of Distributed Sensor Networks, , vol. 18(3), pages 15501477211, March.
  • Handle: RePEc:sae:intdis:v:18:y:2022:i:3:p:15501477211049910
    DOI: 10.1177/15501477211049910
    as

    Download full text from publisher

    File URL: https://journals.sagepub.com/doi/10.1177/15501477211049910
    Download Restriction: no

    File URL: https://libkey.io/10.1177/15501477211049910?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Wo Jae Lee & Gamini P. Mendis & Matthew J. Triebe & John W. Sutherland, 2020. "Monitoring of a machining process using kernel principal component analysis and kernel density estimation," Journal of Intelligent Manufacturing, Springer, vol. 31(5), pages 1175-1189, June.
    2. Döpke, Jörg & Fritsche, Ulrich & Pierdzioch, Christian, 2017. "Predicting recessions with boosted regression trees," International Journal of Forecasting, Elsevier, vol. 33(4), pages 745-759.
    3. Fariba Haddadi & A. Nur Zincir‐Heywood, 2017. "Botnet behaviour analysis: How would a data analytics‐based system with minimum a priori information perform?," International Journal of Network Management, John Wiley & Sons, vol. 27(4), July.
    4. Ruchi Vishwakarma & Ankit Kumar Jain, 2020. "A survey of DDoS attacking techniques and defence mechanisms in the IoT network," Telecommunication Systems: Modelling, Analysis, Design and Management, Springer, vol. 73(1), pages 3-25, January.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Carstensen, Kai & Heinrich, Markus & Reif, Magnus & Wolters, Maik H., 2020. "Predicting ordinary and severe recessions with a three-state Markov-switching dynamic factor model," International Journal of Forecasting, Elsevier, vol. 36(3), pages 829-850.
    2. Philippe Goulet Coulombe & Maxime Leroux & Dalibor Stevanovic & Stéphane Surprenant, 2022. "How is machine learning useful for macroeconomic forecasting?," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 37(5), pages 920-964, August.
    3. Magnus Reif, 2020. "Macroeconomics, Nonlinearities, and the Business Cycle," ifo Beiträge zur Wirtschaftsforschung, ifo Institute - Leibniz Institute for Economic Research at the University of Munich, number 87.
    4. Radwa Ahmed Osman & Sherine Nagy Saleh & Yasmine N. M. Saleh & Mazen Nabil Elagamy, 2021. "A Reliable and Efficient Tracking System Based on Deep Learning for Monitoring the Spread of COVID-19 in Closed Areas," IJERPH, MDPI, vol. 18(24), pages 1-20, December.
    5. Chris Reimann, 2024. "Predicting financial crises: an evaluation of machine learning algorithms and model explainability for early warning systems," Review of Evolutionary Political Economy, Springer, vol. 5(1), pages 51-83, June.
    6. Denis Shibitov & Mariam Mamedli, 2021. "Forecasting Russian Cpi With Data Vintages And Machine Learning Techniques," Bank of Russia Working Paper Series wps70, Bank of Russia.
    7. Gupta, Rangan & Pierdzioch, Christian & Vivian, Andrew J. & Wohar, Mark E., 2019. "The predictive value of inequality measures for stock returns: An analysis of long-span UK data using quantile random forests," Finance Research Letters, Elsevier, vol. 29(C), pages 315-322.
    8. Richardson, Adam & van Florenstein Mulder, Thomas & Vehbi, Tuğrul, 2021. "Nowcasting GDP using machine-learning algorithms: A real-time assessment," International Journal of Forecasting, Elsevier, vol. 37(2), pages 941-948.
    9. Zihao Wang & Kun Li & Steve Q. Xia & Hongfu Liu, 2021. "Economic Recession Prediction Using Deep Neural Network," Papers 2107.10980, arXiv.org.
    10. Paulino José Garcia Nieto & Esperanza García Gonzalo & Fernando Sanchez Lasheras & Antonio Bernardo Sánchez, 2020. "A Hybrid Predictive Approach for Chromium Layer Thickness in the Hard Chromium Plating Process Based on the Differential Evolution/Gradient Boosted Regression Tree Methodology," Mathematics, MDPI, vol. 8(6), pages 1-20, June.
    11. Foltas, Alexander, 2023. "Quantifying priorities in business cycle reports: Analysis of recurring textual patterns around peaks and troughs," Working Papers 44, German Research Foundation's Priority Programme 1859 "Experience and Expectation. Historical Foundations of Economic Behaviour", Humboldt University Berlin.
    12. Hwang, Youngjin, 2019. "Forecasting recessions with time-varying models," Journal of Macroeconomics, Elsevier, vol. 62(C).
    13. Marco Taboga, 2022. "Cross-country differences in the size of venture capital financing rounds: a machine learning approach," Empirical Economics, Springer, vol. 62(3), pages 991-1012, March.
    14. Seulki Chung, 2023. "Real-time Prediction of the Great Recession and the Covid-19 Recession," Papers 2310.08536, arXiv.org, revised May 2024.
    15. Proaño, Christian R. & Tarassow, Artur, 2018. "Evaluating the predicting power of ordered probit models for multiple business cycle phases in the U.S. and Japan," Journal of the Japanese and International Economies, Elsevier, vol. 50(C), pages 60-71.
    16. Yahia Mutalib Tofiq & Sarmad Dashti Latif & Ali Najah Ahmed & Pavitra Kumar & Ahmed El-Shafie, 2022. "Optimized Model Inputs Selections for Enhancing River Streamflow Forecasting Accuracy Using Different Artificial Intelligence Techniques," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 36(15), pages 5999-6016, December.
    17. Bluwstein, Kristina & Buckmann, Marcus & Joseph, Andreas & Kapadia, Sujit & Şimşek, Özgür, 2023. "Credit growth, the yield curve and financial crisis prediction: Evidence from a machine learning approach," Journal of International Economics, Elsevier, vol. 145(C).
    18. Youngju Kim & Hoyeop Lee & Chang Ouk Kim, 2023. "A variational autoencoder for a semiconductor fault detection model robust to process drift due to incomplete maintenance," Journal of Intelligent Manufacturing, Springer, vol. 34(2), pages 529-540, February.
    19. Tim Meyer, 2019. "On the Directional Accuracy of United States Housing Starts Forecasts: Evidence from Survey Data," The Journal of Real Estate Finance and Economics, Springer, vol. 58(3), pages 457-488, April.
    20. Behrens, Christoph, 2019. "Evaluating the Joint Efficiency of German Trade Forecasts. A nonparametric multivariate approach," Working Papers 9, German Research Foundation's Priority Programme 1859 "Experience and Expectation. Historical Foundations of Economic Behaviour", Humboldt University Berlin.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:sae:intdis:v:18:y:2022:i:3:p:15501477211049910. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: SAGE Publications (email available below). General contact details of provider: .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.