IDEAS home Printed from https://ideas.repec.org/a/sae/intdis/v18y2022i3p15501477211049910.html
   My bibliography  Save this article

A novel and highly efficient botnet detection algorithm based on network traffic analysis of smart systems

Author

Listed:
  • Li Duan
  • Jingxian Zhou
  • You Wu
  • Wenyao Xu

Abstract

In smart systems, attackers can use botnets to launch different cyber attack activities against the Internet of Things. The traditional methods of detecting botnets commonly used machine learning algorithms, and it is difficult to detect and control botnets in a network because of unbalanced traffic data. In this article, we present a novel and highly efficient botnet detection method based on an autoencoder neural network in cooperation with decision trees on a given network. The deep flow inspection method and statistical analysis are first applied as a feature selection technique to select relevant features, which are used to characterize the communication-related behavior between network nodes. Then, the autoencoder neural network for feature selection is used to improve the efficiency of model construction. Finally, Tomek-Recursion Borderline Synthetic Minority Oversampling Technique generates additional minority samples to achieve class balance, and an improved gradient boosting decision tree algorithm is used to train and establish an abnormal traffic detection model to improve the detection of unbalanced botnet data. The results of experiments on the ISCX-botnet traffic dataset show that the proposed method achieved better botnet detection performance with 99.10% recall, 99.20% accuracy, 99.1% F1 score, and 99.0% area under the curve.

Suggested Citation

  • Li Duan & Jingxian Zhou & You Wu & Wenyao Xu, 2022. "A novel and highly efficient botnet detection algorithm based on network traffic analysis of smart systems," International Journal of Distributed Sensor Networks, , vol. 18(3), pages 15501477211, March.
  • Handle: RePEc:sae:intdis:v:18:y:2022:i:3:p:15501477211049910
    DOI: 10.1177/15501477211049910
    as

    Download full text from publisher

    File URL: https://journals.sagepub.com/doi/10.1177/15501477211049910
    Download Restriction: no

    File URL: https://libkey.io/10.1177/15501477211049910?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Wo Jae Lee & Gamini P. Mendis & Matthew J. Triebe & John W. Sutherland, 2020. "Monitoring of a machining process using kernel principal component analysis and kernel density estimation," Journal of Intelligent Manufacturing, Springer, vol. 31(5), pages 1175-1189, June.
    2. Döpke, Jörg & Fritsche, Ulrich & Pierdzioch, Christian, 2017. "Predicting recessions with boosted regression trees," International Journal of Forecasting, Elsevier, vol. 33(4), pages 745-759.
    3. Fariba Haddadi & A. Nur Zincir‐Heywood, 2017. "Botnet behaviour analysis: How would a data analytics‐based system with minimum a priori information perform?," International Journal of Network Management, John Wiley & Sons, vol. 27(4), July.
    4. Ruchi Vishwakarma & Ankit Kumar Jain, 2020. "A survey of DDoS attacking techniques and defence mechanisms in the IoT network," Telecommunication Systems: Modelling, Analysis, Design and Management, Springer, vol. 73(1), pages 3-25, January.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Carstensen, Kai & Heinrich, Markus & Reif, Magnus & Wolters, Maik H., 2020. "Predicting ordinary and severe recessions with a three-state Markov-switching dynamic factor model," International Journal of Forecasting, Elsevier, vol. 36(3), pages 829-850.
    2. Philippe Goulet Coulombe & Maxime Leroux & Dalibor Stevanovic & Stéphane Surprenant, 2022. "How is machine learning useful for macroeconomic forecasting?," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 37(5), pages 920-964, August.
    3. Magnus Reif, 2020. "Macroeconomics, Nonlinearities, and the Business Cycle," ifo Beiträge zur Wirtschaftsforschung, ifo Institute - Leibniz Institute for Economic Research at the University of Munich, number 87.
    4. Sachin Kumar & T. Gopi & N. Harikeerthana & Munish Kumar Gupta & Vidit Gaur & Grzegorz M. Krolczyk & ChuanSong Wu, 2023. "Machine learning techniques in additive manufacturing: a state of the art review on design, processes and production control," Journal of Intelligent Manufacturing, Springer, vol. 34(1), pages 21-55, January.
    5. Radwa Ahmed Osman & Sherine Nagy Saleh & Yasmine N. M. Saleh & Mazen Nabil Elagamy, 2021. "A Reliable and Efficient Tracking System Based on Deep Learning for Monitoring the Spread of COVID-19 in Closed Areas," IJERPH, MDPI, vol. 18(24), pages 1-20, December.
    6. Behrens, Christoph, 2020. "German trade forecasts since 1970: An evaluation using the panel dimension," Working Papers 26, German Research Foundation's Priority Programme 1859 "Experience and Expectation. Historical Foundations of Economic Behaviour", Humboldt University Berlin.
    7. Nazemi, Abdolreza & Rezazadeh, Hani & Fabozzi, Frank J. & Höchstötter, Markus, 2022. "Deep learning for modeling the collection rate for third-party buyers," International Journal of Forecasting, Elsevier, vol. 38(1), pages 240-252.
    8. Chris Reimann, 2024. "Predicting financial crises: an evaluation of machine learning algorithms and model explainability for early warning systems," Review of Evolutionary Political Economy, Springer, vol. 5(1), pages 51-83, June.
    9. Vrontos, Spyridon D. & Galakis, John & Vrontos, Ioannis D., 2021. "Modeling and predicting U.S. recessions using machine learning techniques," International Journal of Forecasting, Elsevier, vol. 37(2), pages 647-671.
    10. Shahram Fattahi & Kiomars Sohaili & Hamed Monkaresi & Fatemeh Mehrabi, 2017. "Modelling and Forecasting Recessions in Oil-exporting Countries: The Case of Iran," International Journal of Economics and Financial Issues, Econjournals, vol. 7(3), pages 569-574.
    11. Denis Shibitov & Mariam Mamedli, 2021. "Forecasting Russian Cpi With Data Vintages And Machine Learning Techniques," Bank of Russia Working Paper Series wps70, Bank of Russia.
    12. Gupta, Rangan & Pierdzioch, Christian & Vivian, Andrew J. & Wohar, Mark E., 2019. "The predictive value of inequality measures for stock returns: An analysis of long-span UK data using quantile random forests," Finance Research Letters, Elsevier, vol. 29(C), pages 315-322.
    13. Marcus Buckmann & Andy Haldane & Anne-Caroline Hüser, 2021. "Comparing minds and machines: implications for financial stability," Oxford Review of Economic Policy, Oxford University Press and Oxford Review of Economic Policy Limited, vol. 37(3), pages 479-508.
    14. Emrich Eike & Pierdzioch Christian, 2016. "Public Goods, Private Consumption, and Human Capital: Using Boosted Regression Trees to Model Volunteer Labour Supply," Review of Economics, De Gruyter, vol. 67(3), pages 263-283, December.
    15. Xuejun Zhao & Yong Qin & Changbo He & Limin Jia, 2022. "Underdetermined blind source extraction of early vehicle bearing faults based on EMD and kernelized correlation maximization," Journal of Intelligent Manufacturing, Springer, vol. 33(1), pages 185-201, January.
    16. Lauri Nevasalmi, 2022. "Recession forecasting with high‐dimensional data," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 41(4), pages 752-764, July.
    17. Rademacher, Philip, 2024. "Forecasting recessions in Germany with machine learning," DICE Discussion Papers 416, Heinrich Heine University Düsseldorf, Düsseldorf Institute for Competition Economics (DICE).
    18. Proaño, Christian R. & Tarassow, Artur, 2018. "Evaluating the predicting power of ordered probit models for multiple business cycle phases in the U.S. and Japan," Journal of the Japanese and International Economies, Elsevier, vol. 50(C), pages 60-71.
    19. Lahiri, Kajal & Yang, Cheng, 2022. "Boosting tax revenues with mixed-frequency data in the aftermath of COVID-19: The case of New York," International Journal of Forecasting, Elsevier, vol. 38(2), pages 545-566.
    20. Jaehyuk Choi & Desheng Ge & Kyu Ho Kang & Sungbin Sohn, 2023. "Yield spread selection in predicting recession probabilities," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 42(7), pages 1772-1785, November.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:sae:intdis:v:18:y:2022:i:3:p:15501477211049910. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: SAGE Publications (email available below). General contact details of provider: .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.