IDEAS home Printed from https://ideas.repec.org/a/gam/jftint/v14y2022i10p289-d937541.html
   My bibliography  Save this article

A Self-Supervised Learning Model for Unknown Internet Traffic Identification Based on Surge Period

Author

Listed:
  • Dawei Wei

    (School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing 100083, China)

  • Feifei Shi

    (School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing 100083, China)

  • Sahraoui Dhelim

    (School of Computer Science, University College Dublin, Belfield, D04 V1W8 Dublin, Ireland)

Abstract

The identification of Internet protocols provides a significant basis for keeping Internet security and improving Internet Quality of Service (QoS). However, the overwhelming developments and updating of Internet technologies and protocols have led to large volumes of unknown Internet traffic, which threaten the safety of the network environment a lot. Since most of the unknown Internet traffic does not have any labels, it is difficult to adopt deep learning directly. Additionally, the feature accuracy and identification model also impact the identification accuracy a lot. In this paper, we propose a surge period-based feature extraction method that helps remove the negative influence of background traffic in network sessions and acquire as many traffic flow features as possible. In addition, we also establish an identification model of unknown Internet traffic based on JigClu, the self-supervised learning approach to training unlabeled datasets. It finally combines with the clustering method and realizes the further identification of unknown Internet traffic. The model has been demonstrated with an accuracy of no less than 74% in identifying unknown Internet traffic with the public dataset ISCXVPN2016 under different scenarios. The work provides a novel solution for unknown Internet traffic identification, which is the most difficult task in identifying Internet traffic. We believe it is a great leap in Internet traffic identification and is of great significance to maintaining the security of the network environment.

Suggested Citation

  • Dawei Wei & Feifei Shi & Sahraoui Dhelim, 2022. "A Self-Supervised Learning Model for Unknown Internet Traffic Identification Based on Surge Period," Future Internet, MDPI, vol. 14(10), pages 1-16, October.
  • Handle: RePEc:gam:jftint:v:14:y:2022:i:10:p:289-:d:937541
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/1999-5903/14/10/289/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/1999-5903/14/10/289/
    Download Restriction: no
    ---><---

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Sreejith Alathur & Naganna Chetty & Rajesh R. Pai & Vishal Kumar & Sahraoui Dhelim, 2022. "Hate and False Metaphors: Implications to Emerging E-Participation Environment," Future Internet, MDPI, vol. 14(11), pages 1-10, October.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jftint:v:14:y:2022:i:10:p:289-:d:937541. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.