IDEAS home Printed from https://ideas.repec.org/a/pkp/rocere/v9y2022i1p13-29id2983.html
   My bibliography  Save this article

Machine Learning and Deep Learning Based Phishing Websites Detection: The Current Gaps and Next Directions

Author

Listed:
  • Kibreab Adane
  • Berhanu Beyene

Abstract

There are many phishing websites detection techniques in literature, namely white-listing, black-listing, visual-similarity, heuristic-based, and others. However, detecting zero-hour or newly designed phishing website attacks is an inherent property of machine learning and deep learning techniques. By considering a promising solution of machine learning and deep learning techniques, researchers have made a great deal of effort to tackle the this problem, which persists due to attackers constantly devising novel strategies to exploit vulnerability or gaps in existing anti-phishing measures. In this study, an extensive effort has been made to rigorously review recent studies focusing on Machine Learning and Deep Learning Based Phishing Websites Detection to excavate the root cause of the aforementioned problems and offer suitable solutions. The study followed the significant criterion to search, download, and screen relevant studies, then to evaluate criterion-based selected studies. The findings show that significant research gaps are available in the rigorously reviewed studies. These gaps are mainly related to imbalanced dataset usage, improper selection of dataset source(s), the unjustified reason for using specific train-test dataset split ratio, scientific disputes on website features inclusion and exclusion, lack of universal consensus on phishing website lifespans and on what is defining a small dataset size, and run-time analysis issues. The study clearly presented a summary of the comparative analysis performed on each reviewed research work so that future researchers could use it as a structured guideline to develop a novel solution for anti-phishing website attacks.

Suggested Citation

  • Kibreab Adane & Berhanu Beyene, 2022. "Machine Learning and Deep Learning Based Phishing Websites Detection: The Current Gaps and Next Directions," Review of Computer Engineering Research, Conscientia Beam, vol. 9(1), pages 13-29.
  • Handle: RePEc:pkp:rocere:v:9:y:2022:i:1:p:13-29:id:2983
    as

    Download full text from publisher

    File URL: https://archive.conscientiabeam.com/index.php/76/article/view/2983/6434
    Download Restriction: no

    File URL: https://archive.conscientiabeam.com/index.php/76/article/view/2983/6594
    Download Restriction: no
    ---><---

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:pkp:rocere:v:9:y:2022:i:1:p:13-29:id:2983. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Dim Michael (email available below). General contact details of provider: https://archive.conscientiabeam.com/index.php/76/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.