IDEAS home Printed from https://ideas.repec.org/a/eme/jfcpps/jfc-09-2022-0227.html
   My bibliography  Save this article

Two decades of financial statement fraud detection literature review; combination of bibliometric analysis and topic modeling approach

Author

Listed:
  • Milad Soltani
  • Alexios Kythreotis
  • Arash Roshanpoor

Abstract

Purpose - The emergence of machine learning has opened a new way for researchers. It allows them to supplement the traditional manual methods for conducting a literature review and turning it into smart literature. This study aims to present a framework for incorporating machine learning into financial statement fraud (FSF) literature analysis. This framework facilitates the analysis of a large amount of literature to show the trend of the field and identify the most productive authors, journals and potential areas for future research. Design/methodology/approach - In this study, a framework was introduced that merges bibliometric analysis techniques such as word frequency, co-word analysis and coauthorship analysis with the Latent Dirichlet Allocation topic modeling approach. This framework was used to uncover subtopics from 20 years of financial fraud research articles. Furthermore, the hierarchical clustering method was used on selected subtopics to demonstrate the primary contexts in the literature on FSF. Findings - This study has contributed to the literature in two ways. First, this study has determined the top journals, articles, countries and keywords based on various bibliometric metrics. Second, using topic modeling and then hierarchy clustering, this study demonstrates the four primary contexts in FSF detection. Research limitations/implications - In this study, the authors tried to comprehensively view the studies related to financial fraud conducted over two decades. However, this research has limitations that can be an opportunity for future researchers. The first limitation is due to language bias. This study has focused on English language articles, so it is suggested that other researchers consider other languages as well. The second limitation is caused by citation bias. In this study, the authors tried to show the top articles based on the citation criteria. However, judging based on citation alone can be misleading. Therefore, this study suggests that the researchers consider other measures to check the citation quality and assess the studies’ precision by applying meta-analysis. Originality/value - Despite the popularity of bibliometric analysis and topic modeling, there have been limited efforts to use machine learning for literature review. This novel approach of using hierarchical clustering on topic modeling results enable us to uncover four primary contexts. Furthermore, this method allowed us to show the keywords of each context and highlight significant articles within each context.

Suggested Citation

  • Milad Soltani & Alexios Kythreotis & Arash Roshanpoor, 2023. "Two decades of financial statement fraud detection literature review; combination of bibliometric analysis and topic modeling approach," Journal of Financial Crime, Emerald Group Publishing Limited, vol. 30(5), pages 1367-1388, April.
  • Handle: RePEc:eme:jfcpps:jfc-09-2022-0227
    DOI: 10.1108/JFC-09-2022-0227
    as

    Download full text from publisher

    File URL: https://www.emerald.com/insight/content/doi/10.1108/JFC-09-2022-0227/full/html?utm_source=repec&utm_medium=feed&utm_campaign=repec
    Download Restriction: no

    File URL: https://www.emerald.com/insight/content/doi/10.1108/JFC-09-2022-0227/full/pdf?utm_source=repec&utm_medium=feed&utm_campaign=repec
    Download Restriction: no

    File URL: https://libkey.io/10.1108/JFC-09-2022-0227?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eme:jfcpps:jfc-09-2022-0227. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Emerald Support (email available below). General contact details of provider: .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.