Printed from https://ideas.repec.org/a/eee/ijoais/v11y2010i3p157-181.html

Data mining journal entries for fraud detection: An exploratory study

Author

Listed:
  • Debreceny, Roger S.
  • Gray, Glen L.

Abstract

Fraud detection has become a critical component of financial audits, and audit standards have heightened the emphasis on journal entries as part of fraud detection. This paper canvasses perspectives on applying data mining techniques to journal entries. In the past, the main impediment to research on journal entry data mining has been access to journal entry data sets, which may explain why the published research in this area is a null set. For this project, we had access to journal entry data sets for 29 different organizations. Our initial exploratory tests of the data sets yielded three preliminary findings. (1) For all 29 entities, the distribution of first digits of journal dollar amounts differed from that expected under Benford's Law. (2) Unlike first digits, which are expected to follow a logarithmic distribution, last digits are expected to follow a uniform distribution. Our test found that the distribution was not uniform for many of the entities. In fact, eight entities had one digit whose frequency was three times more than expected. (3) We compared the number of accounts related to the five most frequently occurring combinations of the last three digits. Four entities had a very high occurrence of the most frequent three-digit combinations, involving only a small set of accounts; one entity had a low occurrence of the most frequent three-digit combinations, involving a large set of accounts; and 24 had a low occurrence of the most frequent three-digit combinations, involving a small set of accounts. In general, the first four entities would probably pose the highest risk of fraud, because this pattern could indicate that a fraudster is covering up or falsifying a particular class of transactions. In the future, we will apply more data mining techniques to discover other patterns and relationships in the data sets. We also want to seed the data sets with fraud indicators (e.g., pairs of accounts that would not be expected in a journal entry) and compare the sensitivity of the different data mining techniques in finding these seeded indicators.
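
The three exploratory tests described in the abstract can be sketched in a few lines of Python. This is a minimal illustration, not the authors' implementation: the abstract does not say which test statistic was used, so the chi-square goodness-of-fit statistic below is an assumption, as are the function names, the representation of amounts as integer cents, and the `(account, amount)` tuple layout.

```python
import math
from collections import Counter


def first_digit(amount_cents: int) -> int:
    """Leading digit of a non-zero integer amount (sign ignored)."""
    return int(str(abs(amount_cents))[0])


def benford_expected(d: int) -> float:
    """Benford's Law probability that the first digit equals d (1..9)."""
    return math.log10(1 + 1 / d)


def chi_square(observed, expected_probs, n: int) -> float:
    """Chi-square goodness-of-fit statistic for observed counts."""
    return sum(
        (observed.get(k, 0) - n * p) ** 2 / (n * p)
        for k, p in expected_probs.items()
    )


def first_digit_stat(amounts) -> float:
    """Test 1: first digits against Benford's logarithmic distribution."""
    amounts = [a for a in amounts if a != 0]  # zero has no leading digit
    counts = Counter(first_digit(a) for a in amounts)
    probs = {d: benford_expected(d) for d in range(1, 10)}
    return chi_square(counts, probs, len(amounts))


def last_digit_stat(amounts) -> float:
    """Test 2: last digits against a uniform distribution over 0..9."""
    amounts = list(amounts)
    counts = Counter(abs(a) % 10 for a in amounts)
    probs = {d: 0.1 for d in range(10)}
    return chi_square(counts, probs, len(amounts))


def top_endings(entries, k: int = 5):
    """Test 3: the k most frequent last-three-digit endings.

    entries is an iterable of (account, amount_cents) pairs (assumed layout).
    Returns a list of (ending, occurrences, distinct_accounts).
    """
    counts = Counter()
    accounts = {}
    for account, amount in entries:
        ending = abs(amount) % 1000
        counts[ending] += 1
        accounts.setdefault(ending, set()).add(account)
    return [(e, c, len(accounts[e])) for e, c in counts.most_common(k)]
```

Under these assumptions, the first-digit statistic would be compared against a chi-square distribution with 8 degrees of freedom and the last-digit statistic against one with 9. For the third test, an ending that occurs very often but touches only a few accounts corresponds to the pattern the abstract flags as the highest risk.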

Suggested Citation

  • Debreceny, Roger S. & Gray, Glen L., 2010. "Data mining journal entries for fraud detection: An exploratory study," International Journal of Accounting Information Systems, Elsevier, vol. 11(3), pages 157-181.
  • Handle: RePEc:eee:ijoais:v:11:y:2010:i:3:p:157-181
    DOI: 10.1016/j.accinf.2010.08.001

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S1467089510000540
    Download Restriction: Full text for ScienceDirect subscribers only

    References listed on IDEAS

    1. Andreas Diekmann, 2007. "Not the First Digit! Using Benford's Law to Detect Fraudulent Scientific Data," Journal of Applied Statistics, Taylor & Francis Journals, vol. 34(3), pages 321-329.
    2. Richard Mattessich, 2003. "Accounting research and researchers of the nineteenth century and the beginning of the twentieth century: an international survey of authors, ideas and publications," Accounting History Review, Taylor & Francis Journals, vol. 13(2), pages 125-170.
    3. Ijiri, Yuji & Kelly, Edward C., 1980. "Multidimensional accounting and distributed databases: Their implications for organizations and society," Accounting, Organizations and Society, Elsevier, vol. 5(1), pages 115-123, January.
    4. Hales, Douglas N. & Sridharan, V. & Radhakrishnan, Abirami & Chakravorty, Satya S. & Siha, Samia M., 2008. "Testing the accuracy of employee-reported data: An inexpensive alternative approach to traditional methods," European Journal of Operational Research, Elsevier, vol. 189(3), pages 583-593, September.
    5. Wallace, Wanda, 2000. "Reporting practices: potential lessons from Cendant Corporation," European Management Journal, Elsevier, vol. 18(3), pages 328-333, June.
    6. Anil Arya & John Fellingham & Doug Schroeder, 2000. "Accounting Information, Aggregation, and Discriminant Analysis," Management Science, INFORMS, vol. 46(6), pages 790-806, June.
    7. M.‐Y. Cheng & P. Hall, 1998. "Calibrating the excess mass and dip tests of modality," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 60(3), pages 579-589.

    Citations

    Cited by:

    1. Rabeea SADAF, 2016. "Benford’s Law In The Case Of Hungarian Whole-Sale Trade Sector," SEA - Practical Application of Science, Romanian Foundation for Business Intelligence, Editorial Department, issue 12, pages 561-566, December.
    2. Yoon, Kyunghee & Liu, Yue & Chiu, Tiffany & Vasarhelyi, Miklos A., 2021. "Design and evaluation of an advanced continuous data level auditing system: A three-layer structure," International Journal of Accounting Information Systems, Elsevier, vol. 42(C).
    3. Marco Schreyer & Timur Sattarov & Christian Schulze & Bernd Reimer & Damian Borth, 2019. "Detection of Accounting Anomalies in the Latent Space using Adversarial Autoencoder Neural Networks," Papers 1908.00734, arXiv.org.
    4. Alles, Michael & Gray, Glen L., 2016. "Incorporating big data in audits: Identifying inhibitors and a research agenda to address those inhibitors," International Journal of Accounting Information Systems, Elsevier, vol. 22(C), pages 44-59.
    5. Fábio Albuquerque & Paula Gomes Dos Santos, 2023. "Recent Trends in Accounting and Information System Research: A Literature Review Using Textual Analysis Tools," FinTech, MDPI, vol. 2(2), pages 1-27, April.
    6. Amani, Farzaneh A. & Fadlalla, Adam M., 2017. "Data mining applications in accounting: A review of the literature and organizing framework," International Journal of Accounting Information Systems, Elsevier, vol. 24(C), pages 32-58.
    7. Montag, Josef, 2017. "Identifying odometer fraud in used car market data," Transport Policy, Elsevier, vol. 60(C), pages 10-23.
    8. Kishore Singh & Peter Best, 2020. "Implementing Benford’s Law in Continuous Monitoring Applications," Journal of Accounting and Management Information Systems, Faculty of Accounting and Management Information Systems, The Bucharest University of Economic Studies, vol. 19(2), pages 379-404, June.
    9. Pizzi, Simone & Venturelli, Andrea & Variale, Michele & Macario, Giuseppe Pio, 2021. "Assessing the impacts of digital transformation on internal auditing: A bibliometric analysis," Technology in Society, Elsevier, vol. 67(C).
    10. Stratopoulos, Theophanis C. & Vance, Tom W. & Zou, Xiorong, 2013. "Incentive effects of enterprise systems on the magnitude and detectability of reporting manipulations," International Journal of Accounting Information Systems, Elsevier, vol. 14(1), pages 39-57.
    11. Chen, Yuh-Jen & Wu, Chun-Han & Chen, Yuh-Min & Li, Hsin-Ying & Chen, Huei-Kuen, 2017. "Enhancement of fraud detection for narratives in annual reports," International Journal of Accounting Information Systems, Elsevier, vol. 26(C), pages 32-45.
    12. Werner, Michael, 2017. "Financial process mining - Accounting data structure dependent control flow inference," International Journal of Accounting Information Systems, Elsevier, vol. 25(C), pages 57-80.
    13. Fay, Rebecca & Negangard, Eric M., 2017. "Manual journal entry testing: Data analytics and the risk of fraud," Journal of Accounting Education, Elsevier, vol. 38(C), pages 37-49.
    14. de Araújo Silva, Archibald & Aparecida Gouvêa, Maria, 2023. "Study on the effect of sample size on type I error, in the first, second and first-two digits excessmad tests," International Journal of Accounting Information Systems, Elsevier, vol. 48(C).
    15. Montag, Josef, 2015. "Identifying Odometer Fraud: Evidence from the Used Car Market in the Czech Republic," MPRA Paper 65182, University Library of Munich, Germany.
    16. Ricardo Sartori Cella & Ercilio Zanolla, 2018. "Benford’s Law and transparency: an analysis of municipal expenditure," Brazilian Business Review, Fucape Business School, vol. 15(4), pages 331-347, July.
    17. Pall Rikhardsson & Kishore Singh & Peter Best, 2019. "Exploring Continuous Auditing Solutions and Internal Auditing: A Research Note," Journal of Accounting and Management Information Systems, Faculty of Accounting and Management Information Systems, The Bucharest University of Economic Studies, vol. 18(4), pages 614-639, December.
    18. Gray, Glen L. & Debreceny, Roger S., 2014. "A taxonomy to guide research on the application of data mining to fraud detection in financial statement audits," International Journal of Accounting Information Systems, Elsevier, vol. 15(4), pages 357-380.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Theoharry Grammatikos & Nikolaos I. Papanikolaou, 2021. "Applying Benford’s Law to Detect Accounting Data Manipulation in the Banking Industry," Journal of Financial Services Research, Springer;Western Finance Association, vol. 59(1), pages 115-142, April.
    2. Lee, Kang-Bok & Han, Sumin & Jeong, Yeasung, 2020. "COVID-19, flattening the curve, and Benford’s law," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 559(C).
    3. Sitsofe Tsagbey & Miguel de Carvalho & Garritt L. Page, 2017. "All Data are Wrong, but Some are Useful? Advocating the Need for Data Auditing," The American Statistician, Taylor & Francis Journals, vol. 71(3), pages 231-235, July.
    4. Suren Basov & Svetlana Danilkina & David Prentice, 2020. "When Does Variety Increase with Quality?," Review of Industrial Organization, Springer;The Industrial Organization Society, vol. 56(3), pages 463-487, May.
    5. Kedslie, Moyra J.M. & Leech, Stewart A., 1992. "Professor Michael Mepham's contribution to accounting research: A review essay," The British Accounting Review, Elsevier, vol. 24(3), pages 269-279.
    6. Anil Arya & John Fellingham & Doug Schroeder & Jonathan Glover, 2002. "Depreciation in a model of probabilistic investment," European Accounting Review, Taylor & Francis Journals, vol. 11(4), pages 681-697.
    7. Giuseppe Marzo & Yannick Tazzari & Stefano Bonnini, 2020. "Benford’s Law: genesi, letteratura e applicazioni empiriche," Working Papers 2020019, University of Ferrara, Department of Economics.
    8. Jose Ameijeiras-Alonso & Rosa M. Crujeiras & Alberto Rodríguez-Casal, 2019. "Mode testing, critical bandwidth and excess mass," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 28(3), pages 900-919, September.
    9. Diekmann Andreas & Jann Ben, 2010. "Benford’s Law and Fraud Detection: Facts and Legends," German Economic Review, De Gruyter, vol. 11(3), pages 397-401, August.
    10. Chase Thiel & Zhanna Bagdasarov & Lauren Harkrider & James Johnson & Michael Mumford, 2012. "Leader Ethical Decision-Making in Organizations: Strategies for Sensemaking," Journal of Business Ethics, Springer, vol. 107(1), pages 49-64, April.
    11. Feng Zhu, 2005. "A nonparametric analysis of the shape dynamics of the US personal income distribution: 1962-2000," BIS Working Papers 184, Bank for International Settlements.
    12. Montag, Josef, 2017. "Identifying odometer fraud in used car market data," Transport Policy, Elsevier, vol. 60(C), pages 10-23.
    13. Adriano Silva & Sergio Floquet & Ricardo Lima, 2023. "Newcomb–Benford’s Law in Neuromuscular Transmission: Validation in Hyperkalemic Conditions," Stats, MDPI, vol. 6(4), pages 1-19, October.
    14. Koch, Christoffer & Okamura, Ken, 2020. "Benford’s Law and COVID-19 reporting," Economics Letters, Elsevier, vol. 196(C).
    15. Wang, Xiaogang & Qiu, Weiliang & Zamar, Ruben H., 2007. "CLUES: A non-parametric clustering method based on local shrinking," Computational Statistics & Data Analysis, Elsevier, vol. 52(1), pages 286-298, September.
    16. Montag, Josef, 2015. "Identifying Odometer Fraud: Evidence from the Used Car Market in the Czech Republic," MPRA Paper 65182, University Library of Munich, Germany.
    17. Jalan, Akanksha & Matkovskyy, Roman & Yarovaya, Larisa, 2021. "“Shiny” crypto assets: A systemic look at gold-backed cryptocurrencies during the COVID-19 pandemic," International Review of Financial Analysis, Elsevier, vol. 78(C).
    18. Li Lin, 2024. "Quantum Probability Theoretic Asset Return Modeling: A Novel Schrödinger-Like Trading Equation and Multimodal Distribution," Papers 2401.05823, arXiv.org.
    19. Juan Miguel Campanario & María Angeles Coslado, 2011. "Benford’s law and citations, articles and impact factors of scientific journals," Scientometrics, Springer;Akadémiai Kiadó, vol. 88(2), pages 421-432, August.
    20. Schräpler Jörg-Peter, 2011. "Benford’s Law as an Instrument for Fraud Detection in Surveys Using the Data of the Socio-Economic Panel (SOEP)," Journal of Economics and Statistics (Jahrbuecher fuer Nationaloekonomie und Statistik), De Gruyter, vol. 231(5-6), pages 685-718, October.
