IDEAS home Printed from https://ideas.repec.org/a/ist/iujepr/v11y2024i2p242-262.html
   My bibliography  Save this article

Predicting Credit Card Fraud using Supervised Machine Learning Methods: Comparative Analysis

Author

Listed:
  • Güner Altan

    (İstanbul-Türkiye)

  • Metin Recep Zafer

    (İstanbul-Türkiye)

Abstract

Currently, with the progress of technology, people’s and institutions’ range of expenditure channels via digital platforms has expanded. In addition, payment methods have become easier with the digital age. An expenditure, made from even a distant corner of the World, takes place instantaneously through the Internet. Although the rapid and global nature of digitisation contains many advantages, ensuring transaction security can be challenging. In this context, banks have undoubtedly become the most crucial institutions that mediate safe transactions between customers and sellers. In an era where credit card transactions are so prevalent, it is seen as a problem that needs to be solved by banks to determine whether these transactions involve fraud or not, both for their profitability and reputation. It takes a serious effort to determine that credit card expenditures, characterised by dynamic nature, are real expenses of the customer. Therefore, the aim of this study is to propose a model based on supervised machine learning with using real and current data with a few key features. The objective is to reduce banks’ operational burden and cost when identifying credit card fraud. In this context, the credit card transactions of a state-owned bank in January 2023 were considered, using a dataset comprising 13,050 observations. Python programming language is used for model building, and classification algorithms with high discriminatory power, such as Random Forest, Logistic Regression, K-Nearest Neighbours, Decision Trees, and Gradient Boosting, are preferred, which are machine learning techniques. The accuracy scores of the algorithms used in the model setup were determined as follows: Logistic Regression, 92.5%; Decision Tree, 93.1%; K-Nearest Neighbour 86.4%; Random Forest 91.8% and Gradient Boosting 86.9% and performance metrics, such as precision, recall, F1 score, and ROC-AUC, were also examined. Based on their performances, five algorithms were recommended for this study.

Suggested Citation

  • Güner Altan & Metin Recep Zafer, 2024. "Predicting Credit Card Fraud using Supervised Machine Learning Methods: Comparative Analysis," Journal of Economic Policy Researches, Istanbul University, Faculty of Economics, vol. 11(2), pages 242-262, July.
  • Handle: RePEc:ist:iujepr:v:11:y:2024:i:2:p:242-262
    DOI: 10.26650/JEPR1433315
    as

    Download full text from publisher

    File URL: https://cdn.istanbul.edu.tr/file/JTA6CLJ8T5/2B11EE1642DA46B7B55F9D37AAE8FBEF
    Download Restriction: no

    File URL: https://iupress.istanbul.edu.tr/en/journal/jepr/article/denetimli-makine-ogrenmesi-yontemleri-ile-kredi-karti-sahteciligini-tahmin-etme-karsilastirmali-analiz
    Download Restriction: no

    File URL: https://libkey.io/10.26650/JEPR1433315?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    More about this item

    Keywords

    Credit card fraud; Machine learning; Supervised learning; Random forest; Gradient boosting JEL Classification : C60 ; C69 ; C81;
    All these keywords.

    JEL classification:

    • C60 - Mathematical and Quantitative Methods - - Mathematical Methods; Programming Models; Mathematical and Simulation Modeling - - - General
    • C69 - Mathematical and Quantitative Methods - - Mathematical Methods; Programming Models; Mathematical and Simulation Modeling - - - Other
    • C81 - Mathematical and Quantitative Methods - - Data Collection and Data Estimation Methodology; Computer Programs - - - Methodology for Collecting, Estimating, and Organizing Microeconomic Data; Data Access

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:ist:iujepr:v:11:y:2024:i:2:p:242-262. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Ertugrul YASAR (email available below). General contact details of provider: https://edirc.repec.org/data/ifisttr.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.