IDEAS home Printed from https://ideas.repec.org/a/ora/journl/v2y2023i2p67-75.html
   My bibliography  Save this article

Decision Tree Or Logistic Regression - Which Basic Model Is Better?

Author

Listed:
  • Kitti Fodor

    (Department of Business Statistics and Economic Forecasting, Faculty of Economics, University of Miskolc, Miskolc, Hungary)

Abstract

In this paper, my aim is to show which of the data in the Central Credit Information System are the ones that influence the factors that are then used to perform the analysis using a decision tree and logistic regression, and I would like to know, which of the two basic model is the better one. For the analyses, I used a random sample of 500 items, reflecting the proportions of performing and nonperforming loans in the population. For both methods, one variable was found to be significant, which was the ratio of the repayment to the contract amount, so this is the most significant of the data recorded by the Central Credit Information System in terms of loan defaults. If I compare the two methods, I can conclude that both methods have a high level of accuracy, but logistic regression is the one that produced better results, as it was able to identify a higher proportion of defaulted loans. Unfortunately, the decision tree could not identify any defaulting loans despite its higher classification accuracy. The reason can be the unfavourable sample composition. Finally, the logistic regression was able to categorize the transactions with 81,1% accuracy and has better AUC value and better value for Gini coefficients.

Suggested Citation

  • Kitti Fodor, 2023. "Decision Tree Or Logistic Regression - Which Basic Model Is Better?," Annals of Faculty of Economics, University of Oradea, Faculty of Economics, vol. 32(2), pages 67-75, December.
  • Handle: RePEc:ora:journl:v:2:y:2023:i:2:p:67-75
    as

    Download full text from publisher

    File URL: https://anale.steconomiceuoradea.ro/en/wp-content/uploads/2024/03/Volume-2_AUOES_december-2023-70-78.pdf
    Download Restriction: no
    ---><---

    More about this item

    Keywords

    loan default; decision tree; logistic regression; random sample; classification; ROC curve;
    All these keywords.

    JEL classification:

    • B16 - Schools of Economic Thought and Methodology - - History of Economic Thought through 1925 - - - Quantitative and Mathematical
    • C38 - Mathematical and Quantitative Methods - - Multiple or Simultaneous Equation Models; Multiple Variables - - - Classification Methdos; Cluster Analysis; Principal Components; Factor Analysis
    • C44 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods: Special Topics - - - Operations Research; Statistical Decision Theory
    • C53 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Forecasting and Prediction Models; Simulation Methods

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:ora:journl:v:2:y:2023:i:2:p:67-75. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catalin ZMOLE (email available below). General contact details of provider: https://edirc.repec.org/data/feoraro.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.