IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2011.09137.html
   My bibliography  Save this paper

Principal Component Analysis and Factor Analysis for Feature Selection in Credit Rating

Author

Listed:
  • Shenghuan Yang
  • lonut Florescu
  • Md Tariqul Islam

Abstract

The credit rating is an evaluation of a company's credit risk that values the ability to pay back the debt and predict the likelihood of the debtor defaulting. There are various features influencing credit rating. Therefore, it is essential to select substantive features to explore the main reason for credit rating change. To address this issue, this paper exploited Principal Component Analysis and Factor Analysis as feature selection algorithms to select important features, summarized the similar features together, and obtained a minimum set of features for four sectors, Financial Sector, Energy Sector, Health Care Sector, Consumer Discretionary Sector. This paper used two data sets, Financial Ratio and Balance Sheet, with two mappings, Detailed Mapping, and Coarse Mapping, converting the target variable(credit rating) into categorical variable. To test the accuracy of credit rating prediction, Random Forest Classifier was used to test and train feature sets. The results showed that the accuracy of Financial Ratio feature sets was higher than that of Balance Sheet feature sets. In addition, Factor Analysis can reduce the number of features significantly to obtain almost the same accuracy that can decrease dramatically the time spent on analyzing data; we also summarized seven dominant factors and ten dominant factors affecting credit rating change in Financial Ratio and Balance Sheet by utilizing Factor Analysis, respectively, which can explain the reason of credit rating change better.

Suggested Citation

  • Shenghuan Yang & lonut Florescu & Md Tariqul Islam, 2020. "Principal Component Analysis and Factor Analysis for Feature Selection in Credit Rating," Papers 2011.09137, arXiv.org, revised Dec 2020.
  • Handle: RePEc:arx:papers:2011.09137
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2011.09137
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Parisa Golbayani & Ionuc{t} Florescu & Rupak Chatterjee, 2020. "A comparative study of forecasting Corporate Credit Ratings using Neural Networks, Support Vector Machines, and Decision Trees," Papers 2007.06617, arXiv.org.
    2. Parisa Golbayani & Dan Wang & Ionut Florescu, 2020. "Application of Deep Neural Networks to assess corporate Credit Rating," Papers 2003.02334, arXiv.org.
    3. Dan Wang & Tianrui Wang & Ionuc{t} Florescu, 2020. "Is Image Encoding Beneficial for Deep Learning in Finance? An Analysis of Image Encoding Methods for the Application of Convolutional Neural Networks in Finance," Papers 2010.08698, arXiv.org.
    4. Golbayani, Parisa & Florescu, Ionuţ & Chatterjee, Rupak, 2020. "A comparative study of forecasting corporate credit ratings using neural networks, support vector machines, and decision trees," The North American Journal of Economics and Finance, Elsevier, vol. 54(C).
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Dan Wang & Zhi Chen & Ionut Florescu, 2021. "A Sparsity Algorithm with Applications to Corporate Credit Rating," Papers 2107.10306, arXiv.org.
    2. Wang, Dan & Chen, Zhi & Florescu, Ionuţ & Wen, Bingyang, 2023. "A sparsity algorithm for finding optimal counterfactual explanations: Application to corporate credit rating," Research in International Business and Finance, Elsevier, vol. 64(C).
    3. Bojing Feng & Wenfang Xue & Bindang Xue & Zeyu Liu, 2020. "Every Corporation Owns Its Image: Corporate Credit Ratings via Convolutional Neural Networks," Papers 2012.03744, arXiv.org.
    4. Davidescu Adriana AnaMaria & Agafiței Marina-Diana & Strat Vasile Alecsandru & Dima Alina Mihaela, 2024. "Mapping the Landscape: A Bibliometric Analysis of Rating Agencies in the Era of Artificial Intelligence and Machine Learning," Proceedings of the International Conference on Business Excellence, Sciendo, vol. 18(1), pages 67-85.
    5. Barboza, Flavio & Altman, Edward, 2024. "Predicting financial distress in Latin American companies: A comparative analysis of logistic regression and random forest models," The North American Journal of Economics and Finance, Elsevier, vol. 72(C).
    6. Kim, Jong-Min & Kim, Dong H. & Jung, Hojin, 2021. "Applications of machine learning for corporate bond yield spread forecasting," The North American Journal of Economics and Finance, Elsevier, vol. 58(C).
    7. Goldmann, Leonie & Crook, Jonathan & Calabrese, Raffaella, 2024. "A new ordinal mixed-data sampling model with an application to corporate credit rating levels," European Journal of Operational Research, Elsevier, vol. 314(3), pages 1111-1126.
    8. Koresh Galil & Ami Hauptman & Rosit Levy Rosenboim, 2023. "Prediction of Corporate Credit Ratings with Machine Learning: Simple Interpretative Models," Working Papers 2308, Ben-Gurion University of the Negev, Department of Economics.
    9. Seyyide Doğan & Yasin Büyükkör & Murat Atan, 2022. "A comparative study of corporate credit ratings prediction with machine learning," Operations Research and Decisions, Wroclaw University of Science and Technology, Faculty of Management, vol. 32(1), pages 25-47.
    10. Galil, Koresh & Hauptman, Ami & Rosenboim, Rosit Levy, 2023. "Prediction of corporate credit ratings with machine learning: Simple interpretative models," Finance Research Letters, Elsevier, vol. 58(PD).
    11. María Jesús Segovia‐Vargas & I. Marta Miranda‐García & Freddy Alejandro Oquendo‐Torres, 2023. "Sustainable finance: The role of savings and credit cooperatives in Ecuador," Annals of Public and Cooperative Economics, Wiley Blackwell, vol. 94(3), pages 951-980, September.
    12. Yu, Baojun & Li, Changming & Mirza, Nawazish & Umar, Muhammad, 2022. "Forecasting credit ratings of decarbonized firms: Comparative assessment of machine learning models," Technological Forecasting and Social Change, Elsevier, vol. 174(C).
    13. Dan Wang & Tianrui Wang & Ionuc{t} Florescu, 2020. "Is Image Encoding Beneficial for Deep Learning in Finance? An Analysis of Image Encoding Methods for the Application of Convolutional Neural Networks in Finance," Papers 2010.08698, arXiv.org.
    14. Kai Ren, 2023. "Study on Intelligent Forecasting of Credit Bond Default Risk," Papers 2305.12142, arXiv.org, revised Jun 2023.
    15. Mahsa Tavakoli & Rohitash Chandra & Fengrui Tian & Cristi'an Bravo, 2023. "Multi-Modal Deep Learning for Credit Rating Prediction Using Text and Numerical Data Streams," Papers 2304.10740, arXiv.org, revised Sep 2023.
    16. Helmut Wasserbacher & Martin Spindler, 2024. "Credit Ratings: Heterogeneous Effect on Capital Structure," Papers 2406.18936, arXiv.org.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2011.09137. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.