IDEAS home Printed from https://ideas.repec.org/a/vrs/stintr/v17y2016i2p265-280n13.html
   My bibliography  Save this article

Heteroscedastic Discriminant Analysis Combined with Feature Selection for Credit Scoring

Author

Listed:
  • Stąpor Katarzyna

    (Institute of Computer Science, Silesian University of Technology, Gliwice, Poland)

  • Smolarczyk Tomasz

    (Institute of Computer Science, Silesian University of Technology, Gliwice, Poland)

  • Fabian Piotr

    (Institute of Computer Science, Silesian University of Technology, Gliwice, Poland)

Abstract

Credit granting is a fundamental question and one of the most complex tasks that every credit institution is faced with. Typically, credit scoring databases are often large and characterized by redundant and irrelevant features. An effective classification model will objectively help managers instead of intuitive experience. This study proposes an approach for building a credit scoring model based on the combination of heteroscedastic extension (Loog, Duin, 2002) of classical Fisher Linear Discriminant Analysis (Fisher, 1936, Krzyśko, 1990) and a feature selection algorithm that retains sufficient information for classification purpose. We have tested five feature subset selection algorithms: two filters and three wrappers. To evaluate the accuracy of the proposed credit scoring model and to compare it with the existing approaches we have used the German credit data set from the study (Chen, Li, 2010). The results of our study suggest that the proposed hybrid approach is an effective and promising method for building credit scoring models.

Suggested Citation

  • Stąpor Katarzyna & Smolarczyk Tomasz & Fabian Piotr, 2016. "Heteroscedastic Discriminant Analysis Combined with Feature Selection for Credit Scoring," Statistics in Transition New Series, Statistics Poland, vol. 17(2), pages 265-280, June.
  • Handle: RePEc:vrs:stintr:v:17:y:2016:i:2:p:265-280:n:13
    DOI: 10.21307/stattrans-2016-018
    as

    Download full text from publisher

    File URL: https://doi.org/10.21307/stattrans-2016-018
    Download Restriction: no

    File URL: https://libkey.io/10.21307/stattrans-2016-018?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Thomas, Lyn C., 2000. "A survey of credit and behavioural scoring: forecasting financial risk of lending to consumers," International Journal of Forecasting, Elsevier, vol. 16(2), pages 149-172.
    2. Pacheco, Joaquin & Casado, Silvia & Nunez, Laura & Gomez, Olga, 2006. "Analysis of new variable selection methods for discriminant analysis," Computational Statistics & Data Analysis, Elsevier, vol. 51(3), pages 1463-1478, December.
    3. Crook, Jonathan N. & Edelman, David B. & Thomas, Lyn C., 2007. "Recent developments in consumer credit risk assessment," European Journal of Operational Research, Elsevier, vol. 183(3), pages 1447-1465, December.
    4. L C Thomas & R W Oliver & D J Hand, 2005. "A survey of the issues in consumer credit modelling research," Journal of the Operational Research Society, Palgrave Macmillan;The OR Society, vol. 56(9), pages 1006-1015, September.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Katarzyna Stąpor & Tomasz Smolarczyk & Piotr Fabian, 2016. "Heteroscedastic Discriminant Analysis Combined With Feature Selection For Credit Scoring," Statistics in Transition New Series, Polish Statistical Association, vol. 17(2), pages 265-280, June.
    2. Tomasz Smolarczyk & Katarzyna Stąpor & Piotr Fabian, 2016. "Heteroscedastic Discriminant Analysis Combined With Feature Selection For Credit Scoring," Statistics in Transition new series, Główny Urząd Statystyczny (Polska), vol. 17(2), pages 265-280, June.
    3. Crone, Sven F. & Finlay, Steven, 2012. "Instance sampling in credit scoring: An empirical study of sample size and balancing," International Journal of Forecasting, Elsevier, vol. 28(1), pages 224-238.
    4. L C Thomas, 2010. "Consumer finance: challenges for operational research," Journal of the Operational Research Society, Palgrave Macmillan;The OR Society, vol. 61(1), pages 41-52, January.
    5. Naveed Chehrazi & Peter W. Glynn & Thomas A. Weber, 2019. "Dynamic Credit-Collections Optimization," Management Science, INFORMS, vol. 67(6), pages 2737-2769, June.
    6. Thomas Wainwright, 2011. "Elite Knowledges: Framing Risk and the Geographies of Credit," Environment and Planning A, , vol. 43(3), pages 650-665, March.
    7. K Rajaratnam & P Beling & G Overstreet, 2010. "Scoring decisions in the context of economic uncertainty," Journal of the Operational Research Society, Palgrave Macmillan;The OR Society, vol. 61(3), pages 421-429, March.
    8. Huseyin Ince & Bora Aktan, 2009. "A comparison of data mining techniques for credit scoring in banking: A managerial perspective," Journal of Business Economics and Management, Taylor & Francis Journals, vol. 10(3), pages 233-240, March.
    9. Ting Sun & Miklos A. Vasarhelyi, 2018. "Predicting credit card delinquencies: An application of deep neural networks," Intelligent Systems in Accounting, Finance and Management, John Wiley & Sons, Ltd., vol. 25(4), pages 174-189, October.
    10. repec:syb:wpbsba:03/2013 is not listed on IDEAS
    11. Hussein A. Abdou & John Pointon, 2011. "Credit Scoring, Statistical Techniques And Evaluation Criteria: A Review Of The Literature," Intelligent Systems in Accounting, Finance and Management, John Wiley & Sons, Ltd., vol. 18(2-3), pages 59-88, April.
    12. Pulina, Manuela & Paba, Antonello, 2010. "A discrete choice approach to model credit card fraud," MPRA Paper 20019, University Library of Munich, Germany.
    13. Lobna Abid & Afif Masmoudi & Sonia Zouari-Ghorbel, 2018. "The Consumer Loan’s Payment Default Predictive Model: an Application of the Logistic Regression and the Discriminant Analysis in a Tunisian Commercial Bank," Journal of the Knowledge Economy, Springer;Portland International Center for Management of Engineering and Technology (PICMET), vol. 9(3), pages 948-962, September.
    14. Linhui Wang & Jianping Zhu & Chenlu Zheng & Zhiyuan Zhang, 2024. "Incorporating Digital Footprints into Credit-Scoring Models through Model Averaging," Mathematics, MDPI, vol. 12(18), pages 1-15, September.
    15. Huei-Wen Teng & Michael Lee, 2019. "Estimation Procedures of Using Five Alternative Machine Learning Methods for Predicting Credit Card Default," Review of Pacific Basin Financial Markets and Policies (RPBFMP), World Scientific Publishing Co. Pte. Ltd., vol. 22(03), pages 1-27, September.
    16. Zhixin Liu & Ping He & Bo Chen, 2019. "A Markov decision model for consumer term-loan collections," Review of Quantitative Finance and Accounting, Springer, vol. 52(4), pages 1043-1064, May.
    17. Lu Gao & Kanshukan Rajaratnam & Peter Beling, 2016. "Loan origination decisions using a multinomial scorecard," Annals of Operations Research, Springer, vol. 243(1), pages 199-210, August.
    18. Finlay, Steven, 2010. "Credit scoring for profitability objectives," European Journal of Operational Research, Elsevier, vol. 202(2), pages 528-537, April.
    19. Akkoç, Soner, 2012. "An empirical comparison of conventional techniques, neural networks and the three stage hybrid Adaptive Neuro Fuzzy Inference System (ANFIS) model for credit scoring analysis: The case of Turkish cred," European Journal of Operational Research, Elsevier, vol. 222(1), pages 168-178.
    20. Naveed Chehrazi & Thomas A. Weber, 2015. "Dynamic Valuation of Delinquent Credit-Card Accounts," Management Science, INFORMS, vol. 61(12), pages 3077-3096, December.
    21. Tomáš Vaněk & David Hampel, 2017. "The Probability of Default Under IFRS 9: Multi-period Estimation and Macroeconomic Forecast," Acta Universitatis Agriculturae et Silviculturae Mendelianae Brunensis, Mendel University Press, vol. 65(2), pages 759-776.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:vrs:stintr:v:17:y:2016:i:2:p:265-280:n:13. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Peter Golla (email available below). General contact details of provider: https://stat.gov.pl/en/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.