IDEAS home Printed from https://ideas.repec.org/p/tcb/wpaper/2413.html
   My bibliography  Save this paper

The Good, the Better and the Challenging:Insights into Predicting High-Growth Firms using Machine Learning

Author

Listed:
  • Sermet Pekin
  • Aykut Sengul

Abstract

This study aims to classify high-growth firms using several machine learning algorithms, including K-Nearest Neighbors, Logistic Regression with L1 (Lasso) and L2 (Ridge) Regularization, XGBoost, Gradient Descent, Naive Bayes and Random Forest. Leveraging a dataset composed of financial metrics and firm characteristics between 2009 and 2022 with 1,318,799 unique firms (averaging 554,178 annually), we evaluate the performance of each model using metrics such as MCC, ROC AUC, accuracy, precision, recall and F1-score. In our study, ROC AUC values ranged from 0.53 to 0.87 for employee-high growth and from 0.53 to 0.91 for turnover-high growth, depending on the method used. Our findings indicate that XGBoost achieves the highest performance, followed by Random Forest and Logistic Regression, demonstrating their effectiveness in distinguishing between high-growth and non-high-growth firms. Conversely, KNN and Naive Bayes yield lower accuracy. Furthermore, our findings reveal that growth opportunity emerges as the most significant factor in our study. This research contributes valuable insights in identifying high-growth firms and underscores the potential of machine learning in economic prediction.

Suggested Citation

  • Sermet Pekin & Aykut Sengul, 2024. "The Good, the Better and the Challenging:Insights into Predicting High-Growth Firms using Machine Learning," Working Papers 2413, Research and Monetary Policy Department, Central Bank of the Republic of Turkey.
  • Handle: RePEc:tcb:wpaper:2413
    as

    Download full text from publisher

    File URL: https://www.tcmb.gov.tr/wps/wcm/connect/f6ba5939-e28a-4a4e-bb57-26edd66d1e0d/wp_2413.pdf?MOD=AJPERES&CACHEID=ROOTWORKSPACE-f6ba5939-e28a-4a4e-bb57-26edd66d1e0d-pg3jixk
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Ramaswamy, Venkat & Ozcan, Kerimcan, 2016. "Brand value co-creation in a digitalized world: An integrative framework and research implications," International Journal of Research in Marketing, Elsevier, vol. 33(1), pages 93-106.
    2. Alex Coad & Stjepan Srhoj, 2020. "Catching Gazelles with a Lasso: Big data techniques for the prediction of high-growth firms," Small Business Economics, Springer, vol. 55(3), pages 541-565, October.
    3. Giulio Bottazzi & Angelo Secchi, 2006. "Explaining the distribution of firm growth rates," RAND Journal of Economics, RAND Corporation, vol. 37(2), pages 235-256, June.
    4. Miroslav Mateev & Yanko Anastasov, 2010. "Determinants of small and medium sized fast growing enterprises in central and eastern Europe: a panel data analysis," Financial Theory and Practice, Institute of Public Finance, vol. 34(3), pages 269-295.
    5. Magnus Henrekson & Dan Johansson, 2010. "Gazelles as job creators: a survey and interpretation of the evidence," Small Business Economics, Springer, vol. 35(2), pages 227-244, September.
    6. D.B. Audretsch & L. Klomp & E. Santarelli & A.R. Thurik, 2004. "Gibrat's Law: Are the Services Different?," Review of Industrial Organization, Springer;The Industrial Organization Society, vol. 24(3), pages 301-324, May.
    7. Alex Coad & Christina Guenther, 2014. "Processes of firm growth and diversification: theory and evidence," Small Business Economics, Springer, vol. 43(4), pages 857-871, December.
    8. Alex Coad & Sven-Olov Daunfeldt & Werner Hölzl & Dan Johansson & Paul Nightingale, 2014. "High-growth firms: introduction to the special section," Industrial and Corporate Change, Oxford University Press and the Associazione ICC, vol. 23(1), pages 91-112, February.
    9. Jo Danbolt & Ian Hirst & Eddie Jones, 2011. "The growth companies puzzle: can growth opportunities measures predict firm growth?," The European Journal of Finance, Taylor & Francis Journals, vol. 17(1), pages 1-25.
    10. Joachim Wagner, 2003. "Unobserved firm heterogeneity and the size-exports nexus: Evidence from German panel data," Review of World Economics (Weltwirtschaftliches Archiv), Springer;Institut für Weltwirtschaft (Kiel Institute for the World Economy), vol. 139(1), pages 161-172, March.
    11. Giulio Bottazzi & Angelo Secchi, 2006. "Explaining the Distribution of Firm Growth Rates," RAND Journal of Economics, The RAND Corporation, vol. 37(2), pages 235-256, Summer.
    12. Amith Vikram Megaravalli & Gabriele Sampagnaro, 2018. "Firm age and liquidity ratio as predictors of firm growth: evidence from Indian firms," Applied Economics Letters, Taylor & Francis Journals, vol. 25(19), pages 1373-1375, November.
    13. Sabri Boughorbel & Fethi Jarray & Mohammed El-Anbari, 2017. "Optimal classifier for imbalanced data using Matthews Correlation Coefficient metric," PLOS ONE, Public Library of Science, vol. 12(6), pages 1-17, June.
    14. David B. Audretsch & Dirk Dohse, 2007. "Location: A Neglected Determinant of Firm Growth," Review of World Economics (Weltwirtschaftliches Archiv), Springer;Institut für Weltwirtschaft (Kiel Institute for the World Economy), vol. 143(1), pages 79-107, April.
    15. Noel Capon & John U. Farley & Scott Hoenig, 1990. "Determinants of Financial Performance: A Meta-Analysis," Management Science, INFORMS, vol. 36(10), pages 1143-1159, October.
    16. Delmar, Frederic & Davidsson, Per & Gartner, William B., 2003. "Arriving at the high-growth firm," Journal of Business Venturing, Elsevier, vol. 18(2), pages 189-216, March.
    17. Thomas H. Brush & Philip Bromiley & Margaretha Hendrickx, 2000. "The free cash flow hypothesis for sales growth and firm performance," Strategic Management Journal, Wiley Blackwell, vol. 21(4), pages 455-472, April.
    18. Penrose, Edith, 2009. "The Theory of the Growth of the Firm," OUP Catalogue, Oxford University Press, edition 4, number 9780199573844.
    19. Omo Aregbeyen*, 2012. "The Determinants of Firm Growth in Nigeria," Pakistan Journal of Applied Economics, Applied Economics Research Centre, vol. 22, pages 19-38.
    20. Coad, Alex & Frankish, Julian & Roberts, Richard G. & Storey, David J., 2013. "Growth paths and survival chances: An application of Gambler's Ruin theory," Journal of Business Venturing, Elsevier, vol. 28(5), pages 615-632.
    21. Stephanie Houle & Ryan Macdonald, 2023. "Identifying Nascent High-Growth Firms Using Machine Learning," Staff Working Papers 23-53, Bank of Canada.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Silviano Esteve-Pérez & Fabio Pieri & Diego Rodriguez, 2022. "One swallow does not make a summer: episodes and persistence in high growth," Small Business Economics, Springer, vol. 58(3), pages 1517-1544, March.
    2. Juan Federico & Joan-Lluis Capelleras, 2015. "The heterogeneous dynamics between growth and profits: the case of young firms," Small Business Economics, Springer, vol. 44(2), pages 231-253, February.
    3. Ari Hyytinen & Petri Rouvinen & Mika Pajarinen & Joosua Virtanen, 2023. "Ex Ante Predictability of Rapid Growth: A Design Science Approach," Entrepreneurship Theory and Practice, , vol. 47(6), pages 2465-2493, November.
    4. Alex Coad & Sven-Olov Daunfeldt & Daniel Halvarsson, 2022. "Amundsen versus Scott: are growth paths related to firm performance?," Small Business Economics, Springer, vol. 59(2), pages 593-610, August.
    5. Alex Coad, 2022. "Lumps, Bumps and Jumps in the Firm Growth Process," Foundations and Trends(R) in Entrepreneurship, now publishers, vol. 18(4), pages 212-267, April.
    6. Guilherme Fowler A. Monteiro & Adriana Bruscato Bortoluzzo & Pedro Lipkin P. Rosa, 2021. "Is there heterogeneity among persistent high‐growth firms?," Growth and Change, Wiley Blackwell, vol. 52(4), pages 2272-2292, December.
    7. Ho-Chang Chae, 2024. "In search of gazelles: machine learning prediction for Korean high-growth firms," Small Business Economics, Springer, vol. 62(1), pages 243-284, January.
    8. Canarella, Giorgio & Miller, Stephen M., 2018. "The determinants of growth in the U.S. information and communication technology (ICT) industry: A firm-level analysis," Economic Modelling, Elsevier, vol. 70(C), pages 259-271.
    9. Friesenbichler, Klaus S. & Hoelzl, Werner, 2022. "Firm-growth and Functional Strategic Domains: Exploratory evidence for differences between frontier and catching-up economies," Journal of Economics and Business, Elsevier, vol. 119(C).
    10. Alex Coad & Clemens Domnick & Florian Flachenecker & Peter Harasztosi & Mario Lorenzo Janiri & Rozalia Pal & Mercedes Teruel, 2022. "Capacity constraints as a trigger for high growth," Small Business Economics, Springer, vol. 59(3), pages 893-923, October.
    11. Giorgio Barba Navaretti & Davide Castellani & Fabio Pieri, 2014. "Age and firm growth: evidence from three European countries," Small Business Economics, Springer, vol. 43(4), pages 823-837, December.
    12. Rolando Vaz, 2023. "Firm growth in the Portuguese footwear industry: the location dilemma," The Annals of Regional Science, Springer;Western Regional Science Association, vol. 70(2), pages 407-427, April.
    13. Yannis Caloghirou & Ioannis Giotopoulos & Alexandra Kontolaimou & Aggelos Tsakanikas, 2022. "Inside the black box of high-growth firms in a crisis-hit economy: corporate strategy, employee human capital and R&D capabilities," International Entrepreneurship and Management Journal, Springer, vol. 18(3), pages 1319-1345, September.
    14. Marko Peric & Vanja Vitezic, 2016. "Impact of global economic crisis on firm growth," Small Business Economics, Springer, vol. 46(1), pages 1-12, January.
    15. Bianchini, Stefano & Pellegrino, Gabriele, 2019. "Innovation persistence and employment dynamics," Research Policy, Elsevier, vol. 48(5), pages 1171-1186.
    16. Daniele Moschella & Federico Tamagni & Xiaodan Yu, 2019. "Persistent high-growth firms in China’s manufacturing," Small Business Economics, Springer, vol. 52(3), pages 573-594, March.
    17. Erhardt, Eva Christine, 2018. "Firm performance after high growth: A comparison of absolute and relative growth measures," MPRA Paper 88077, University Library of Munich, Germany.
    18. Besnik A. Krasniqi & Sameeksha Desai, 2016. "Institutional drivers of high-growth firms: country-level evidence from 26 transition economies," Small Business Economics, Springer, vol. 47(4), pages 1075-1094, December.
    19. Daunfeldt, Sven-Olov & Lang, Åsa & Macuchova, Zuzana & Rudholm, Niklas, 2011. "Firm Growth in the Retail and Wholesale Trade Sectors – Evidence from Sweden," HUI Working Papers 50, HUI Research.
    20. Alex Coad & Sven-Olov Daunfeldt & Daniel Halvarsson, 2018. "Bursting into life: firm growth and growth persistence by age," Small Business Economics, Springer, vol. 50(1), pages 55-75, January.

    More about this item

    Keywords

    High-growth firms; Machine learning; Prediction; Firm dynamics;
    All these keywords.

    JEL classification:

    • C40 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods: Special Topics - - - General
    • C55 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Large Data Sets: Modeling and Analysis
    • C60 - Mathematical and Quantitative Methods - - Mathematical Methods; Programming Models; Mathematical and Simulation Modeling - - - General
    • C81 - Mathematical and Quantitative Methods - - Data Collection and Data Estimation Methodology; Computer Programs - - - Methodology for Collecting, Estimating, and Organizing Microeconomic Data; Data Access
    • L25 - Industrial Organization - - Firm Objectives, Organization, and Behavior - - - Firm Performance

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:tcb:wpaper:2413. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sermet Pekin or Ilker Cakar or the person in charge (email available below). General contact details of provider: https://edirc.repec.org/data/tcmgvtr.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.