IDEAS home Printed from https://ideas.repec.org/p/hal/journl/hal-04434027.html
   My bibliography  Save this paper

Predicting the Performance of MSMEs: A Hybrid DEA-machine Learning Approach

Author

Listed:
  • Sabri Boubaker

    (Métis Lab EM Normandie - EM Normandie - École de Management de Normandie)

  • T.D.Q. Le
  • T. Ngo
  • R. Manita

Abstract

Micro, small and medium enterprises (MSMEs) dominate the business landscape and create more than half of employment worldwide. How we can apply big data analytical tools such as machine learning to examine the performance of MSMEs has become an important question to provide quicker results and recommend better and more reliable solutions that improve performance. This paper proposes a novel method for estimating a common set of weights (CSW) based on regression analysis for data envelopment analysis (DEA) as an important analytical and operational research technique, which (i) allows for measurement evaluations and ranking comparisons of the MSMEs, and (ii) helps overcome the time-consuming non-convexity issues of other CSW DEA methodologies. Our hybrid approach used several econometric and machine learning techniques (such as Tobit, least absolute shrinkage and selection operator, and Random Forest regression) to empirically explain and predict the performance of more than 5400 Vietnamese MSMEs (2010-2016), and showed that the machine learning techniques are more efficient and accurate than the econometric ones. Our study, therefore, sheds new light on the two-stage DEA literature, especially in terms of predicting performance in the era of big data to strengthen the role of analytics in business and management. \textcopyright 2023, The Author(s).

Suggested Citation

  • Sabri Boubaker & T.D.Q. Le & T. Ngo & R. Manita, 2023. "Predicting the Performance of MSMEs: A Hybrid DEA-machine Learning Approach," Post-Print hal-04434027, HAL.
  • Handle: RePEc:hal:journl:hal-04434027
    DOI: 10.1007/s10479-023-05230-8
    Note: View the original document on HAL open archive server: https://normandie-univ.hal.science/hal-04434027
    as

    Download full text from publisher

    File URL: https://normandie-univ.hal.science/hal-04434027/document
    Download Restriction: no

    File URL: https://libkey.io/10.1007/s10479-023-05230-8?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Pierre‐Philippe Combes & Gilles Duranton & Laurent Gobillon & Diego Puga & Sébastien Roux, 2012. "The Productivity Advantages of Large Cities: Distinguishing Agglomeration From Firm Selection," Econometrica, Econometric Society, vol. 80(6), pages 2543-2594, November.
    2. Daraio, Cinzia & Simar, Leopold & Wilson, Paul, 2010. "Testing whether two-stage estimation is meaningful in non-parametric models of production," LIDAM Discussion Papers ISBA 2010031, Université catholique de Louvain, Institute of Statistics, Biostatistics and Actuarial Sciences (ISBA).
    3. Wang, Ying-Ming & Chin, Kwai-Sang, 2010. "Some alternative models for DEA cross-efficiency evaluation," International Journal of Production Economics, Elsevier, vol. 128(1), pages 332-338, November.
    4. Per Andersen & Niels Christian Petersen, 1993. "A Procedure for Ranking Efficient Units in Data Envelopment Analysis," Management Science, INFORMS, vol. 39(10), pages 1261-1264, October.
    5. Hung T. Pham & Thanh L. Dao & Barry Reilly, 2010. "Technical efficiency in the Vietnamese manufacturing sector," Journal of International Development, John Wiley & Sons, Ltd., vol. 22(4), pages 503-520.
    6. Lee, Chia-Yen & Cai, Jia-Ying, 2020. "LASSO variable selection in data envelopment analysis with small datasets," Omega, Elsevier, vol. 91(C).
    7. John R. Baldwin & Wulong Gu, 2004. "Trade Liberalization: Export-market Participation, Productivity Growth, and Innovation," Oxford Review of Economic Policy, Oxford University Press and Oxford Review of Economic Policy Limited, vol. 20(3), pages 372-392, Autumn.
    8. Tu D. Q. Le & Tin H. Ho & Dat T. Nguyen & Thanh Ngo, 2021. "Fintech Credit and Bank Efficiency: International Evidence," IJFS, MDPI, vol. 9(3), pages 1-16, August.
    9. Le, Viet & Vu, Xuan-Binh (Benjamin) & Nghiem, Son, 2018. "Technical efficiency of small and medium manufacturing firms in Vietnam: A stochastic meta-frontier analysis," Economic Analysis and Policy, Elsevier, vol. 59(C), pages 84-91.
    10. Hailu, Kidanemariam Berhe & Tanaka, Makoto, 2015. "A “true” random effects stochastic frontier analysis for technical efficiency and heterogeneity: Evidence from manufacturing firms in Ethiopia," Economic Modelling, Elsevier, vol. 50(C), pages 179-192.
    11. Banker, Rajiv D., 1984. "Estimating most productive scale size using data envelopment analysis," European Journal of Operational Research, Elsevier, vol. 17(1), pages 35-44, July.
    12. Mary Amiti & Jozef Konings, 2007. "Trade Liberalization, Intermediate Inputs, and Productivity: Evidence from Indonesia," American Economic Review, American Economic Association, vol. 97(5), pages 1611-1638, December.
    13. Kamble, Sachin S. & Gunasekaran, Angappa & Ghadge, Abhijeet & Raut, Rakesh, 2020. "A performance measurement system for industry 4.0 enabled smart manufacturing system in SMMEs- A review and empirical investigation," International Journal of Production Economics, Elsevier, vol. 229(C).
    14. Dao Le Trang Anh & Christopher Gan, 2020. "Profitability and marketability efficiencies of Vietnam manufacturing firms," International Journal of Social Economics, Emerald Group Publishing Limited, vol. 47(1), pages 54-71, January.
    15. Charnes, A. & Cooper, W. W. & Rhodes, E., 1978. "Measuring the efficiency of decision making units," European Journal of Operational Research, Elsevier, vol. 2(6), pages 429-444, November.
    16. John R. Baldwin & Wulong Gu, 2004. "Trade Liberalization: Export-market Participation, Productivity Growth, and Innovation," Oxford Review of Economic Policy, Oxford University Press, vol. 20(3), pages 372-392, Autumn.
    17. Helmi Hammami & Thanh Ngo & David Tripe & Dinh-Tri Vo, 2022. "Ranking with a Euclidean common set of weights in data envelopment analysis: with application to the Eurozone banking sector," Annals of Operations Research, Springer, vol. 311(2), pages 675-694, April.
    18. Gabriele Pellegrino, 2018. "Barriers to innovation in young and mature firms," Journal of Evolutionary Economics, Springer, vol. 28(1), pages 181-206, January.
    19. Chia-Hui Huang & Chih-Hai Yang, 2016. "Ownership, trade, and productivity in Vietnam’s manufacturing firms," Asia-Pacific Journal of Accounting & Economics, Taylor & Francis Journals, vol. 23(3), pages 356-371, July.
    20. Qing Wang & Zhaojun Liu & Yang Zhang, 2017. "A Novel Weighting Method for Finding Common Weights in DEA," Asia-Pacific Journal of Operational Research (APJOR), World Scientific Publishing Co. Pte. Ltd., vol. 34(05), pages 1-21, October.
    21. Misiunas, Nicholas & Oztekin, Asil & Chen, Yao & Chandra, Kavitha, 2016. "DEANN: A healthcare analytic methodology of data envelopment analysis and artificial neural networks for the prediction of organ recipient functional status," Omega, Elsevier, vol. 58(C), pages 46-54.
    22. Thanh Ngo & Hung V. Vu & Huong Ho & Thuy T. T. Dao & Hai T. H. Nguyen, 2019. "Performance of Fish Farms in Vietnam–Does Financial Access Help Improve Their Cost Efficiency?," IJFS, MDPI, vol. 7(3), pages 1-10, August.
    23. Yang, Ji-Chung, 2006. "The efficiency of SMEs in the global market: Measuring the Korean performance," Journal of Policy Modeling, Elsevier, vol. 28(8), pages 861-876, November.
    24. Thanh Ngo & Tu Le & Son H. Tran & Anh Nguyen & Canh Nguyen, 2019. "Sources of the performance of manufacturing firms: evidence from Vietnam," Post-Communist Economies, Taylor & Francis Journals, vol. 31(6), pages 790-804, November.
    25. Javier Vidal-García & Marta Vidal & Sabri Boubaker & Majdi Hassan, 2018. "The efficiency of mutual funds," Annals of Operations Research, Springer, vol. 267(1), pages 555-584, August.
    26. Chen, Ya & Tsionas, Mike G. & Zelenyuk, Valentin, 2021. "LASSO+DEA for small and big wide data," Omega, Elsevier, vol. 102(C).
    27. Mostafa Davtalab-Olyaie, 2019. "A secondary goal in DEA cross-efficiency evaluation: A “one home run is much better than two doubles” criterion," Journal of the Operational Research Society, Taylor & Francis Journals, vol. 70(5), pages 807-816, May.
    28. Jamal Ouenniche & Skarleth Carrales, 2018. "Assessing efficiency profiles of UK commercial banks: a DEA analysis with regression-based feedback," Annals of Operations Research, Springer, vol. 266(1), pages 551-587, July.
    29. Tin H. Ho & Dat T. Nguyen & Thanh Ngo & Tu D. Q. Le, 2021. "Efficiency in Vietnamese Banking: A Meta-Regression Analysis Approach," IJFS, MDPI, vol. 9(3), pages 1-15, August.
    30. Adler, Nicole & Friedman, Lea & Sinuany-Stern, Zilla, 2002. "Review of ranking methods in the data envelopment analysis context," European Journal of Operational Research, Elsevier, vol. 140(2), pages 249-265, July.
    31. Huong Van Vu & Mark Holmes & Tuyen Quang Tran & Steven Lim, 2016. "Firm exporting and productivity: what if productivity is no longer a black box," Baltic Journal of Economics, Baltic International Centre for Economic Policy Studies, vol. 16(2), pages 95-113.
    32. C Kao & H-T Hung, 2005. "Data envelopment analysis with common weights: the compromise solution approach," Journal of the Operational Research Society, Palgrave Macmillan;The OR Society, vol. 56(10), pages 1196-1203, October.
    33. Ayyagari, Meghana & Beck, Thorsten & Demirguc-Kunt, Asl, 2003. "Small and medium enterprises across the globe : a new database," Policy Research Working Paper Series 3127, The World Bank.
    34. Sabri Boubaker & Asma Houcine & Zied Ftiti & Hatem Masri, 2018. "Does audit quality affect firms’ investment efficiency?," Journal of the Operational Research Society, Taylor & Francis Journals, vol. 69(10), pages 1688-1699, October.
    35. Thuy T. T. Dao & Xuan T. T. Mai & Thanh Ngo & Tu Le & Huong Ho, 2021. "From Efficiency Analyses to Policy Implications: a Multilevel Hierarchical Linear Model Approach," International Journal of the Economics of Business, Taylor & Francis Journals, vol. 28(3), pages 457-470, September.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Helmi Hammami & Thanh Ngo & David Tripe & Dinh-Tri Vo, 2022. "Ranking with a Euclidean common set of weights in data envelopment analysis: with application to the Eurozone banking sector," Annals of Operations Research, Springer, vol. 311(2), pages 675-694, April.
    2. Mostafa Davtalab-Olyaie & Hadis Mahmudi-Baram & Masoud Asgharian, 2023. "Measuring individual efficiency and unit influence in centrally managed systems," Annals of Operations Research, Springer, vol. 321(1), pages 139-164, February.
    3. Feng Li & Han Wu & Qingyuan Zhu & Liang Liang & Gang Kou, 2021. "Data envelopment analysis cross efficiency evaluation with reciprocal behaviors," Annals of Operations Research, Springer, vol. 302(1), pages 173-210, July.
    4. Dariush Akbarian, 2015. "Ranking All DEA-Efficient DMUs Based on Cross Efficiency and Analytic Hierarchy Process Methods," Journal of Optimization, Hindawi, vol. 2015, pages 1-10, January.
    5. Kim, Nam Hyok & He, Feng & Kwon, O Chol, 2023. "Combining common-weights DEA window with the Malmquist index: A case of China’s iron and steel industry," Socio-Economic Planning Sciences, Elsevier, vol. 87(PB).
    6. Esteve, Miriam & Aparicio, Juan & Rodriguez-Sala, Jesus J. & Zhu, Joe, 2023. "Random Forests and the measurement of super-efficiency in the context of Free Disposal Hull," European Journal of Operational Research, Elsevier, vol. 304(2), pages 729-744.
    7. Soltanifar, Mehdi & Shahghobadi, Saeid, 2013. "Selecting a benevolent secondary goal model in data envelopment analysis cross-efficiency evaluation by a voting model," Socio-Economic Planning Sciences, Elsevier, vol. 47(1), pages 65-74.
    8. Ebrahimi, Bohlool & Dhamotharan, Lalitha & Ghasemi, Mohammad Reza & Charles, Vincent, 2022. "A cross-inefficiency approach based on the deviation variables framework," Omega, Elsevier, vol. 111(C).
    9. Mai, Nhat Chi, 2015. "Efficiency of the banking system in Vietnam under financial liberalization," OSF Preprints qsf6d, Center for Open Science.
    10. Kanematsu, Simon Y. & Carvalho, Ney P. & Martinhon, Carlos A. & Almeida, Mariana R., 2020. "Ranking using η-efficiency and relative size measures based on DEA," Omega, Elsevier, vol. 90(C).
    11. Thanh Ngo & David Tripe & Duc Khuong Nguyen, 2024. "Estimating the productivity of US agriculture: The Fisher total factor productivity index for time series data with unknown prices," Australian Journal of Agricultural and Resource Economics, Australian Agricultural and Resource Economics Society, vol. 68(3), pages 701-712, July.
    12. Thanh Ngo & Tu DQ Le & Dat T Nguyen & Tin H Ho, 2023. "Determinants Of Bank Performance: Revisiting The Role Of Ceo’S Personality Traits Using Graphology," Bulletin of Monetary Economics and Banking, Bank Indonesia, vol. 26(2), pages 289-310, May.
    13. Ruiz, José L. & Sirvent, Inmaculada, 2016. "Common benchmarking and ranking of units with DEA," Omega, Elsevier, vol. 65(C), pages 1-9.
    14. Jie Wu & Junfei Chu & Qingyuan Zhu & Pengzhen Yin & Liang Liang, 2016. "DEA cross-efficiency evaluation based on satisfaction degree: an application to technology selection," International Journal of Production Research, Taylor & Francis Journals, vol. 54(20), pages 5990-6007, October.
    15. Nam Hyok Kim & Feng He & Kwon Ryong Hong & Hyok-Chol Kim & Sok-Min Han, 2024. "A new common weights DEA model based on cluster analysis," Operational Research, Springer, vol. 24(2), pages 1-35, June.
    16. Wenli Liu & Ying-Ming Wang & Shulong Lv, 2017. "An aggressive game cross-efficiency evaluation in data envelopment analysis," Annals of Operations Research, Springer, vol. 259(1), pages 241-258, December.
    17. Li, Yongjun & Xie, Jianhui & Wang, Meiqiang & Liang, Liang, 2016. "Super efficiency evaluation using a common platform on a cooperative game," European Journal of Operational Research, Elsevier, vol. 255(3), pages 884-892.
    18. Davtalab-Olyaie, Mostafa & Asgharian, Masoud & Nia, Vahid Partovi, 2019. "Stochastic ranking and dominance in DEA," International Journal of Production Economics, Elsevier, vol. 214(C), pages 125-138.
    19. Hamid Kiaei & Reza Kazemi Matin, 2022. "New common set of weights method in black-box and two-stage data envelopment analysis," Annals of Operations Research, Springer, vol. 309(1), pages 143-162, February.
    20. Azarnoosh Kafi & Behrouz Daneshian & Mohsen Rostamy-Malkhalifeh, 2021. "Forecasting the confidence interval of efficiency in fuzzy DEA," Operations Research and Decisions, Wroclaw University of Science and Technology, Faculty of Management, vol. 31(1), pages 41-59.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:hal:journl:hal-04434027. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: CCSD (email available below). General contact details of provider: https://hal.archives-ouvertes.fr/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.