IDEAS home Printed from https://ideas.repec.org/a/eee/jomega/v104y2021ics0305048321000992.html
   My bibliography  Save this article

Support vector frontiers: A new approach for estimating production functions through support vector machines

Author

Listed:
  • Valero-Carreras, Daniel
  • Aparicio, Juan
  • Guerrero, Nadia M.

Abstract

In microeconomics, a topic of interest is the estimation of production functions. By definition, a production function is a non-decreasing function that envelops all the observations (firms) from above in the input-output space, capturing the extreme behavior of the data. These characteristics are far from the usual ones assumed by machine learning techniques like Support Vector Regression (SVR) in Support Vector Machines, where the function to be estimated relates the response variable to the covariables in terms of the mean instead of the extremes and, additionally, they try to fit the data as much as possible, determining a function that increases and decreases following a data-driven process. In this paper, we introduce an adaptation of SVR, denominated Support Vector Frontiers (SVF), with the objective of estimating production functions. To do so and seeking meeting points between SVR and the standard non-parametric techniques for estimating production functions, mainly Free Disposal Hull (FDH) and Data Envelopment Analysis (DEA), an estimator is defined in this paper through a specific input transformation function. However, and in contrast to FDH and DEA, SVF overcomes the overfitting problems from using these techniques. Additionally, we show in this paper that standard FDH and DEA could be reinterpreted, in some sense, as Support Vector Regression techniques. Moreover, a new robust notion of efficiency is introduced, called ε-insensitive technical efficiency, directly inherited from Support Vector Machines. Finally, the performance of SVF is measured through several experiments using synthetic data, showing that the new approach considerably reduces the bias and mean squared error associated with the estimation of the true production function in comparison with standard FDH and DEA, although at the expense of a more computational burden.

Suggested Citation

  • Valero-Carreras, Daniel & Aparicio, Juan & Guerrero, Nadia M., 2021. "Support vector frontiers: A new approach for estimating production functions through support vector machines," Omega, Elsevier, vol. 104(C).
  • Handle: RePEc:eee:jomega:v:104:y:2021:i:c:s0305048321000992
    DOI: 10.1016/j.omega.2021.102490
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0305048321000992
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.omega.2021.102490?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Leopold Simar & Paul Wilson, 2000. "A general methodology for bootstrapping in non-parametric frontier models," Journal of Applied Statistics, Taylor & Francis Journals, vol. 27(6), pages 779-802.
    2. Léopold Simar & Paul Wilson, 2000. "Statistical Inference in Nonparametric Frontier Models: The State of the Art," Journal of Productivity Analysis, Springer, vol. 13(1), pages 49-78, January.
    3. Khezrimotlagh, Dariush & Zhu, Joe & Cook, Wade D. & Toloo, Mehdi, 2019. "Data envelopment analysis and big data," European Journal of Operational Research, Elsevier, vol. 274(3), pages 1047-1054.
    4. Kerstens, Kristiaan & O’Donnell, Christopher & Van de Woestyne, Ignace, 2019. "Metatechnology frontier and convexity: A restatement," European Journal of Operational Research, Elsevier, vol. 275(2), pages 780-792.
    5. Parag Pendharkar & Marvin Troutt, 2014. "Interactive classification using data envelopment analysis," Annals of Operations Research, Springer, vol. 214(1), pages 125-141, March.
    6. Afriat, Sidney N, 1972. "Efficiency Estimation of Production Function," International Economic Review, Department of Economics, University of Pennsylvania and Osaka University Institute of Social and Economic Research Association, vol. 13(3), pages 568-598, October.
    7. Lee, Chia-Yen & Cai, Jia-Ying, 2020. "LASSO variable selection in data envelopment analysis with small datasets," Omega, Elsevier, vol. 91(C).
    8. Walter Briec & Kristiaan Kerstens & Philippe Venden Eeckaut, 2004. "Non-convex Technologies and Cost Functions: Definitions, Duality and Nonparametric Tests of Convexity," Journal of Economics, Springer, vol. 81(2), pages 155-192, February.
    9. Li, Yongjun & Wang, Lizheng & Li, Feng, 2021. "A data-driven prediction approach for sports team performance and its application to National Basketball Association," Omega, Elsevier, vol. 98(C).
    10. Daniel J. Henderson & Christopher F. Parmeter, 2009. "Imposing economic constraints in nonparametric regression: survey, implementation, and extension," Advances in Econometrics, in: Nonparametric Econometric Methods, pages 433-469, Emerald Group Publishing Limited.
    11. Cinzia Daraio & Léopold Simar, 2005. "Introducing Environmental Variables in Nonparametric Frontier Models: a Probabilistic Approach," Journal of Productivity Analysis, Springer, vol. 24(1), pages 93-121, September.
    12. Isabel Narbón-Perpiñá & Maria Balaguer-Coll & Emili Tortosa-Ausina, 2019. "Evaluating local government performance in times of crisis," Local Government Studies, Taylor & Francis Journals, vol. 45(1), pages 64-100, January.
    13. Hatami-Marbini, Adel & Emrouznejad, Ali & Tavana, Madjid, 2011. "A taxonomy and review of the fuzzy data envelopment analysis literature: Two decades in the making," European Journal of Operational Research, Elsevier, vol. 214(3), pages 457-472, November.
    14. Charnes, A. & Cooper, W. W. & Rhodes, E., 1978. "Measuring the efficiency of decision making units," European Journal of Operational Research, Elsevier, vol. 2(6), pages 429-444, November.
    15. Christopher J. O'Donnell, 2018. "Productivity and Efficiency Analysis," Springer Books, Springer, number 978-981-13-2984-5, December.
    16. Balaguer-Coll, Maria Teresa & Prior, Diego & Tortosa-Ausina, Emili, 2007. "On the determinants of local government performance: A two-stage nonparametric approach," European Economic Review, Elsevier, vol. 51(2), pages 425-451, February.
    17. Misiunas, Nicholas & Oztekin, Asil & Chen, Yao & Chandra, Kavitha, 2016. "DEANN: A healthcare analytic methodology of data envelopment analysis and artificial neural networks for the prediction of organ recipient functional status," Omega, Elsevier, vol. 58(C), pages 46-54.
    18. Léopold Simar & Paul W. Wilson, 1998. "Sensitivity Analysis of Efficiency Scores: How to Bootstrap in Nonparametric Frontier Models," Management Science, INFORMS, vol. 44(1), pages 49-61, January.
    19. Dominique Deprins & Léopold Simar & Henry Tulkens, 2006. "Measuring Labor-Efficiency in Post Offices," Springer Books, in: Parkash Chander & Jacques Drèze & C. Knox Lovell & Jack Mintz (ed.), Public goods, environmental externalities and fiscal competition, chapter 0, pages 285-309, Springer.
    20. Aragon, Y. & Daouia, A. & Thomas-Agnan, C., 2005. "Nonparametric Frontier Estimation: A Conditional Quantile-Based Approach," Econometric Theory, Cambridge University Press, vol. 21(2), pages 358-389, April.
    21. Aparicio, Juan & Pastor, Jesús T. & Vidal, Fernando & Zofío, José L., 2017. "Evaluating productive performance: A new approach based on the product-mix problem consistent with Data Envelopment Analysis," Omega, Elsevier, vol. 67(C), pages 134-144.
    22. Kuosmanen, Timo & Johnson, Andrew, 2017. "Modeling joint production of multiple outputs in StoNED: Directional distance function approach," European Journal of Operational Research, Elsevier, vol. 262(2), pages 792-801.
    23. Timo Kuosmanen & Andrew L. Johnson, 2010. "Data Envelopment Analysis as Nonparametric Least-Squares Regression," Operations Research, INFORMS, vol. 58(1), pages 149-160, February.
    24. Han-Ying Kao & Tao-Ku Chang & Yi-Cheng Chang, 2013. "Classification of Hospital Web Security Efficiency Using Data Envelopment Analysis and Support Vector Machine," Mathematical Problems in Engineering, Hindawi, vol. 2013, pages 1-8, October.
    25. R. D. Banker & A. Charnes & W. W. Cooper, 1984. "Some Models for Estimating Technical and Scale Inefficiencies in Data Envelopment Analysis," Management Science, INFORMS, vol. 30(9), pages 1078-1092, September.
    26. Olesen, Ole B. & Petersen, Niels Christian, 2016. "Stochastic Data Envelopment Analysis—A review," European Journal of Operational Research, Elsevier, vol. 251(1), pages 2-21.
    27. Kerry Poitier & Sohyung Cho, 2011. "Estimation of true efficient frontier of organisational performance using data envelopment analysis and support vector machine learning," International Journal of Information and Decision Sciences, Inderscience Enterprises Ltd, vol. 3(2), pages 148-172.
    28. Chen, Zhongfei & Matousek, Roman & Wanke, Peter, 2018. "Chinese bank efficiency during the global financial crisis: A combined approach using satisficing DEA and Support Vector Machines☆," The North American Journal of Economics and Finance, Elsevier, vol. 43(C), pages 71-86.
    29. Vincent Charles & Juan Aparicio & Joe Zhu (ed.), 2020. "Data Science and Productivity Analytics," International Series in Operations Research and Management Science, Springer, number 978-3-030-43384-0, December.
    30. Christopher Parmeter & Kai Sun & Daniel Henderson & Subal Kumbhakar, 2014. "Estimation and inference under economic restrictions," Journal of Productivity Analysis, Springer, vol. 41(1), pages 111-129, February.
    31. Isabel Narbón-Perpiñá & Maria Teresa Balaguer-Coll & Marko Petrović & Emili Tortosa-Ausina, 2020. "Which estimator to measure local governments’ cost efficiency? The case of Spanish municipalities," SERIEs: Journal of the Spanish Economic Association, Springer;Spanish Economic Association, vol. 11(1), pages 51-82, March.
    32. Rajiv D. Banker, 1993. "Maximum Likelihood, Consistency and Data Envelopment Analysis: A Statistical Foundation," Management Science, INFORMS, vol. 39(10), pages 1265-1273, October.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Raul Moragues & Juan Aparicio & Miriam Esteve, 2023. "Ranking the Importance of Variables in a Nonparametric Frontier Analysis Using Unsupervised Machine Learning Techniques," Mathematics, MDPI, vol. 11(11), pages 1-24, June.
    2. Qianying Jin & Kristiaan Kerstens & Ignace Van de Woestyne, 2024. "Convex and nonconvex nonparametric frontier-based classification methods for anomaly detection," OR Spectrum: Quantitative Approaches in Management, Springer;Gesellschaft für Operations Research e.V., vol. 46(4), pages 1213-1239, December.
    3. Mónica Domínguez & Juan Aparicio & Antonio Fonfria, 2024. "The defence economy: an assessment of productivity change in NATO countries," Applied Economics, Taylor & Francis Journals, vol. 56(18), pages 2158-2175, April.
    4. Astorino, Annabella & Avolio, Matteo & Fuduli, Antonio, 2022. "A maximum-margin multisphere approach for binary Multiple Instance Learning," European Journal of Operational Research, Elsevier, vol. 299(2), pages 642-652.
    5. Raul Moragues & Juan Aparicio & Miriam Esteve, 2023. "Measuring technical efficiency for multi-input multi-output production processes through OneClass Support Vector Machines: a finite-sample study," Operational Research, Springer, vol. 23(3), pages 1-33, September.
    6. Esteve, Miriam & Aparicio, Juan & Rodriguez-Sala, Jesus J. & Zhu, Joe, 2023. "Random Forests and the measurement of super-efficiency in the context of Free Disposal Hull," European Journal of Operational Research, Elsevier, vol. 304(2), pages 729-744.
    7. Nadia M. Guerrero & Juan Aparicio & Daniel Valero-Carreras, 2022. "Combining Data Envelopment Analysis and Machine Learning," Mathematics, MDPI, vol. 10(6), pages 1-22, March.
    8. España, Victor J. & Aparicio, Juan & Barber, Xavier & Esteve, Miriam, 2024. "Estimating production functions through additive models based on regression splines," European Journal of Operational Research, Elsevier, vol. 312(2), pages 684-699.
    9. Tsionas, Mike, 2022. "Efficiency estimation using probabilistic regression trees with an application to Chilean manufacturing industries," International Journal of Production Economics, Elsevier, vol. 249(C).
    10. Moragues, Raul & Aparicio, Juan & Esteve, Miriam, 2023. "An unsupervised learning-based generalization of Data Envelopment Analysis," Operations Research Perspectives, Elsevier, vol. 11(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Esteve, Miriam & Aparicio, Juan & Rodriguez-Sala, Jesus J. & Zhu, Joe, 2023. "Random Forests and the measurement of super-efficiency in the context of Free Disposal Hull," European Journal of Operational Research, Elsevier, vol. 304(2), pages 729-744.
    2. Valentin Zelenyuk, 2019. "Data Envelopment Analysis and Business Analytics: The Big Data Challenges and Some Solutions," CEPA Working Papers Series WP072019, School of Economics, University of Queensland, Australia.
    3. Nadia M. Guerrero & Juan Aparicio & Daniel Valero-Carreras, 2022. "Combining Data Envelopment Analysis and Machine Learning," Mathematics, MDPI, vol. 10(6), pages 1-22, March.
    4. Amir Moradi-Motlagh & Ali Emrouznejad, 2022. "The origins and development of statistical approaches in non-parametric frontier models: a survey of the first two decades of scholarly literature (1998–2020)," Annals of Operations Research, Springer, vol. 318(1), pages 713-741, November.
    5. Léopold Simar & Paul W. Wilson, 2015. "Statistical Approaches for Non-parametric Frontier Models: A Guided Tour," International Statistical Review, International Statistical Institute, vol. 83(1), pages 77-110, April.
    6. Quaranta, Anna Grazia & Raffoni, Anna & Visani, Franco, 2018. "A multidimensional approach to measuring bank branch efficiency," European Journal of Operational Research, Elsevier, vol. 266(2), pages 746-760.
    7. España, Victor J. & Aparicio, Juan & Barber, Xavier & Esteve, Miriam, 2024. "Estimating production functions through additive models based on regression splines," European Journal of Operational Research, Elsevier, vol. 312(2), pages 684-699.
    8. Chen, Ya & Tsionas, Mike G. & Zelenyuk, Valentin, 2021. "LASSO+DEA for small and big wide data," Omega, Elsevier, vol. 102(C).
    9. George Halkos & Nickolaos Tzeremes, 2010. "The effect of foreign ownership on SMEs performance: An efficiency analysis perspective," Journal of Productivity Analysis, Springer, vol. 34(2), pages 167-180, October.
    10. Gounopoulos, Dimitrios & Kallias, Konstantinos & Newton, David & Tzeremes, Nickolaos, 2016. "Political connections and IPO underpricing: An efficiency problem," MPRA Paper 69427, University Library of Munich, Germany.
    11. Keshvari, Abolfazl & Kuosmanen, Timo, 2013. "Stochastic non-convex envelopment of data: Applying isotonic regression to frontier estimation," European Journal of Operational Research, Elsevier, vol. 231(2), pages 481-491.
    12. Kristiaan Kerstens & Ignace Van de Woestyne, 2021. "Cost functions are nonconvex in the outputs when the technology is nonconvex: convexification is not harmless," Annals of Operations Research, Springer, vol. 305(1), pages 81-106, October.
    13. Zelenyuk, Valentin, 2020. "Aggregation of inputs and outputs prior to Data Envelopment Analysis under big data," European Journal of Operational Research, Elsevier, vol. 282(1), pages 172-187.
    14. Halkos, George & Tzeremes, Nickolaos, 2008. "Measuring regional public health provision," MPRA Paper 23762, University Library of Munich, Germany.
    15. Halkos, George & Tzeremes, Nickolaos, 2009. "Exploring the effect of countries’ economic prosperity on their biodiversity performance," MPRA Paper 32102, University Library of Munich, Germany.
    16. Ya Chen & Mike Tsionas & Valentin Zelenyuk, 2020. "LASSO DEA for small and big data," CEPA Working Papers Series WP092020, School of Economics, University of Queensland, Australia.
    17. Halkos, George & Tzeremes, Nickolaos, 2011. "The effect of national culture on countries’ innovation efficiency," MPRA Paper 30100, University Library of Munich, Germany.
    18. Rafael Benítez & Vicente Coll-Serrano & Vicente J. Bolós, 2021. "deaR-Shiny: An Interactive Web App for Data Envelopment Analysis," Sustainability, MDPI, vol. 13(12), pages 1-19, June.
    19. Sinuany-Stern, Zilla, 2023. "Foundations of operations research: From linear programming to data envelopment analysis," European Journal of Operational Research, Elsevier, vol. 306(3), pages 1069-1080.
    20. Podinovski, V. V., 2005. "Selective convexity in DEA models," European Journal of Operational Research, Elsevier, vol. 161(2), pages 552-563, March.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:jomega:v:104:y:2021:i:c:s0305048321000992. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/wps/find/journaldescription.cws_home/375/description#description .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.