
An unsupervised learning-based generalization of Data Envelopment Analysis

Authors

  • Moragues, Raul
  • Aparicio, Juan
  • Esteve, Miriam

Abstract

In this paper, we introduce an unsupervised machine learning method for production frontier estimation. The new approach satisfies fundamental shape constraints from microeconomics, such as convexity and free disposability. It generalizes Data Envelopment Analysis (DEA) by adapting One-Class Support Vector Machines with a piecewise linear transformation mapping, with the aim of reducing the overfitting that occurs in DEA. We also show how to measure technical inefficiency through the directional distance function. Finally, we evaluate the performance of the new technique in a computational experiment, which shows that the mean squared error in the estimation of the frontier is up to 83% lower than that of standard DEA in certain scenarios.
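
For orientation, the directional distance function mentioned in the abstract can be illustrated on the standard DEA baseline that the paper generalizes. The sketch below is not the authors' One-Class SVM method. It is a minimal example of the textbook variable-returns-to-scale DEA linear program, assuming SciPy's linprog as the solver. The function name ddf_dea_vrs, the direction vector (gx, gy), and the four toy units are hypothetical and chosen only for illustration.

```python
import numpy as np
from scipy.optimize import linprog


def ddf_dea_vrs(X, Y, x0, y0, gx, gy):
    """Directional distance function under standard VRS DEA (illustrative only).

    Solves  max beta
            s.t. sum_j lam_j * x_j <= x0 - beta * gx   (inputs)
                 sum_j lam_j * y_j >= y0 + beta * gy   (outputs)
                 sum_j lam_j = 1,  lam_j >= 0          (convexity, VRS)
    """
    X = np.asarray(X, dtype=float)      # shape (n_units, n_inputs)
    Y = np.asarray(Y, dtype=float)      # shape (n_units, n_outputs)
    x0 = np.atleast_1d(np.asarray(x0, dtype=float))
    y0 = np.atleast_1d(np.asarray(y0, dtype=float))
    gx = np.atleast_1d(np.asarray(gx, dtype=float))
    gy = np.atleast_1d(np.asarray(gy, dtype=float))
    n, m = X.shape
    s = Y.shape[1]

    # Decision vector is [beta, lam_1, ..., lam_n]; linprog minimizes, so use -beta.
    c = np.zeros(n + 1)
    c[0] = -1.0

    # Inputs:  beta * gx + X^T lam <= x0
    A_in = np.hstack([gx.reshape(m, 1), X.T])
    # Outputs: beta * gy - Y^T lam <= -y0
    A_out = np.hstack([gy.reshape(s, 1), -Y.T])
    A_ub = np.vstack([A_in, A_out])
    b_ub = np.concatenate([x0, -y0])

    # Convexity constraint: the lambdas sum to one (beta has coefficient zero).
    A_eq = np.concatenate([[0.0], np.ones(n)]).reshape(1, -1)
    b_eq = [1.0]

    bounds = [(None, None)] + [(0, None)] * n   # beta is free, lam >= 0
    res = linprog(c, A_ub=A_ub, b_ub=b_ub, A_eq=A_eq, b_eq=b_eq,
                  bounds=bounds, method="highs")
    if not res.success:
        raise RuntimeError(res.message)
    return res.x[0]


# Hypothetical toy data: one input, one output, four units A, B, C, D.
X = [[1.0], [2.0], [4.0], [3.0]]
Y = [[1.0], [3.0], [4.0], [2.0]]

# Unit D = (3, 2) is interior; along direction (1, 1) it reaches B = (2, 3).
beta_D = ddf_dea_vrs(X, Y, x0=[3.0], y0=[2.0], gx=[1.0], gy=[1.0])
# Unit B = (2, 3) is a vertex of the DEA frontier, so its inefficiency is zero.
beta_B = ddf_dea_vrs(X, Y, x0=[2.0], y0=[3.0], gx=[1.0], gy=[1.0])
print(beta_D, beta_B)  # approximately 1.0 and 0.0
```

A value of zero marks a unit on the estimated frontier, and larger values indicate greater technical inefficiency in the chosen direction. Because the DEA frontier envelops the observed data as tightly as possible, it is the baseline whose overfitting the paper's method aims to reduce.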

Suggested Citation

  • Moragues, Raul & Aparicio, Juan & Esteve, Miriam, 2023. "An unsupervised learning-based generalization of Data Envelopment Analysis," Operations Research Perspectives, Elsevier, vol. 11(C).
  • Handle: RePEc:eee:oprepe:v:11:y:2023:i:c:s2214716023000192
    DOI: 10.1016/j.orp.2023.100284



    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Raul Moragues & Juan Aparicio & Miriam Esteve, 2023. "Measuring technical efficiency for multi-input multi-output production processes through OneClass Support Vector Machines: a finite-sample study," Operational Research, Springer, vol. 23(3), pages 1-33, September.
    2. Raul Moragues & Juan Aparicio & Miriam Esteve, 2023. "Ranking the Importance of Variables in a Nonparametric Frontier Analysis Using Unsupervised Machine Learning Techniques," Mathematics, MDPI, vol. 11(11), pages 1-24, June.
    3. España, Victor J. & Aparicio, Juan & Barber, Xavier & Esteve, Miriam, 2024. "Estimating production functions through additive models based on regression splines," European Journal of Operational Research, Elsevier, vol. 312(2), pages 684-699.
    4. Esteve, Miriam & Aparicio, Juan & Rodriguez-Sala, Jesus J. & Zhu, Joe, 2023. "Random Forests and the measurement of super-efficiency in the context of Free Disposal Hull," European Journal of Operational Research, Elsevier, vol. 304(2), pages 729-744.
    5. Nadia M. Guerrero & Juan Aparicio & Daniel Valero-Carreras, 2022. "Combining Data Envelopment Analysis and Machine Learning," Mathematics, MDPI, vol. 10(6), pages 1-22, March.
    6. Kaffash, Sepideh & Azizi, Roza & Huang, Ying & Zhu, Joe, 2020. "A survey of data envelopment analysis applications in the insurance industry 1993–2018," European Journal of Operational Research, Elsevier, vol. 284(3), pages 801-813.
    7. Léopold Simar & Paul W. Wilson, 2015. "Statistical Approaches for Non-parametric Frontier Models: A Guided Tour," International Statistical Review, International Statistical Institute, vol. 83(1), pages 77-110, April.
    8. Valero-Carreras, Daniel & Aparicio, Juan & Guerrero, Nadia M., 2021. "Support vector frontiers: A new approach for estimating production functions through support vector machines," Omega, Elsevier, vol. 104(C).
    9. Valentin Zelenyuk, 2023. "Productivity analysis: roots, foundations, trends and perspectives," Journal of Productivity Analysis, Springer, vol. 60(3), pages 229-247, December.
    10. Rafael Benítez & Vicente Coll-Serrano & Vicente J. Bolós, 2021. "deaR-Shiny: An Interactive Web App for Data Envelopment Analysis," Sustainability, MDPI, vol. 13(12), pages 1-19, June.
    11. Manuel Salas-Velasco, 2020. "Measuring and explaining the production efficiency of Spanish universities using a non-parametric approach and a bootstrapped-truncated regression," Scientometrics, Springer;Akadémiai Kiadó, vol. 122(2), pages 825-846, February.
    12. Subhash C. Ray, 2018. "Data Envelopment Analysis with Alternative Returns to Scale," Working papers 2018-20, University of Connecticut, Department of Economics.
    13. Thiago Victorino & Carlos Rosano Peña, 2023. "The Development of Efficiency Analysis in Transportation Systems: A Bibliometric and Systematic Review," Sustainability, MDPI, vol. 15(13), pages 1-32, June.
    14. Wijesiri, Mahinda & Yaron, Jacob & Meoli, Michele, 2017. "Assessing the financial and outreach efficiency of microfinance institutions: Do age and size matter?," Journal of Multinational Financial Management, Elsevier, vol. 40(C), pages 63-76.
    15. Thanh Ngo & Kan Wai Hong Tsui, 2022. "Estimating the confidence intervals for DEA efficiency scores of Asia-Pacific airlines," Operational Research, Springer, vol. 22(4), pages 3411-3434, September.
    16. Aziz Karimov, 2013. "Productive Efficiency of Potato and Melon Growing Farms in Uzbekistan: A Two Stage Double Bootstrap Data Envelopment Analysis," Agriculture, MDPI, vol. 3(3), pages 1-13, September.
    17. Varabyova, Yauheniya & Schreyögg, Jonas, 2013. "International comparisons of the technical efficiency of the hospital sector: Panel data analysis of OECD countries using parametric and non-parametric approaches," Health Policy, Elsevier, vol. 112(1), pages 70-79.
    18. Kao, Chiang & Liu, Shiang-Tai, 2009. "Stochastic data envelopment analysis in measuring the efficiency of Taiwan commercial banks," European Journal of Operational Research, Elsevier, vol. 196(1), pages 312-322, July.
    19. Aparicio, Juan & Monge, Juan F., 2022. "The generalized range adjusted measure in data envelopment analysis: Properties, computational aspects and duality," European Journal of Operational Research, Elsevier, vol. 302(2), pages 621-632.
    20. Danish Ahmed SIDDIQUI & Qazi Masood AHMED, 2019. "Are institutions a crucial determinant of cross country economic efficiency? A two-stage double bootstrap data envelopment analysis," Theoretical and Applied Economics, Asociatia Generala a Economistilor din Romania / Editura Economica, vol. 0(1(618), S), pages 89-114, Spring.
