IDEAS home Printed from https://ideas.repec.org/a/kap/jproda/v53y2020i2d10.1007_s11123-019-00570-9.html
   My bibliography  Save this article

Insights from machine learning for evaluating production function estimators on manufacturing survey data

Author

Listed:
  • José Luis Preciado Arreola

    (Texas A&M University
    Tecnológico de Monterrey)

  • Daisuke Yagi

    (Texas A&M University)

  • Andrew L. Johnson

    (Texas A&M University
    Osaka University)

Abstract

National statistical organizations often rely on non-exhaustive surveys to estimate industry-level production functions in years in which a full census is not conducted. When analyzing data from non-census years, we propose selecting an estimator based on a weighting of its in-sample and predictive performance. We compare Cobb–Douglas functional assumption to existing nonparametric shape constrained estimators and a newly proposed estimator. For simulated data, we find that our proposed estimator has the lowest weighted errors. Using the 2010 Chilean Annual National Industrial Survey, a Cobb–Douglas specification describes at least 90% as much variance as the best alternative estimators in practically all cases considered providing two insights: the benefits of using application data for selecting an estimator, and the benefits of structure in noisy data. Finally for the five largest manufacturing industries, we find that a 30% sample, on average, achieves 60% of the R-squared value that would have been achieved with a full census; however, the variance across industries and samples is large.

Suggested Citation

  • José Luis Preciado Arreola & Daisuke Yagi & Andrew L. Johnson, 2020. "Insights from machine learning for evaluating production function estimators on manufacturing survey data," Journal of Productivity Analysis, Springer, vol. 53(2), pages 181-225, April.
  • Handle: RePEc:kap:jproda:v:53:y:2020:i:2:d:10.1007_s11123-019-00570-9
    DOI: 10.1007/s11123-019-00570-9
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11123-019-00570-9
    File Function: Abstract
    Download Restriction: Access to full text is restricted to subscribers.

    File URL: https://libkey.io/10.1007/s11123-019-00570-9?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Léopold Simar & Valentin Zelenyuk, 2011. "Stochastic FDH/DEA estimators for frontier analysis," Journal of Productivity Analysis, Springer, vol. 36(1), pages 1-20, August.
    2. Jan De Loecker & Pinelopi K. Goldberg & Amit K. Khandelwal & Nina Pavcnik, 2016. "Prices, Markups, and Trade Reform," Econometrica, Econometric Society, vol. 84, pages 445-510, March.
    3. Afriat, Sidney N, 1972. "Efficiency Estimation of Production Function," International Economic Review, Department of Economics, University of Pennsylvania and Osaka University Institute of Social and Economic Research Association, vol. 13(3), pages 568-598, October.
    4. Chad Syverson, 2011. "What Determines Productivity?," Journal of Economic Literature, American Economic Association, vol. 49(2), pages 326-365, June.
    5. Mark Andor & Christopher Parmeter, 2017. "Pseudolikelihood estimation of the stochastic frontier model," Applied Economics, Taylor & Francis Journals, vol. 49(55), pages 5651-5661, November.
    6. Kuosmanen, Timo & Saastamoinen, Antti & Sipiläinen, Timo, 2013. "What is the best practice for benchmark regulation of electricity distribution? Comparison of DEA, SFA and StoNED methods," Energy Policy, Elsevier, vol. 61(C), pages 740-750.
    7. Rahul Mazumder & Arkopal Choudhury & Garud Iyengar & Bodhisattva Sen, 2019. "A Computational Framework for Multivariate Convex Regression and Its Variants," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 114(525), pages 318-331, January.
    8. Lucia Foster & John Haltiwanger & Chad Syverson, 2008. "Reallocation, Firm Turnover, and Efficiency: Selection on Productivity or Profitability?," American Economic Review, American Economic Association, vol. 98(1), pages 394-425, March.
    9. Lee, Chia-Yen & Johnson, Andrew L. & Moreno-Centeno, Erick & Kuosmanen, Timo, 2013. "A more efficient algorithm for Convex Nonparametric Least Squares," European Journal of Operational Research, Elsevier, vol. 227(2), pages 391-400.
    10. Teresa C Fort & John Haltiwanger & Ron S Jarmin & Javier Miranda, 2013. "How Firms Respond to Business Cycles: The Role of Firm Age and Firm Size," IMF Economic Review, Palgrave Macmillan;International Monetary Fund, vol. 61(3), pages 520-559, August.
    11. Léopold Simar & Ingrid Keilegom & Valentin Zelenyuk, 2017. "Nonparametric least squares methods for stochastic frontier models," Journal of Productivity Analysis, Springer, vol. 47(3), pages 189-204, June.
    12. Timo Kuosmanen, 2008. "Representation theorem for convex nonparametric least squares," Econometrics Journal, Royal Economic Society, vol. 11(2), pages 308-325, July.
    13. Chambers,Robert G., 1988. "Applied Production Analysis," Cambridge Books, Cambridge University Press, number 9780521314275, October.
    14. Bradley Efron, 2004. "The Estimation of Prediction Error: Covariance Penalties and Cross-Validation," Journal of the American Statistical Association, American Statistical Association, vol. 99, pages 619-632, January.
    15. Olesen, Ole B. & Ruggiero, John, 2014. "Maintaining the Regular Ultra Passum Law in data envelopment analysis," European Journal of Operational Research, Elsevier, vol. 235(3), pages 798-809.
    16. Mark Andor & Frederik Hesse, 2014. "The StoNED age: the departure into a new era of efficiency analysis? A monte carlo comparison of StoNED and the “oldies” (SFA and DEA)," Journal of Productivity Analysis, Springer, vol. 41(1), pages 85-109, February.
    17. Chen, Xiaohong, 2007. "Large Sample Sieve Estimation of Semi-Nonparametric Models," Handbook of Econometrics, in: J.J. Heckman & E.E. Leamer (ed.), Handbook of Econometrics, edition 1, volume 6, chapter 76, Elsevier.
    18. Olesen, O.B. & Ruggiero, J., 2018. "An improved Afriat–Diewert–Parkan nonparametric production function estimator," European Journal of Operational Research, Elsevier, vol. 264(3), pages 1172-1188.
    19. Aigner, Dennis & Lovell, C. A. Knox & Schmidt, Peter, 1977. "Formulation and estimation of stochastic frontier production function models," Journal of Econometrics, Elsevier, vol. 6(1), pages 21-37, July.
    20. Timo Kuosmanen & Mika Kortelainen, 2012. "Stochastic non-smooth envelopment of data: semi-parametric frontier estimation subject to shape constraints," Journal of Productivity Analysis, Springer, vol. 38(1), pages 11-28, August.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Andor, Mark A. & Parmeter, Christopher & Sommer, Stephan, 2019. "Combining uncertainty with uncertainty to get certainty? Efficiency analysis for regulation purposes," European Journal of Operational Research, Elsevier, vol. 274(1), pages 240-252.
    2. Kuosmanen, Timo & Johnson, Andrew, 2017. "Modeling joint production of multiple outputs in StoNED: Directional distance function approach," European Journal of Operational Research, Elsevier, vol. 262(2), pages 792-801.
    3. Julia Schaefer & Marcel Clermont, 2018. "Stochastic non-smooth envelopment of data for multi-dimensional output," Journal of Productivity Analysis, Springer, vol. 50(3), pages 139-154, December.
    4. Léopold Simar & Paul W. Wilson, 2015. "Statistical Approaches for Non-parametric Frontier Models: A Guided Tour," International Statistical Review, International Statistical Institute, vol. 83(1), pages 77-110, April.
    5. Andor, Mark A. & Parmeter, Christopher & Sommer, Stephan, 2019. "Combining uncertainty with uncertainty to get certainty? Efficiency analysis for regulation purposes," European Journal of Operational Research, Elsevier, vol. 274(1), pages 240-252.
    6. Sickles, Robin C. & Song, Wonho & Zelenyuk, Valentin, 2018. "Econometric Analysis of Productivity: Theory and Implementation in R," Working Papers 18-008, Rice University, Department of Economics.
    7. Parmeter, Christopher F., 2021. "Is it MOLS or COLS?," Efficiency Series Papers 2021/04, University of Oviedo, Department of Economics, Oviedo Efficiency Group (OEG).
    8. Minegishi, Kota, 2013. "Explaining Production Heterogeneity By Contextual Environments: Two-Stage DEA Application to Technical Change Measurement," 2013 Annual Meeting, August 4-6, 2013, Washington, D.C. 150289, Agricultural and Applied Economics Association.
    9. Lee, Chia-Yen & Wang, Ke, 2019. "Nash marginal abatement cost estimation of air pollutant emissions using the stochastic semi-nonparametric frontier," European Journal of Operational Research, Elsevier, vol. 273(1), pages 390-400.
    10. Marijn Verschelde & Michel Dumont & Glenn Rayp & Bruno Merlevede, 2016. "Semiparametric stochastic metafrontier efficiency of European manufacturing firms," Journal of Productivity Analysis, Springer, vol. 45(1), pages 53-69, February.
    11. Layer, Kevin & Johnson, Andrew L. & Sickles, Robin C. & Ferrier, Gary D., 2020. "Direction selection in stochastic directional distance functions," European Journal of Operational Research, Elsevier, vol. 280(1), pages 351-364.
    12. Mike Tsionas & Valentin Zelenyuk, 2021. "Goodness-of-fit in Optimizing Models of Production: A Generalization with a Bayesian Perspective," CEPA Working Papers Series WP182021, School of Economics, University of Queensland, Australia.
    13. Preciado Arreola, José Luis & Johnson, Andrew L. & Chen, Xun C. & Morita, Hiroshi, 2020. "Estimating stochastic production frontiers: A one-stage multivariate semiparametric Bayesian concave regression method," European Journal of Operational Research, Elsevier, vol. 287(2), pages 699-711.
    14. Keshvari, Abolfazl & Kuosmanen, Timo, 2013. "Stochastic non-convex envelopment of data: Applying isotonic regression to frontier estimation," European Journal of Operational Research, Elsevier, vol. 231(2), pages 481-491.
    15. Lee, Chia-Yen & Cai, Jia-Ying, 2020. "LASSO variable selection in data envelopment analysis with small datasets," Omega, Elsevier, vol. 91(C).
    16. Ferrara, Giancarlo & Vidoli, Francesco, 2017. "Semiparametric stochastic frontier models: A generalized additive model approach," European Journal of Operational Research, Elsevier, vol. 258(2), pages 761-777.
    17. Ahn, Heinz & Clermont, Marcel & Langner, Julia, 2023. "Comparative performance analysis of frontier-based efficiency measurement methods – A Monte Carlo simulation," European Journal of Operational Research, Elsevier, vol. 307(1), pages 294-312.
    18. Caitlin O’Loughlin & Léopold Simar & Paul W. Wilson, 2023. "Methodologies for assessing government efficiency," Chapters, in: António Afonso & João Tovar Jalles & Ana Venâncio (ed.), Handbook on Public Sector Efficiency, chapter 4, pages 72-101, Edward Elgar Publishing.
    19. Mark Andor & Frederik Hesse, "undated". "The StoNED age: The Departure Into a New Era of Efficiency Analysis? An MC study Comparing StoNED and the "Oldies" (SFA and DEA)," Working Papers 201285, Institute of Spatial and Housing Economics, Munster Universitary.
    20. Kenneth Rødseth & Eirik Romstad, 2014. "Environmental Regulations, Producer Responses, and Secondary Benefits: Carbon Dioxide Reductions Under the Acid Rain Program," Environmental & Resource Economics, Springer;European Association of Environmental and Resource Economists, vol. 59(1), pages 111-135, September.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:kap:jproda:v:53:y:2020:i:2:d:10.1007_s11123-019-00570-9. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.