IDEAS home Printed from https://ideas.repec.org/a/eee/econom/v216y2020i1p268-283.html
   My bibliography  Save this article

Double machine learning with gradient boosting and its application to the Big N audit quality effect

Author

Listed:
  • Yang, Jui-Chung
  • Chuang, Hui-Ching
  • Kuan, Chung-Ming

Abstract

In this paper, we study the double machine learning (DML) approach of Chernozhukov et al. (2018) for estimating average treatment effect and apply this approach to examine the Big N audit quality effect in the accounting literature. This approach relies on machine learning methods and is suitable when a high dimensional nuisance function with many covariates is present in the model. This approach does not suffer from the “regularization bias” when a learning method with a proper convergence rate is used. We demonstrate by simulations that, for the DML approach, the gradient boosting method is fairly robust and to be preferred to other methods, such as regression tree, random forest, support vector regression machine, and the conventional Nadaraya–Watson nonparametric estimator. We then apply the DML approach with gradient boosting to estimate the Big N effect. We find that Big N auditors have a positive effect on audit quality and that this effect is not only statistically significant but also economically important. We further show that, in contrast to the results of propensity score matching, our estimates of said effect are quite robust to the hyper-parameters in the gradient boosting algorithm.

Suggested Citation

  • Yang, Jui-Chung & Chuang, Hui-Ching & Kuan, Chung-Ming, 2020. "Double machine learning with gradient boosting and its application to the Big N audit quality effect," Journal of Econometrics, Elsevier, vol. 216(1), pages 268-283.
  • Handle: RePEc:eee:econom:v:216:y:2020:i:1:p:268-283
    DOI: 10.1016/j.jeconom.2020.01.018
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0304407620300245
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.jeconom.2020.01.018?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Victor Chernozhukov & Denis Chetverikov & Mert Demirer & Esther Duflo & Christian Hansen & Whitney Newey & James Robins, 2018. "Double/debiased machine learning for treatment and structural parameters," Econometrics Journal, Royal Economic Society, vol. 21(1), pages 1-68, February.
    2. Myoung-Jae Lee, 2018. "Simple least squares estimator for treatment effects using propensity score residuals," Biometrika, Biometrika Trust, vol. 105(1), pages 149-164.
    3. Acemoglu,Daron & Arellano,Manuel & Dekel,Eddie (ed.), 2013. "Advances in Economics and Econometrics," Cambridge Books, Cambridge University Press, number 9781107016064, October.
    4. Ian D. Gow & David F. Larcker & Peter C. Reiss, 2016. "Causal Inference in Accounting Research," Journal of Accounting Research, Wiley Blackwell, vol. 54(2), pages 477-523, May.
    5. Acemoglu,Daron & Arellano,Manuel & Dekel,Eddie (ed.), 2013. "Advances in Economics and Econometrics," Cambridge Books, Cambridge University Press, number 9781107638105, October.
    6. Acemoglu,Daron & Arellano,Manuel & Dekel,Eddie (ed.), 2013. "Advances in Economics and Econometrics," Cambridge Books, Cambridge University Press, number 9781107016057, October.
    7. Roberts, Michael R. & Whited, Toni M., 2013. "Endogeneity in Empirical Corporate Finance1," Handbook of the Economics of Finance, in: G.M. Constantinides & M. Harris & R. M. Stulz (ed.), Handbook of the Economics of Finance, volume 2, chapter 0, pages 493-572, Elsevier.
    8. Alexandre Belloni & Victor Chernozhukov & Christian Hansen, 2014. "Inference on Treatment Effects after Selection among High-Dimensional Controlsâ€," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 81(2), pages 608-650.
    9. Acemoglu,Daron & Arellano,Manuel & Dekel,Eddie (ed.), 2013. "Advances in Economics and Econometrics," Cambridge Books, Cambridge University Press, number 9781107674165, October.
    10. Ye Luo & Martin Spindler & Jannis Kuck, 2016. "High-Dimensional $L_2$Boosting: Rate of Convergence," Papers 1602.08927, arXiv.org, revised Jul 2022.
    11. Kothari, S.P. & Leone, Andrew J. & Wasley, Charles E., 2005. "Performance matched discretionary accrual measures," Journal of Accounting and Economics, Elsevier, vol. 39(1), pages 163-197, February.
    12. Connie L. Becker & Mark L. Defond & James Jiambalvo & K.R. Subramanyam, 1998. "The Effect of Audit Quality on Earnings Management," Contemporary Accounting Research, John Wiley & Sons, vol. 15(1), pages 1-24, March.
    13. Gow, Ian D. & Larcker, David F. & Reiss, Peter C., 2016. "Causal Inference in Accounting Research," Research Papers 3393, Stanford University, Graduate School of Business.
    14. Acemoglu,Daron & Arellano,Manuel & Dekel,Eddie (ed.), 2013. "Advances in Economics and Econometrics," Cambridge Books, Cambridge University Press, number 9781107627314, October.
    15. Acemoglu,Daron & Arellano,Manuel & Dekel,Eddie (ed.), 2013. "Advances in Economics and Econometrics," Cambridge Books, Cambridge University Press, number 9781107016040, October.
    16. A. Belloni & V. Chernozhukov & I. Fernández‐Val & C. Hansen, 2017. "Program Evaluation and Causal Inference With High‐Dimensional Data," Econometrica, Econometric Society, vol. 85, pages 233-298, January.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Bilgin, Rumeysa, 2023. "The Selection Of Control Variables In Capital Structure Research With Machine Learning," SocArXiv e26qf, Center for Open Science.
    2. Yong Bian & Xiqian Wang & Qin Zhang, 2023. "How Does China's Household Portfolio Selection Vary with Financial Inclusion?," Papers 2311.01206, arXiv.org.
    3. Jonathan Fuhr & Philipp Berens & Dominik Papies, 2024. "Estimating Causal Effects with Double Machine Learning -- A Method Evaluation," Papers 2403.14385, arXiv.org, revised Apr 2024.
    4. Huber, Martin & Meier, Jonas & Wallimann, Hannes, 2022. "Business analytics meets artificial intelligence: Assessing the demand effects of discounts on Swiss train tickets," Transportation Research Part B: Methodological, Elsevier, vol. 163(C), pages 22-39.
    5. Pierfrancesco Alaimo Di Loro & Daria Scacciatelli & Giovanna Tagliaferri, 2023. "2-step Gradient Boosting approach to selectivity bias correction in tax audit: an application to the VAT gap in Italy," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 32(1), pages 237-270, March.
    6. Wang, Xiqian & Bian, Yong & Zhang, Qin, 2023. "The effect of cooking fuel choice on the elderly’s well-being: Evidence from two non-parametric methods," Energy Economics, Elsevier, vol. 125(C).
    7. Lu Kang & Jie Lv & Haoyang Zhang, 2024. "Can the Water Resource Fee-to-Tax Reform Promote the “Three-Wheel Drive” of Corporate Green Energy-Saving Innovations? Quasi-Natural Experimental Evidence from China," Energies, MDPI, vol. 17(12), pages 1-38, June.
    8. Jizhou Wang & Jin’an He & Richard Cebula & Maggie Foley & Fangping Peng, 2024. "Mixed ownership reform, political connections, and overinvestment," American Journal of Economics and Sociology, Wiley Blackwell, vol. 83(2), pages 407-425, March.
    9. Xinyu Wei & Mingwang Cheng & Kaifeng Duan & Xiangxing Kong, 2024. "Effects of Big Data on PM 2.5 : A Study Based on Double Machine Learning," Land, MDPI, vol. 13(3), pages 1-21, March.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Denis Fougère & Nicolas Jacquemet, 2020. "Policy Evaluation Using Causal Inference Methods," SciencePo Working papers Main hal-03455978, HAL.
    2. Kaspar Wüthrich, 2020. "A Comparison of Two Quantile Models With Endogeneity," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 38(2), pages 443-456, April.
    3. Alexandre Belloni & Victor Chernozhukov & Ying Wei, 2016. "Post-Selection Inference for Generalized Linear Models With Many Controls," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 34(4), pages 606-619, October.
    4. Pushan Dutt & Ilia Tsetlin, 2021. "Income distribution and economic development: Insights from machine learning," Economics and Politics, Wiley Blackwell, vol. 33(1), pages 1-36, March.
    5. Elsby, Michael W.L. & Hobijn, Bart & Şahin, Ayşegül, 2015. "On the importance of the participation margin for labor market fluctuations," Journal of Monetary Economics, Elsevier, vol. 72(C), pages 64-82.
    6. Wen Xu, 2016. "Estimation of Dynamic Panel Data Models with Stochastic Volatility Using Particle Filters," Econometrics, MDPI, vol. 4(4), pages 1-13, October.
    7. Özgür Orhangazi & A. Erinç Yeldan, 2021. "The Re‐making of the Turkish Crisis," Development and Change, International Institute of Social Studies, vol. 52(3), pages 460-503, May.
    8. Alessandra Bonfiglioli & Rosario Crinò & Gino Gancia, 2018. "Firms and Economic Performance: A view from Trade," Working Papers 1034, Barcelona School of Economics.
    9. Guriev, Sergei & Treisman, Daniel, 2020. "A theory of informational autocracy," Journal of Public Economics, Elsevier, vol. 186(C).
    10. Ufuk Akcigit & Sina T. Ates & Giammario Impullitti, 2018. "Innovation and Trade Policy in a Globalized World," NBER Working Papers 24543, National Bureau of Economic Research, Inc.
    11. Daron Acemoglu & Gino Gancia & Fabrizio Zilibotti, 2015. "Offshoring and Directed Technical Change," American Economic Journal: Macroeconomics, American Economic Association, vol. 7(3), pages 84-122, July.
    12. Makoto Shimoji, 2016. "Rationalizable Persuasion," Discussion Papers 16/08, Department of Economics, University of York.
    13. Ivan Balbuzanov, 2019. "Lies and consequences," International Journal of Game Theory, Springer;Game Theory Society, vol. 48(4), pages 1203-1240, December.
    14. Raphael Corbi & Fabio Miessi Sanches, 2022. "Church Competition, Religious Subsidies and the Rise of Evangelicalism: a Dynamic Structural Analysis," Working Papers, Department of Economics 2022_09, University of São Paulo (FEA-USP).
    15. Shirai, Daichi, 2016. "Persistence and Amplification of Financial Frictions," MPRA Paper 72187, University Library of Munich, Germany.
    16. Haiwen Zhou, 2018. "Impact of international trade on unemployment under oligopoly," The Journal of International Trade & Economic Development, Taylor & Francis Journals, vol. 27(4), pages 365-379, May.
    17. Benchimol, Jonathan & Ivashchenko, Sergey, 2021. "Switching volatility in a nonlinear open economy," Journal of International Money and Finance, Elsevier, vol. 110(C).
    18. Lefgren, Lars J. & Stoddard, Olga B. & Stovall, John E., 2021. "Rationalizing self-defeating behaviors: Theory and evidence," Journal of Health Economics, Elsevier, vol. 76(C).
    19. Liang, Yan, 2022. "Impact of financial development on outsourcing and aggregate productivity," Journal of Development Economics, Elsevier, vol. 154(C).
    20. Jerzmanowski, Michal & Tamura, Robert, 2019. "Directed technological change & cross-country income differences: A quantitative analysis," Journal of Development Economics, Elsevier, vol. 141(C).

    More about this item

    Keywords

    Audit quality; Average treatment effect; Big N effect; Double machine learning; Gradient boosting; Performance-matched discretionary accruals;
    All these keywords.

    JEL classification:

    • C14 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - Semiparametric and Nonparametric Methods: General
    • C31 - Mathematical and Quantitative Methods - - Multiple or Simultaneous Equation Models; Multiple Variables - - - Cross-Sectional Models; Spatial Models; Treatment Effect Models; Quantile Regressions; Social Interaction Models
    • M42 - Business Administration and Business Economics; Marketing; Accounting; Personnel Economics - - Accounting - - - Auditing

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:econom:v:216:y:2020:i:1:p:268-283. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/jeconom .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.