A Simple and General Debiased Machine Learning Theorem with Finite Sample Guarantees

Author

Victor Chernozhukov
Whitney K. Newey
Rahul Singh

Abstract

Debiased machine learning is a meta-algorithm based on bias correction and sample splitting to calculate confidence intervals for functionals, i.e., scalar summaries, of machine learning algorithms. For example, an analyst may desire the confidence interval for a treatment effect estimated with a neural network. We provide a nonasymptotic debiased machine learning theorem that encompasses any global or local functional of any machine learning algorithm that satisfies a few simple, interpretable conditions. Formally, we prove consistency, Gaussian approximation, and semiparametric efficiency by finite-sample arguments. The rate of convergence is $n^{-1/2}$ for global functionals, and it degrades gracefully for local functionals. Our results culminate in a simple set of conditions that an analyst can use to translate modern learning theory rates into traditional statistical inference. The conditions reveal a general double robustness property for ill-posed inverse problems.
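
The two ingredients named in the abstract, bias correction and sample splitting, can be illustrated with a minimal sketch for one global functional, the average treatment effect. This is not the authors' implementation: the function names (`simulate`, `fit_nuisances`, `dml_ate`), the toy data-generating process, and the use of cell means as a stand-in for a machine learning algorithm are all assumptions made for illustration. The sketch fits the regression and the propensity score on training folds, evaluates the bias-corrected score on the held-out fold, and forms a Gaussian confidence interval from the averaged scores.

```python
import math
import random
from statistics import NormalDist


def simulate(n, seed=0):
    """Hypothetical toy data: binary covariate x, binary treatment d, outcome y.

    The true average treatment effect is 2.0 by construction.
    """
    rng = random.Random(seed)
    data = []
    for _ in range(n):
        x = rng.randint(0, 1)
        p = 0.3 + 0.4 * x                      # true propensity score
        d = 1 if rng.random() < p else 0
        y = 2.0 * d + 1.0 * x + rng.gauss(0.0, 1.0)
        data.append((x, d, y))
    return data


def cell_mean(values, default):
    return sum(values) / len(values) if values else default


def fit_nuisances(train):
    """Stand-in learners (an assumption): cell means instead of an ML algorithm."""
    overall = cell_mean([y for _, _, y in train], 0.0)
    reg = {(d, x): cell_mean([y for xi, di, y in train if xi == x and di == d], overall)
           for d in (0, 1) for x in (0, 1)}
    prop = {x: cell_mean([d for xi, d, _ in train if xi == x], 0.5) for x in (0, 1)}
    # Clip the estimated propensity away from 0 and 1 to keep the weights bounded.
    prop = {x: min(max(p, 0.01), 0.99) for x, p in prop.items()}
    return reg, prop


def dml_ate(data, n_folds=5, level=0.95):
    """Cross-fitted, bias-corrected estimate of the average treatment effect."""
    n = len(data)
    # Interleaved folds; the simulated observations are i.i.d., so no shuffle is needed.
    folds = [list(range(k, n, n_folds)) for k in range(n_folds)]
    scores = [0.0] * n
    for fold in folds:
        held_out = set(fold)
        train = [data[i] for i in range(n) if i not in held_out]
        reg, prop = fit_nuisances(train)       # nuisances never see the held-out fold
        for i in fold:
            x, d, y = data[i]
            plug_in = reg[(1, x)] - reg[(0, x)]
            # Riesz representer of the ATE: inverse propensity weights.
            riesz = d / prop[x] - (1 - d) / (1 - prop[x])
            scores[i] = plug_in + riesz * (y - reg[(d, x)])  # bias correction
    theta = sum(scores) / n
    variance = sum((s - theta) ** 2 for s in scores) / n
    se = math.sqrt(variance / n)
    z = NormalDist().inv_cdf(0.5 + level / 2)
    return theta, se, (theta - z * se, theta + z * se)


if __name__ == "__main__":
    estimate, std_err, interval = dml_ate(simulate(2000, seed=0))
    print(estimate, std_err, interval)
```

In practice the cell means would be replaced by whatever learner the analyst prefers; the cross-fitting loop and the score are unchanged, which is the sense in which the procedure is a meta-algorithm.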

Suggested Citation

Victor Chernozhukov & Whitney K. Newey & Rahul Singh, 2021. "A Simple and General Debiased Machine Learning Theorem with Finite Sample Guarantees," Papers 2105.15197, arXiv.org, revised Oct 2022.

Handle: RePEc:arx:papers:2105.15197

Other versions of this item:

V Chernozhukov & W K Newey & R Singh, 2023. "A simple and general debiased machine learning theorem with finite-sample guarantees," Biometrika, Biometrika Trust, vol. 110(1), pages 257-264.

References listed on IDEAS

Victor Chernozhukov & Denis Chetverikov & Mert Demirer & Esther Duflo & Christian Hansen & Whitney Newey & James Robins, 2018. "Double/debiased machine learning for treatment and structural parameters," Econometrics Journal, Royal Economic Society, vol. 21(1), pages 1-68, February.
- Victor Chernozhukov & Denis Chetverikov & Mert Demirer & Esther Duflo & Christian Hansen & Whitney Newey & James Robins, 2017. "Double/Debiased Machine Learning for Treatment and Structural Parameters," NBER Working Papers 23564, National Bureau of Economic Research, Inc.
- Victor Chernozhukov & Denis Chetverikov & Mert Demirer & Esther Duflo & Christian Hansen & Whitney K. Newey & James Robins, 2017. "Double/debiased machine learning for treatment and structural parameters," CeMMAP working papers CWP28/17, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
- Victor Chernozhukov & Denis Chetverikov & Mert Demirer & Esther Duflo & Christian Hansen & Whitney K. Newey & James Robins, 2017. "Double/debiased machine learning for treatment and structural parameters," CeMMAP working papers 28/17, Institute for Fiscal Studies.
Andrews, Donald W K, 1994. "Asymptotics for Semiparametric Econometric Models via Stochastic Equicontinuity," Econometrica, Econometric Society, vol. 62(1), pages 43-72, January.
Robinson, Peter M, 1988. "Root-N-Consistent Semiparametric Regression," Econometrica, Econometric Society, vol. 56(4), pages 931-954, July.
Severini, Thomas A. & Tripathi, Gautam, 2012. "Efficiency bounds for estimating linear functionals of nonparametric regression models with endogenous regressors," Journal of Econometrics, Elsevier, vol. 170(2), pages 491-498.
- Thomas A. Severini & Gautam Tripathi, 2007. "Efficiency Bounds for Estimating Linear Functionals of Nonparametric Regression Models with Endogenous Regressors," Working papers 2007-18, University of Connecticut, Department of Economics.
- Thomas A. Severini & Gautam Tripathi, 2007. "Efficiency bounds for estimating linear functionals of nonparametric regression models with endogenous regressors," CeMMAP working papers CWP13/07, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
Newey, Whitney K, 1994. "The Asymptotic Variance of Semiparametric Estimators," Econometrica, Econometric Society, vol. 62(6), pages 1349-1382, November.
- Newey, W.K., 1989. "The Asymptotic Variance of Semiparametric Estimators," Papers 346, Princeton, Department of Economics - Econometric Research Program.
- Newey, W.K., 1991. "The Asymptotic Variance of Semiparametric Estimators," Working papers 583, Massachusetts Institute of Technology (MIT), Department of Economics.
Rahul Singh, 2021. "Kernel Ridge Riesz Representers: Generalization, Mis-specification, and the Counterfactual Effective Dimension," Papers 2102.11076, arXiv.org, revised Jul 2024.
Nishanth Dikkala & Greg Lewis & Lester Mackey & Vasilis Syrgkanis, 2020. "Minimax Estimation of Conditional Moment Models," Papers 2006.07201, arXiv.org.
Jason Abrevaya & Yu-Chin Hsu & Robert P. Lieli, 2015. "Estimating Conditional Average Treatment Effects," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 33(4), pages 485-505, October.
- Jason Abrevaya & Yu-Chin Hsu & Robert P. Lieli, 2012. "Estimating Conditional Average Treatment Effects," CEU Working Papers 2012_16, Department of Economics, Central European University, revised 20 Jul 2012.
Victor Chernozhukov & Juan Carlos Escanciano & Hidehiko Ichimura & Whitney K. Newey & James M. Robins, 2022. "Locally Robust Semiparametric Estimation," Econometrica, Econometric Society, vol. 90(4), pages 1501-1535, July.
- Victor Chernozhukov & Juan Carlos Escanciano & Hidehiko Ichimura & Whitney K. Newey, 2016. "Locally robust semiparametric estimation," CeMMAP working papers CWP31/16, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
- Victor Chernozhukov & Juan Carlos Escanciano & Hidehiko Ichimura & Whitney K. Newey, 2016. "Locally robust semiparametric estimation," CeMMAP working papers 31/16, Institute for Fiscal Studies.
- Victor Chernozhukov & Juan Carlos Escanciano & Hidehiko Ichimura & Whitney K. Newey & James M. Robins, 2018. "Locally robust semiparametric estimation," CeMMAP working papers CWP30/18, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
- Victor Chernozhukov & Juan Carlos Escanciano & Hidehiko Ichimura & Whitney K. Newey & James M. Robins, 2016. "Locally Robust Semiparametric Estimation," Papers 1608.00033, arXiv.org, revised Aug 2020.
Whitney K. Newey & Fushing Hsieh & James M. Robins, 2004. "Twicing Kernels and a Small Bias Property of Semiparametric Estimators," Econometrica, Econometric Society, vol. 72(3), pages 947-962, May.
Yoichi Arai & Taisuke Otsu & Myung Hwan Seo, 2019. "Causal inference on regression discontinuity designs by high-dimensional methods," STICERD - Econometrics Paper Series 601, Suntory and Toyota International Centres for Economics and Related Disciplines, LSE.
Newey, Whitney K., 1994. "Kernel Estimation of Partial Means and a General Variance Estimator," Econometric Theory, Cambridge University Press, vol. 10(2), pages 1-21, June.
- Newey, W.K., 1992. "Kernel Estimation of Partial Means and a General Variance Estimator," Working papers 93-3, Massachusetts Institute of Technology (MIT), Department of Economics.
Victor Chernozhukov & Whitney Newey & Rahul Singh & Vasilis Syrgkanis, 2020. "Adversarial Estimation of Riesz Representers," Papers 2101.00009, arXiv.org, revised Apr 2024.
Whitney K. Newey & James L. Powell, 2003. "Instrumental Variable Estimation of Nonparametric Models," Econometrica, Econometric Society, vol. 71(5), pages 1565-1578, September.
Chunrong Ai & Xiaohong Chen, 2003. "Efficient Estimation of Models with Conditional Moment Restrictions Containing Unknown Functions," Econometrica, Econometric Society, vol. 71(6), pages 1795-1843, November.

Citations

Cited by:

Jikai Jin & Vasilis Syrgkanis, 2024. "Structure-agnostic Optimality of Doubly Robust Learning for Treatment Effect Estimation," Papers 2402.14264, arXiv.org, revised Mar 2024.
Andrew Bennett & Nathan Kallus & Xiaojie Mao & Whitney Newey & Vasilis Syrgkanis & Masatoshi Uehara, 2023. "Source Condition Double Robust Inference on Functionals of Inverse Problems," Papers 2307.13793, arXiv.org.
Rahul Singh & Liyuan Xu & Arthur Gretton, 2021. "Sequential Kernel Embedding for Mediated and Time-Varying Dose Response Curves," Papers 2111.03950, arXiv.org, revised Mar 2025.
Dmitry Arkhangelsky & Kazuharu Yanagimoto & Tom Zohar, 2024. "Using Event Studies as an Outcome in Causal Analysis," Papers 2403.19563, arXiv.org, revised Jan 2025.
- Dmitry Arkhangelsky & Kazuharu Yanagimoto & Tom Zohar, 2025. "Using Event Studies as an Outcome in Causal Analysis," Working Papers wp2025_2503, CEMFI.
Rahul Singh, 2021. "Generalized Kernel Ridge Regression for Causal Inference with Missing-at-Random Sample Selection," Papers 2111.05277, arXiv.org.
David Bruns-Smith & Oliver Dukes & Avi Feller & Elizabeth L. Ogburn, 2023. "Augmented balancing weights as linear regression," Papers 2304.14545, arXiv.org, revised Jun 2024.
Isaac Meza & Rahul Singh, 2021. "Nested Nonparametric Instrumental Variable Regression: Long Term, Mediated, and Time Varying Treatment Effects," Papers 2112.14249, arXiv.org, revised Mar 2024.

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Isaac Meza & Rahul Singh, 2021. "Nested Nonparametric Instrumental Variable Regression: Long Term, Mediated, and Time Varying Treatment Effects," Papers 2112.14249, arXiv.org, revised Mar 2024.
Victor Chernozhukov & Juan Carlos Escanciano & Hidehiko Ichimura & Whitney K. Newey & James M. Robins, 2022. "Locally Robust Semiparametric Estimation," Econometrica, Econometric Society, vol. 90(4), pages 1501-1535, July.
- Victor Chernozhukov & Juan Carlos Escanciano & Hidehiko Ichimura & Whitney K. Newey, 2016. "Locally robust semiparametric estimation," CeMMAP working papers CWP31/16, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
- Victor Chernozhukov & Juan Carlos Escanciano & Hidehiko Ichimura & Whitney K. Newey & James M. Robins, 2016. "Locally Robust Semiparametric Estimation," Papers 1608.00033, arXiv.org, revised Aug 2020.
- Victor Chernozhukov & Juan Carlos Escanciano & Hidehiko Ichimura & Whitney K. Newey & James M. Robins, 2018. "Locally robust semiparametric estimation," CeMMAP working papers CWP30/18, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
- Victor Chernozhukov & Juan Carlos Escanciano & Hidehiko Ichimura & Whitney K. Newey, 2016. "Locally robust semiparametric estimation," CeMMAP working papers 31/16, Institute for Fiscal Studies.
Hidehiko Ichimura & Whitney K. Newey, 2022. "The influence function of semiparametric estimators," Quantitative Economics, Econometric Society, vol. 13(1), pages 29-61, January.
- Hidehiko Ichimura & Whitney K. Newey, 2015. "The influence function of semiparametric estimators," CeMMAP working papers 44/15, Institute for Fiscal Studies.
- Hidehiko Ichimura & Whitney K. Newey, 2015. "The influence function of semiparametric estimators," CeMMAP working papers CWP44/15, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
- Hidehiko Ichimura & Whitney K. Newey, 2015. "The Influence Function of Semiparametric Estimators," CIRJE F-Series CIRJE-F-985, CIRJE, Faculty of Economics, University of Tokyo.
- Hidehiko Ichimura & Whitney K. Newey, 2017. "The influence function of semiparametric estimators," CeMMAP working papers 06/17, Institute for Fiscal Studies.
- Hidehiko Ichimura & Whitney K. Newey, 2017. "The influence function of semiparametric estimators," CeMMAP working papers CWP06/17, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
Ganesh Karapakula, 2023. "Stable Probability Weighting: Large-Sample and Finite-Sample Estimation and Inference Methods for Heterogeneous Causal Effects of Multivalued Treatments Under Limited Overlap," Papers 2301.05703, arXiv.org, revised Jan 2023.
Victor Chernozhukov & Whitney Newey & Rahul Singh & Vasilis Syrgkanis, 2020. "Adversarial Estimation of Riesz Representers," Papers 2101.00009, arXiv.org, revised Apr 2024.
Rahul Singh, 2021. "Kernel Ridge Riesz Representers: Generalization, Mis-specification, and the Counterfactual Effective Dimension," Papers 2102.11076, arXiv.org, revised Jul 2024.
Anish Agarwal & Rahul Singh, 2021. "Causal Inference with Corrupted Data: Measurement Error, Missing Values, Discretization, and Differential Privacy," Papers 2107.02780, arXiv.org, revised Feb 2024.
Qizhao Chen & Vasilis Syrgkanis & Morgane Austern, 2022. "Debiased Machine Learning without Sample-Splitting for Stable Estimators," Papers 2206.01825, arXiv.org, revised Nov 2022.
Dong, Chaohua & Gao, Jiti & Linton, Oliver, 2023. "High dimensional semiparametric moment restriction models," Journal of Econometrics, Elsevier, vol. 232(2), pages 320-345.
- Chaohua Dong & Jiti Gao & Oliver Linton, 2017. "High dimensional semiparametric moment restriction models," Monash Econometrics and Business Statistics Working Papers 17/17, Monash University, Department of Econometrics and Business Statistics.
- Chaohua Dong & Jiti Gao & Oliver Linton, 2018. "High dimensional semiparametric moment restriction models," CeMMAP working papers CWP69/18, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
- Chaohua Dong & Jiti Gao & Oliver Linton, 2018. "High dimensional semiparametric moment restriction models," CeMMAP working papers CWP04/18, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
- Dong, C. & Gao, J. & Linton, O., 2018. "High Dimensional Semiparametric Moment Restriction Models," Cambridge Working Papers in Economics 1881, Faculty of Economics, University of Cambridge.
- Chaohua Dong & Jiti Gao & Oliver Linton, 2018. "High dimensional semiparametric moment restriction models," Monash Econometrics and Business Statistics Working Papers 23/18, Monash University, Department of Econometrics and Business Statistics.
Taisuke Otsu & Mengshan Xu, 2022. "Isotonic propensity score matching," STICERD - Econometrics Paper Series 623, Suntory and Toyota International Centres for Economics and Related Disciplines, LSE.
Liu, Lin & Mukherjee, Rajarshi & Robins, James M., 2024. "Assumption-lean falsification tests of rate double-robustness of double-machine-learning estimators," Journal of Econometrics, Elsevier, vol. 240(2).
Mengshan Xu & Taisuke Otsu, 2022. "Isotonic propensity score matching," Papers 2207.08868, arXiv.org, revised Jan 2025.
Qi Li & Jeffrey Scott Racine, 2006. "Nonparametric Econometrics: Theory and Practice," Economics Books, Princeton University Press, edition 1, volume 1, number 8355.
Kyle Colangelo & Ying-Ying Lee, 2019. "Double debiased machine learning nonparametric inference with continuous treatments," CeMMAP working papers CWP72/19, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
Gayle, Wayne-Roy & Namoro, Soiliou Daw, 2013. "Estimation of a nonlinear panel data model with semiparametric individual effects," Journal of Econometrics, Elsevier, vol. 175(1), pages 46-59.
Michael Jansson & Demian Pouzo, 2017. "Towards a General Large Sample Theory for Regularized Estimators," Papers 1712.07248, arXiv.org, revised Jul 2020.
- Michael Jansson & Demian Pouzo, 2019. "Towards a general large sample theory for regularized estimators," CeMMAP working papers CWP63/19, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
Gayle, George-Levi & Viauroux, Christelle, 2007. "Root-N consistent semiparametric estimators of a dynamic panel-sample-selection model," Journal of Econometrics, Elsevier, vol. 141(1), pages 179-212, November.
- George-Levi Gayle & Christelle Viauroux, "undated". "Root-N Consistent Semiparametric Estimators of a Dynamic Panel Sample Selection Model," GSIA Working Papers 2004-E62, Carnegie Mellon University, Tepper School of Business.
- Christelle Viauroux & G.L. Gayle, 2004. "Root-N Consistent Semiparametric Estimators of a Dynamic Panel Sample Selection Model," University of Cincinnati, Economics Working Papers Series 2004-05, University of Cincinnati, Department of Economics.
Agboola, Oluwagbenga David & Yu, Han, 2023. "Neighborhood-based cross fitting approach to treatment effects with high-dimensional data," Computational Statistics & Data Analysis, Elsevier, vol. 186(C).
Kyle Colangelo & Ying-Ying Lee, 2020. "Double Debiased Machine Learning Nonparametric Inference with Continuous Treatments," Papers 2004.03036, arXiv.org, revised Sep 2023.
Abhinandan Dalal & Patrick Blobaum & Shiva Kasiviswanathan & Aaditya Ramdas, 2024. "Anytime-Valid Inference for Double/Debiased Machine Learning of Causal Parameters," Papers 2408.09598, arXiv.org, revised Sep 2024.

More about this item

NEP fields

This paper has been announced in the following NEP Reports:

NEP-BIG-2021-06-21 (Big Data)
NEP-CMP-2021-06-21 (Computational Economics)
NEP-ECM-2021-06-21 (Econometrics)
