Debiased Machine Learning without Sample-Splitting for Stable Estimators

Author

Qizhao Chen
Vasilis Syrgkanis
Morgane Austern

Abstract

Estimation and inference on causal parameters is typically reduced to a generalized method of moments problem, which involves auxiliary functions that correspond to solutions to a regression or classification problem. A recent line of work on debiased machine learning shows how one can use generic machine learning estimators for these auxiliary problems while maintaining asymptotic normality and root-$n$ consistency of the target parameter of interest, requiring only mean-squared-error guarantees from the auxiliary estimation algorithms. The literature typically requires that these auxiliary problems be fitted on a separate sample or in a cross-fitting manner. We show that when these auxiliary estimation algorithms satisfy natural leave-one-out stability properties, sample splitting is not required. This allows for sample reuse, which can be beneficial in moderately sized sample regimes. For instance, we show that the stability properties we propose are satisfied by ensemble bagged estimators built via sub-sampling without replacement, a popular technique in machine learning practice.
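As a rough illustration of the idea in the abstract, the sketch below estimates the coefficient of a partially linear model by residual-on-residual regression, fitting both nuisance functions on the full sample (no sample splitting or cross-fitting) with a bagged estimator whose members are trained on sub-samples drawn without replacement. This is not the authors' code: the partially linear design, the ridge base learner, the function names (`bagged_predict`, `plr_theta_no_split`, `simulate`), and all default values such as `subsample_size=50` are assumptions chosen only to keep the example small and runnable.

```python
import numpy as np


def bagged_predict(X, y, n_estimators=100, subsample_size=50, ridge=1e-3, seed=0):
    """Average ridge fits, each trained on a sub-sample drawn WITHOUT
    replacement, and return the averaged in-sample predictions."""
    rng = np.random.default_rng(seed)
    n, d = X.shape
    Xb = np.hstack([np.ones((n, 1)), X])  # add an intercept column
    preds = np.zeros(n)
    for _ in range(n_estimators):
        idx = rng.choice(n, size=subsample_size, replace=False)
        A = Xb[idx].T @ Xb[idx] + ridge * np.eye(d + 1)
        beta = np.linalg.solve(A, Xb[idx].T @ y[idx])
        preds += Xb @ beta
    return preds / n_estimators


def plr_theta_no_split(Y, T, X, **bag_kwargs):
    """Residual-on-residual estimate of theta in Y = theta*T + g(X) + eps,
    with both nuisances fitted on the SAME full sample (no splitting)."""
    n = len(Y)
    y_res = Y - bagged_predict(X, Y, **bag_kwargs)  # Y minus fitted E[Y|X]
    t_res = T - bagged_predict(X, T, **bag_kwargs)  # T minus fitted E[T|X]
    theta = (t_res @ y_res) / (t_res @ t_res)
    eps = y_res - theta * t_res
    # Plug-in sandwich standard error for the moment E[(Y~ - theta*T~) * T~] = 0
    se = np.sqrt(np.mean(t_res**2 * eps**2) / n) / np.mean(t_res**2)
    return theta, se


def simulate(n=2000, theta=1.5, seed=1):
    """Synthetic data with linear nuisance functions (illustrative only)."""
    rng = np.random.default_rng(seed)
    X = rng.normal(size=(n, 3))
    T = X @ np.array([1.0, -0.5, 0.2]) + rng.normal(size=n)
    Y = theta * T + X @ np.array([0.5, 0.3, -1.0]) + rng.normal(size=n)
    return Y, T, X


Y, T, X = simulate()
theta_hat, se = plr_theta_no_split(Y, T, X)
print(f"theta_hat = {theta_hat:.3f}, 95% CI half-width = {1.96 * se:.3f}")
```

Whether a given learner and sub-sample size actually satisfy the leave-one-out stability conditions is what the paper analyzes; the defaults here should not be read as recommendations.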

Suggested Citation

Qizhao Chen & Vasilis Syrgkanis & Morgane Austern, 2022. "Debiased Machine Learning without Sample-Splitting for Stable Estimators," Papers 2206.01825, arXiv.org, revised Nov 2022.

Handle: RePEc:arx:papers:2206.01825

References listed on IDEAS

Ai, Chunrong & Chen, Xiaohong, 2012. "The semiparametric efficiency bound for models of sequential moment restrictions containing unknown functions," Journal of Econometrics, Elsevier, vol. 170(2), pages 442-457.
- Chunrong Ai & Xiaohong Chen, 2009. "Semiparametric efficiency bound for models of sequential moment restrictions containing unknown functions," CeMMAP working papers CWP28/09, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
- Chunrong Ai & Xiaohong Chen, 2009. "Semiparametric Efficiency Bound for Models of Sequential Moment Restrictions Containing Unknown Functions," Cowles Foundation Discussion Papers 1731, Cowles Foundation for Research in Economics, Yale University.
Victor Chernozhukov & Denis Chetverikov & Mert Demirer & Esther Duflo & Christian Hansen & Whitney Newey & James Robins, 2018. "Double/debiased machine learning for treatment and structural parameters," Econometrics Journal, Royal Economic Society, vol. 21(1), pages 1-68, February.
- Victor Chernozhukov & Denis Chetverikov & Mert Demirer & Esther Duflo & Christian Hansen & Whitney Newey & James Robins, 2017. "Double/Debiased Machine Learning for Treatment and Structural Parameters," NBER Working Papers 23564, National Bureau of Economic Research, Inc.
- Victor Chernozhukov & Denis Chetverikov & Mert Demirer & Esther Duflo & Christian Hansen & Whitney K. Newey & James Robins, 2017. "Double/debiased machine learning for treatment and structural parameters," CeMMAP working papers CWP28/17, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
- Victor Chernozhukov & Denis Chetverikov & Mert Demirer & Esther Duflo & Christian Hansen & Whitney K. Newey & James Robins, 2017. "Double/debiased machine learning for treatment and structural parameters," CeMMAP working papers 28/17, Institute for Fiscal Studies.
Robinson, Peter M, 1988. "Root-N-Consistent Semiparametric Regression," Econometrica, Econometric Society, vol. 56(4), pages 931-954, July.
Newey, Whitney K, 1994. "The Asymptotic Variance of Semiparametric Estimators," Econometrica, Econometric Society, vol. 62(6), pages 1349-1382, November.
- Newey, W.K., 1989. "The Asymptotic Variance Of Semiparametric Estimators," Papers 346, Princeton, Department of Economics - Econometric Research Program.
- Newey, W.K., 1991. "The Asymptotic Variance of Semiparametric Estimators," Working papers 583, Massachusetts Institute of Technology (MIT), Department of Economics.
Victor Chernozhukov & Christian Hansen & Martin Spindler, 2015. "Valid Post-Selection and Post-Regularization Inference: An Elementary, General Approach," Annual Review of Economics, Annual Reviews, vol. 7(1), pages 649-688, August.
- Victor Chernozhukov & Christian Hansen & Martin Spindler, 2015. "Valid Post-Selection and Post-Regularization Inference: An Elementary, General Approach," Papers 1501.03430, arXiv.org, revised Aug 2015.
- Victor Chernozhukov & Christian Hansen & Martin Spindler, 2016. "Valid post-selection and post-regularization inference: An elementary, general approach," CeMMAP working papers 36/16, Institute for Fiscal Studies.
- Victor Chernozhukov & Christian Hansen & Martin Spindler, 2016. "Valid post-selection and post-regularization inference: An elementary, general approach," CeMMAP working papers CWP36/16, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
Victor Chernozhukov & Juan Carlos Escanciano & Hidehiko Ichimura & Whitney K. Newey & James M. Robins, 2022. "Locally Robust Semiparametric Estimation," Econometrica, Econometric Society, vol. 90(4), pages 1501-1535, July.
- Victor Chernozhukov & Juan Carlos Escanciano & Hidehiko Ichimura & Whitney K. Newey, 2016. "Locally robust semiparametric estimation," CeMMAP working papers CWP31/16, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
- Victor Chernozhukov & Juan Carlos Escanciano & Hidehiko Ichimura & Whitney K. Newey & James M. Robins, 2016. "Locally Robust Semiparametric Estimation," Papers 1608.00033, arXiv.org, revised Aug 2020.
- Victor Chernozhukov & Juan Carlos Escanciano & Hidehiko Ichimura & Whitney K. Newey & James M. Robins, 2018. "Locally robust semiparametric estimation," CeMMAP working papers CWP30/18, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
- Victor Chernozhukov & Juan Carlos Escanciano & Hidehiko Ichimura & Whitney K. Newey, 2016. "Locally robust semiparametric estimation," CeMMAP working papers 31/16, Institute for Fiscal Studies.
Whitney K. Newey & Fushing Hsieh & James M. Robins, 2004. "Twicing Kernels and a Small Bias Property of Semiparametric Estimators," Econometrica, Econometric Society, vol. 72(3), pages 947-962, May.
A. Belloni & V. Chernozhukov & I. Fernández‐Val & C. Hansen, 2017. "Program Evaluation and Causal Inference With High‐Dimensional Data," Econometrica, Econometric Society, vol. 85, pages 233-298, January.
- Alexandre Belloni & Victor Chernozhukov & Ivan Fernández-Val & Christian Hansen, 2013. "Program Evaluation and Causal Inference with High-Dimensional Data," Papers 1311.2645, arXiv.org, revised Jan 2018.
- Alexandre Belloni & Victor Chernozhukov & Ivan Fernandez-Val & Christian Hansen, 2016. "Program evaluation and causal inference with high-dimensional data," CeMMAP working papers 13/16, Institute for Fiscal Studies.
- Alexandre Belloni & Victor Chernozhukov & Ivan Fernandez-Val & Christian Hansen, 2016. "Program evaluation and causal inference with high-dimensional data," CeMMAP working papers CWP13/16, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
van der Laan Mark J. & Rubin Daniel, 2006. "Targeted Maximum Likelihood Learning," The International Journal of Biostatistics, De Gruyter, vol. 2(1), pages 1-40, December.
Alexandre Belloni & Victor Chernozhukov & Christian Hansen, 2014. "Inference on Treatment Effects after Selection among High-Dimensional Controls," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 81(2), pages 608-650.
Victor Chernozhukov & Whitney Newey & Rahul Singh & Vasilis Syrgkanis, 2020. "Adversarial Estimation of Riesz Representers," Papers 2101.00009, arXiv.org, revised Apr 2024.
Ai, Chunrong & Chen, Xiaohong, 2007. "Estimation of possibly misspecified semiparametric conditional moment restriction models with different conditioning variables," Journal of Econometrics, Elsevier, vol. 141(1), pages 5-43, November.
Chunrong Ai & Xiaohong Chen, 2003. "Efficient Estimation of Models with Conditional Moment Restrictions Containing Unknown Functions," Econometrica, Econometric Society, vol. 71(6), pages 1795-1843, November.
Vaart,A. W. van der, 2000. "Asymptotic Statistics," Cambridge Books, Cambridge University Press, number 9780521784504, January.
Khashayar Khosravi & Greg Lewis & Vasilis Syrgkanis, 2019. "Non-Parametric Inference Adaptive to Intrinsic Dimension," Papers 1901.03719, arXiv.org, revised Jun 2019.
Whitney Newey & Fushing Hsieh & James Robins, 1998. "Undersmoothing and Bias Corrected Functional Estimation," Working papers 98-17, Massachusetts Institute of Technology (MIT), Department of Economics.
Emmanuel Rio, 2009. "Moment Inequalities for Sums of Dependent Random Variables under Projective Conditions," Journal of Theoretical Probability, Springer, vol. 22(1), pages 146-163, March.
Cun-Hui Zhang & Stephanie S. Zhang, 2014. "Confidence intervals for low dimensional parameters in high dimensional linear models," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 76(1), pages 217-242, January.

Citations

Cited by:

Philippe Goulet Coulombe & Maximilian Goebel, 2023. "Maximally Machine-Learnable Portfolios," Papers 2306.05568, arXiv.org, revised Apr 2024.
Chad Brown, 2024. "Inference in Partially Linear Models under Dependent Data with Deep Neural Networks," Papers 2410.22574, arXiv.org.
Philippe Goulet Coulombe & Maximilian Gobel, 2023. "Maximally Machine-Learnable Portfolios," Working Papers 23-01, Chair in macroeconomics and forecasting, University of Quebec in Montreal's School of Management, revised Apr 2023.
Chad Brown, 2024. "Statistical Properties of Deep Neural Networks with Dependent Data," Papers 2410.11113, arXiv.org, revised Jan 2025.

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Rahul Singh, 2021. "Kernel Ridge Riesz Representers: Generalization, Mis-specification, and the Counterfactual Effective Dimension," Papers 2102.11076, arXiv.org, revised Jul 2024.
Victor Chernozhukov & Whitney Newey & Rahul Singh & Vasilis Syrgkanis, 2020. "Adversarial Estimation of Riesz Representers," Papers 2101.00009, arXiv.org, revised Apr 2024.
Victor Chernozhukov & Juan Carlos Escanciano & Hidehiko Ichimura & Whitney K. Newey & James M. Robins, 2022. "Locally Robust Semiparametric Estimation," Econometrica, Econometric Society, vol. 90(4), pages 1501-1535, July.
- Victor Chernozhukov & Juan Carlos Escanciano & Hidehiko Ichimura & Whitney K. Newey, 2016. "Locally robust semiparametric estimation," CeMMAP working papers CWP31/16, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
- Victor Chernozhukov & Juan Carlos Escanciano & Hidehiko Ichimura & Whitney K. Newey & James M. Robins, 2016. "Locally Robust Semiparametric Estimation," Papers 1608.00033, arXiv.org, revised Aug 2020.
- Victor Chernozhukov & Juan Carlos Escanciano & Hidehiko Ichimura & Whitney K. Newey & James M. Robins, 2018. "Locally robust semiparametric estimation," CeMMAP working papers CWP30/18, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
- Victor Chernozhukov & Juan Carlos Escanciano & Hidehiko Ichimura & Whitney K. Newey, 2016. "Locally robust semiparametric estimation," CeMMAP working papers 31/16, Institute for Fiscal Studies.
Agboola, Oluwagbenga David & Yu, Han, 2023. "Neighborhood-based cross fitting approach to treatment effects with high-dimensional data," Computational Statistics & Data Analysis, Elsevier, vol. 186(C).
Victor Chernozhukov & Denis Chetverikov & Mert Demirer & Esther Duflo & Christian Hansen & Whitney Newey & James Robins, 2018. "Double/debiased machine learning for treatment and structural parameters," Econometrics Journal, Royal Economic Society, vol. 21(1), pages 1-68, February.
- Victor Chernozhukov & Denis Chetverikov & Mert Demirer & Esther Duflo & Christian Hansen & Whitney Newey & James Robins, 2017. "Double/Debiased Machine Learning for Treatment and Structural Parameters," NBER Working Papers 23564, National Bureau of Economic Research, Inc.
- Victor Chernozhukov & Denis Chetverikov & Mert Demirer & Esther Duflo & Christian Hansen & Whitney K. Newey & James Robins, 2017. "Double/debiased machine learning for treatment and structural parameters," CeMMAP working papers CWP28/17, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
- Victor Chernozhukov & Denis Chetverikov & Mert Demirer & Esther Duflo & Christian Hansen & Whitney K. Newey & James Robins, 2017. "Double/debiased machine learning for treatment and structural parameters," CeMMAP working papers 28/17, Institute for Fiscal Studies.
Hidehiko Ichimura & Whitney K. Newey, 2022. "The influence function of semiparametric estimators," Quantitative Economics, Econometric Society, vol. 13(1), pages 29-61, January.
- Hidehiko Ichimura & Whitney K. Newey, 2015. "The influence function of semiparametric estimators," CeMMAP working papers 44/15, Institute for Fiscal Studies.
- Hidehiko Ichimura & Whitney K. Newey, 2015. "The influence function of semiparametric estimators," CeMMAP working papers CWP44/15, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
- Hidehiko Ichimura & Whitney K. Newey, 2015. "The Influence Function of Semiparametric Estimators," CIRJE F-Series CIRJE-F-985, CIRJE, Faculty of Economics, University of Tokyo.
- Hidehiko Ichimura & Whitney K. Newey, 2017. "The influence function of semiparametric estimators," CeMMAP working papers 06/17, Institute for Fiscal Studies.
- Hidehiko Ichimura & Whitney K. Newey, 2017. "The influence function of semiparametric estimators," CeMMAP working papers CWP06/17, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
Victor Chernozhukov & Denis Chetverikov & Mert Demirer & Esther Duflo & Christian Hansen & Whitney Newey & James Robins, 2016. "Double/Debiased Machine Learning for Treatment and Causal Parameters," Papers 1608.00060, arXiv.org, revised Nov 2024.
Anish Agarwal & Rahul Singh, 2021. "Causal Inference with Corrupted Data: Measurement Error, Missing Values, Discretization, and Differential Privacy," Papers 2107.02780, arXiv.org, revised Feb 2024.
Neng-Chieh Chang, 2018. "Semiparametric Difference-in-Differences with Potentially Many Control Variables," Papers 1812.10846, arXiv.org, revised Jan 2019.
Victor Chernozhukov & Whitney K. Newey & Rahul Singh, 2022. "Automatic Debiased Machine Learning of Causal and Structural Effects," Econometrica, Econometric Society, vol. 90(3), pages 967-1027, May.
- Victor Chernozhukov & Whitney K Newey & Rahul Singh, 2018. "Automatic Debiased Machine Learning of Causal and Structural Effects," Papers 1809.05224, arXiv.org, revised Oct 2022.
Liu, Lin & Mukherjee, Rajarshi & Robins, James M., 2024. "Assumption-lean falsification tests of rate double-robustness of double-machine-learning estimators," Journal of Econometrics, Elsevier, vol. 240(2).
Isaac Meza & Rahul Singh, 2021. "Nested Nonparametric Instrumental Variable Regression: Long Term, Mediated, and Time Varying Treatment Effects," Papers 2112.14249, arXiv.org, revised Mar 2024.
Sant’Anna, Pedro H.C. & Zhao, Jun, 2020. "Doubly robust difference-in-differences estimators," Journal of Econometrics, Elsevier, vol. 219(1), pages 101-122.
- Pedro H. C. Sant'Anna & Jun B. Zhao, 2018. "Doubly Robust Difference-in-Differences Estimators," Papers 1812.01723, arXiv.org, revised May 2020.
Victor Chernozhukov & Whitney K. Newey & Victor Quintas-Martinez & Vasilis Syrgkanis, 2021. "Automatic Debiased Machine Learning via Riesz Regression," Papers 2104.14737, arXiv.org, revised Mar 2024.
Ganesh Karapakula, 2023. "Stable Probability Weighting: Large-Sample and Finite-Sample Estimation and Inference Methods for Heterogeneous Causal Effects of Multivalued Treatments Under Limited Overlap," Papers 2301.05703, arXiv.org, revised Jan 2023.
V Chernozhukov & W K Newey & R Singh, 2023. "A simple and general debiased machine learning theorem with finite-sample guarantees," Biometrika, Biometrika Trust, vol. 110(1), pages 257-264.
- Victor Chernozhukov & Whitney K. Newey & Rahul Singh, 2021. "A Simple and General Debiased Machine Learning Theorem with Finite Sample Guarantees," Papers 2105.15197, arXiv.org, revised Oct 2022.
Alexandre Belloni & Victor Chernozhukov & Denis Chetverikov & Christian Hansen & Kengo Kato, 2018. "High-dimensional econometrics and regularized GMM," CeMMAP working papers CWP35/18, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
- Alexandre Belloni & Victor Chernozhukov & Denis Chetverikov & Christian Hansen & Kengo Kato, 2018. "High-Dimensional Econometrics and Regularized GMM," Papers 1806.01888, arXiv.org, revised Jun 2018.
Kyle Colangelo & Ying-Ying Lee, 2019. "Double debiased machine learning nonparametric inference with continuous treatments," CeMMAP working papers CWP72/19, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
Semenova, Vira, 2023. "Debiased machine learning of set-identified linear models," Journal of Econometrics, Elsevier, vol. 235(2), pages 1725-1746.
Kyle Colangelo & Ying-Ying Lee, 2020. "Double Debiased Machine Learning Nonparametric Inference with Continuous Treatments," Papers 2004.03036, arXiv.org, revised Sep 2023.

More about this item

NEP fields

This paper has been announced in the following NEP Reports:

NEP-BIG-2022-07-18 (Big Data)
NEP-CMP-2022-07-18 (Computational Economics)
NEP-DEM-2022-07-18 (Demographic Economics)
NEP-ECM-2022-07-18 (Econometrics)
