Optimal selection of the number of control units in kNN algorithm to estimate average treatment effects

My bibliography Save this paper

Optimal selection of the number of control units in kNN algorithm to estimate average treatment effects

Author

Listed:

Andr'es Ram'irez-Hassan
Raquel Vargas-Correa
Gustavo Garc'ia
Daniel Londo~no

Registered:

Gustavo Adolfo Garcia Cruz

Abstract

We propose a simple approach to optimally select the number of control units in k nearest neighbors (kNN) algorithm focusing in minimizing the mean squared error for the average treatment effects. Our approach is non-parametric where confidence intervals for the treatment effects were calculated using asymptotic results with bias correction. Simulation exercises show that our approach gets relative small mean squared errors, and a balance between confidence intervals length and type I error. We analyzed the average treatment effects on treated (ATET) of participation in 401(k) plans on accumulated net financial assets confirming significant effects on amount and positive probability of net asset. Our optimal k selection produces significant narrower ATET confidence intervals compared with common practice of using k=1.

Suggested Citation

Andr'es Ram'irez-Hassan & Raquel Vargas-Correa & Gustavo Garc'ia & Daniel Londo~no, 2020. "Optimal selection of the number of control units in kNN algorithm to estimate average treatment effects," Papers 2008.06564, arXiv.org.

Handle: RePEc:arx:papers:2008.06564

Download full text from publisher

References listed on IDEAS

Victor Chernozhukov & Denis Chetverikov & Mert Demirer & Esther Duflo & Christian Hansen & Whitney Newey & James Robins, 2018. "Double/debiased machine learning for treatment and structural parameters," Econometrics Journal, Royal Economic Society, vol. 21(1), pages 1-68, February.
- Victor Chernozhukov & Denis Chetverikov & Mert Demirer & Esther Duflo & Christian Hansen & Whitney Newey & James Robins, 2017. "Double/Debiased Machine Learning for Treatment and Structural Parameters," NBER Working Papers 23564, National Bureau of Economic Research, Inc.
- Victor Chernozhukov & Denis Chetverikov & Mert Demirer & Esther Duflo & Christian Hansen & Whitney K. Newey & James Robins, 2017. "Double/debiased machine learning for treatment and structural parameters," CeMMAP working papers CWP28/17, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
- Victor Chernozhukov & Denis Chetverikov & Mert Demirer & Esther Duflo & Christian Hansen & Whitney K. Newey & James Robins, 2017. "Double/debiased machine learning for treatment and structural parameters," CeMMAP working papers 28/17, Institute for Fiscal Studies.
Angrist, Joshua D. & Krueger, Alan B., 1999. "Empirical strategies in labor economics," Handbook of Labor Economics, in: O. Ashenfelter & D. Card (ed.), Handbook of Labor Economics, edition 1, volume 3, chapter 23, pages 1277-1366, Elsevier.
- Alan B. Krueger & Joshua D. Angrist, 1998. "Empirical Strategies in Labor Economics," Working Papers 780, Princeton University, Department of Economics, Industrial Relations Section..
- Joshua Angrist & Alan Krueger, 1998. "Empirical Strategies in Labor Economics," Working papers 98-7, Massachusetts Institute of Technology (MIT), Department of Economics.
Alberto Abadie & Guido W. Imbens, 2016. "Matching on the Estimated Propensity Score," Econometrica, Econometric Society, vol. 84, pages 781-807, March.
- Alberto Abadie & Guido W. Imbens, 2009. "Matching on the Estimated Propensity Score," NBER Working Papers 15301, National Bureau of Economic Research, Inc.
Abadie, Alberto & Imbens, Guido W., 2011. "Bias-Corrected Matching Estimators for Average Treatment Effects," Journal of Business & Economic Statistics, American Statistical Association, vol. 29(1), pages 1-11.
- Alberto Abadie & Guido W. Imbens, 2011. "Bias-Corrected Matching Estimators for Average Treatment Effects," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 29(1), pages 1-11, January.
Alberto Abadie & Guido W. Imbens, 2008. "On the Failure of the Bootstrap for Matching Estimators," Econometrica, Econometric Society, vol. 76(6), pages 1537-1557, November.
- Alberto Abadie & Guido W. Imbens, 2006. "On the Failure of the Bootstrap for Matching Estimators," NBER Technical Working Papers 0325, National Bureau of Economic Research, Inc.
- Imbens, Guido & Abadie, Alberto, 2008. "On the Failure of the Bootstrap for Matching Estimators," Scholarly Articles 3043415, Harvard University Department of Economics.
Taisuke Otsu & Yoshiyasu Rai, 2017. "Bootstrap Inference of Matching Estimators for Average Treatment Effects," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 112(520), pages 1720-1732, October.
- Taisuke Otsu & Yoshiyasu Rai, 2015. "Bootstrap inference of matching estimators for average treatment effects," STICERD - Econometrics Paper Series /2015/580, Suntory and Toyota International Centres for Economics and Related Disciplines, LSE.
Victor Chernozhukov & Christian Hansen, 2004. "The Effects of 401(K) Participation on the Wealth Distribution: An Instrumental Quantile Regression Analysis," The Review of Economics and Statistics, MIT Press, vol. 86(3), pages 735-751, August.
Athey, Susan & Imbens, Guido W., 2015. "Machine Learning for Estimating Heterogeneous Causal Effects," Research Papers 3350, Stanford University, Graduate School of Business.
Victor Chernozhukov & Denis Chetverikov & Mert Demirer & Esther Duflo & Christian Hansen & Whitney Newey & James Robins, 2016. "Double/Debiased Machine Learning for Treatment and Causal Parameters," Papers 1608.00060, arXiv.org, revised Nov 2024.
Benjamin, Daniel J., 2003. "Does 401(k) eligibility increase saving?: Evidence from propensity score subclassification," Journal of Public Economics, Elsevier, vol. 87(5-6), pages 1259-1290, May.
Alberto Abadie & Guido W. Imbens, 2006. "Large Sample Properties of Matching Estimators for Average Treatment Effects," Econometrica, Econometric Society, vol. 74(1), pages 235-267, January.

Full references (including those not matched with items on IDEAS)

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Yihui He & Fang Han, 2023. "On propensity score matching with a diverging number of matches," Papers 2310.14142, arXiv.org, revised Nov 2023.
Huber, Martin, 2019. "An introduction to flexible methods for policy evaluation," FSES Working Papers 504, Faculty of Economics and Social Sciences, University of Freiburg/Fribourg Switzerland.
- Martin Huber, 2019. "An introduction to flexible methods for policy evaluation," Papers 1910.00641, arXiv.org.
Ziming Lin & Fang Han, 2024. "On the consistency of bootstrap for matching estimators," Papers 2410.23525, arXiv.org, revised Nov 2024.
Taisuke Otsu & Mengshan Xu, 2022. "Isotonic propensity score matching," STICERD - Econometrics Paper Series 623, Suntory and Toyota International Centres for Economics and Related Disciplines, LSE.
Bodory, Hugo & Camponovo, Lorenzo & Huber, Martin & Lechner, Michael, 2024. "Nonparametric bootstrap for propensity score matching estimators," Statistics & Probability Letters, Elsevier, vol. 208(C).
Huber, Martin & Camponovo, Lorenzo & Bodory, Hugo & Lechner, Michael, 2016. "A wild bootstrap algorithm for propensity score matching estimators," FSES Working Papers 470, Faculty of Economics and Social Sciences, University of Freiburg/Fribourg Switzerland.
Mengshan Xu & Taisuke Otsu, 2022. "Isotonic propensity score matching," Papers 2207.08868, arXiv.org, revised Jan 2025.
Guido W. Imbens & Jeffrey M. Wooldridge, 2009. "Recent Developments in the Econometrics of Program Evaluation," Journal of Economic Literature, American Economic Association, vol. 47(1), pages 5-86, March.
- Guido M. Imbens & Jeffrey M. Wooldridge, 2008. "Recent Developments in the Econometrics of Program Evaluation," NBER Working Papers 14251, National Bureau of Economic Research, Inc.
- Wooldridge, Jeffrey M. & Imbens, Guido, 2009. "Recent Developments in the Econometrics of Program Evaluation," Scholarly Articles 3043416, Harvard University Department of Economics.
- Guido Imbens & Jeffrey M. Wooldridge, 2008. "Recent developments in the econometrics of program evaluation," CeMMAP working papers CWP24/08, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
- Imbens, Guido W. & Wooldridge, Jeffrey M., 2008. "Recent Developments in the Econometrics of Program Evaluation," IZA Discussion Papers 3640, Institute of Labor Economics (IZA).
Songliang Chen & Fang Han, 2024. "On the limiting variance of matching estimators," Papers 2411.05758, arXiv.org.
Stanislao Maldonado, 2024. "Empowering women through multifaceted interventions: long-term evidence from a double matching design," Journal of Population Economics, Springer;European Society for Population Economics, vol. 37(1), pages 1-44, March.
- Maldonado, S, 2020. "Empowering women through multifaceted interventions: Long-term evidence from a double matching design," Documentos de Trabajo 18456, Universidad del Rosario.
- Stanislao Maldonado, 2020. "Empowering women through multifaceted interventions: Long-term evidence from a double matching design," Working Papers 170, Peruvian Economic Association.
Zongwu Cai & Ying Fang & Ming Lin & Shengfang Tang, 2020. "Testing Unconfoundedness Assumption Using Auxiliary Variables," WORKING PAPERS SERIES IN THEORETICAL AND APPLIED ECONOMICS 202004, University of Kansas, Department of Economics, revised Feb 2020.
Ferman, Bruno, 2021. "Matching estimators with few treated and many control observations," Journal of Econometrics, Elsevier, vol. 225(2), pages 295-307.
- Ferman, Bruno, 2017. "Matching Estimators with Few Treated and Many Control Observations," MPRA Paper 78940, University Library of Munich, Germany.
- Bruno Ferman, 2019. "Matching Estimators with Few Treated and Many Control Observations," Papers 1909.05093, arXiv.org, revised Mar 2021.
Zeqin Liu & Zongwu Cai & Ying Fang & Ming Lin, 2019. "Statistical Analysis and Evaluation of Macroeconomic Policies: A Selective Review," WORKING PAPERS SERIES IN THEORETICAL AND APPLIED ECONOMICS 201904, University of Kansas, Department of Economics, revised Mar 2019.
Phillip Heiler, 2022. "Efficient Covariate Balancing for the Local Average Treatment Effect," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 40(4), pages 1569-1582, October.
- Phillip Heiler, 2020. "Efficient Covariate Balancing for the Local Average Treatment Effect," Papers 2007.04346, arXiv.org.
David Kaplan & Jianshen Chen, 2012. "A Two-Step Bayesian Approach for Propensity Score Analysis: Simulations and Case Study," Psychometrika, Springer;The Psychometric Society, vol. 77(3), pages 581-609, July.
Shu Yang & Yunshu Zhang, 2023. "Multiply robust matching estimators of average and quantile treatment effects," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 50(1), pages 235-265, March.
Athey, Susan & Imbens, Guido W. & Metzger, Jonas & Munro, Evan, 2024. "Using Wasserstein Generative Adversarial Networks for the design of Monte Carlo simulations," Journal of Econometrics, Elsevier, vol. 240(2).
- Susan Athey & Guido W. Imbens & Jonas Metzger & Evan M. Munro, 2019. "Using Wasserstein Generative Adversarial Networks for the Design of Monte Carlo Simulations," NBER Working Papers 26566, National Bureau of Economic Research, Inc.
- Susan Athey & Guido Imbens & Jonas Metzger & Evan Munro, 2019. "Using Wasserstein Generative Adversarial Networks for the Design of Monte Carlo Simulations," Papers 1909.02210, arXiv.org, revised Jul 2020.
Matthew Blackwell & Anton Strezhnev, 2022. "Telescope matching for reducing model dependence in the estimation of the effects of time‐varying treatments: An application to negative advertising," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 185(1), pages 377-399, January.
Zhexiao Lin & Peng Ding & Fang Han, 2021. "Estimation based on nearest neighbor matching: from density ratio to average treatment effect," Papers 2112.13506, arXiv.org.
Shu Yang & Jae Kwang Kim, 2020. "Asymptotic theory and inference of predictive mean matching imputation using a superpopulation model framework," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 47(3), pages 839-861, September.

More about this item

NEP fields

This paper has been announced in the following NEP Reports:

NEP-CMP-2020-09-07 (Computational Economics)
NEP-ECM-2020-09-07 (Econometrics)

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2008.06564. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

Optimal selection of the number of control units in kNN algorithm to estimate average treatment effects

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Most related items

More about this item

NEP fields

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data