IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2008.06564.html
   My bibliography  Save this paper

Optimal selection of the number of control units in kNN algorithm to estimate average treatment effects

Author

Listed:
  • Andr'es Ram'irez-Hassan
  • Raquel Vargas-Correa
  • Gustavo Garc'ia
  • Daniel Londo~no

Abstract

We propose a simple approach to optimally select the number of control units in k nearest neighbors (kNN) algorithm focusing in minimizing the mean squared error for the average treatment effects. Our approach is non-parametric where confidence intervals for the treatment effects were calculated using asymptotic results with bias correction. Simulation exercises show that our approach gets relative small mean squared errors, and a balance between confidence intervals length and type I error. We analyzed the average treatment effects on treated (ATET) of participation in 401(k) plans on accumulated net financial assets confirming significant effects on amount and positive probability of net asset. Our optimal k selection produces significant narrower ATET confidence intervals compared with common practice of using k=1.

Suggested Citation

  • Andr'es Ram'irez-Hassan & Raquel Vargas-Correa & Gustavo Garc'ia & Daniel Londo~no, 2020. "Optimal selection of the number of control units in kNN algorithm to estimate average treatment effects," Papers 2008.06564, arXiv.org.
  • Handle: RePEc:arx:papers:2008.06564
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2008.06564
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Taisuke Otsu & Yoshiyasu Rai, 2017. "Bootstrap Inference of Matching Estimators for Average Treatment Effects," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 112(520), pages 1720-1732, October.
    2. Victor Chernozhukov & Denis Chetverikov & Mert Demirer & Esther Duflo & Christian Hansen & Whitney Newey & James Robins, 2018. "Double/debiased machine learning for treatment and structural parameters," Econometrics Journal, Royal Economic Society, vol. 21(1), pages 1-68, February.
    3. Angrist, Joshua D. & Krueger, Alan B., 1999. "Empirical strategies in labor economics," Handbook of Labor Economics, in: O. Ashenfelter & D. Card (ed.), Handbook of Labor Economics, edition 1, volume 3, chapter 23, pages 1277-1366, Elsevier.
    4. Alberto Abadie & Guido W. Imbens, 2016. "Matching on the Estimated Propensity Score," Econometrica, Econometric Society, vol. 84, pages 781-807, March.
    5. Alberto Abadie & Guido W. Imbens, 2011. "Bias-Corrected Matching Estimators for Average Treatment Effects," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 29(1), pages 1-11, January.
    6. Alberto Abadie & Guido W. Imbens, 2008. "On the Failure of the Bootstrap for Matching Estimators," Econometrica, Econometric Society, vol. 76(6), pages 1537-1557, November.
    7. Victor Chernozhukov & Christian Hansen, 2004. "The Effects of 401(K) Participation on the Wealth Distribution: An Instrumental Quantile Regression Analysis," The Review of Economics and Statistics, MIT Press, vol. 86(3), pages 735-751, August.
    8. Athey, Susan & Imbens, Guido W., 2015. "Machine Learning for Estimating Heterogeneous Causal Effects," Research Papers 3350, Stanford University, Graduate School of Business.
    9. Victor Chernozhukov & Denis Chetverikov & Mert Demirer & Esther Duflo & Christian Hansen & Whitney Newey & James Robins, 2016. "Double/Debiased Machine Learning for Treatment and Causal Parameters," Papers 1608.00060, arXiv.org, revised Nov 2024.
    10. Benjamin, Daniel J., 2003. "Does 401(k) eligibility increase saving?: Evidence from propensity score subclassification," Journal of Public Economics, Elsevier, vol. 87(5-6), pages 1259-1290, May.
    11. Alberto Abadie & Guido W. Imbens, 2006. "Large Sample Properties of Matching Estimators for Average Treatment Effects," Econometrica, Econometric Society, vol. 74(1), pages 235-267, January.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Yihui He & Fang Han, 2023. "On propensity score matching with a diverging number of matches," Papers 2310.14142, arXiv.org, revised Nov 2023.
    2. Huber, Martin, 2019. "An introduction to flexible methods for policy evaluation," FSES Working Papers 504, Faculty of Economics and Social Sciences, University of Freiburg/Fribourg Switzerland.
    3. Ziming Lin & Fang Han, 2024. "On the consistency of bootstrap for matching estimators," Papers 2410.23525, arXiv.org, revised Nov 2024.
    4. Taisuke Otsu & Mengshan Xu, 2022. "Isotonic propensity score matching," STICERD - Econometrics Paper Series 623, Suntory and Toyota International Centres for Economics and Related Disciplines, LSE.
    5. Bodory, Hugo & Camponovo, Lorenzo & Huber, Martin & Lechner, Michael, 2024. "Nonparametric bootstrap for propensity score matching estimators," Statistics & Probability Letters, Elsevier, vol. 208(C).
    6. Huber, Martin & Camponovo, Lorenzo & Bodory, Hugo & Lechner, Michael, 2016. "A wild bootstrap algorithm for propensity score matching estimators," FSES Working Papers 470, Faculty of Economics and Social Sciences, University of Freiburg/Fribourg Switzerland.
    7. Mengshan Xu & Taisuke Otsu, 2022. "Isotonic propensity score matching," Papers 2207.08868, arXiv.org, revised Aug 2024.
    8. Guido W. Imbens & Jeffrey M. Wooldridge, 2009. "Recent Developments in the Econometrics of Program Evaluation," Journal of Economic Literature, American Economic Association, vol. 47(1), pages 5-86, March.
    9. Songliang Chen & Fang Han, 2024. "On the limiting variance of matching estimators," Papers 2411.05758, arXiv.org.
    10. Zongwu Cai & Ying Fang & Ming Lin & Shengfang Tang, 2020. "Testing Unconfoundedness Assumption Using Auxiliary Variables," WORKING PAPERS SERIES IN THEORETICAL AND APPLIED ECONOMICS 202004, University of Kansas, Department of Economics, revised Feb 2020.
    11. Ferman, Bruno, 2021. "Matching estimators with few treated and many control observations," Journal of Econometrics, Elsevier, vol. 225(2), pages 295-307.
    12. Zeqin Liu & Zongwu Cai & Ying Fang & Ming Lin, 2019. "Statistical Analysis and Evaluation of Macroeconomic Policies: A Selective Review," WORKING PAPERS SERIES IN THEORETICAL AND APPLIED ECONOMICS 201904, University of Kansas, Department of Economics, revised Mar 2019.
    13. Phillip Heiler, 2020. "Efficient Covariate Balancing for the Local Average Treatment Effect," Papers 2007.04346, arXiv.org.
    14. David Kaplan & Jianshen Chen, 2012. "A Two-Step Bayesian Approach for Propensity Score Analysis: Simulations and Case Study," Psychometrika, Springer;The Psychometric Society, vol. 77(3), pages 581-609, July.
    15. Shu Yang & Yunshu Zhang, 2023. "Multiply robust matching estimators of average and quantile treatment effects," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 50(1), pages 235-265, March.
    16. Athey, Susan & Imbens, Guido W. & Metzger, Jonas & Munro, Evan, 2024. "Using Wasserstein Generative Adversarial Networks for the design of Monte Carlo simulations," Journal of Econometrics, Elsevier, vol. 240(2).
    17. Matthew Blackwell & Anton Strezhnev, 2022. "Telescope matching for reducing model dependence in the estimation of the effects of time‐varying treatments: An application to negative advertising," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 185(1), pages 377-399, January.
    18. Zhexiao Lin & Peng Ding & Fang Han, 2021. "Estimation based on nearest neighbor matching: from density ratio to average treatment effect," Papers 2112.13506, arXiv.org.
    19. Shu Yang & Jae Kwang Kim, 2020. "Asymptotic theory and inference of predictive mean matching imputation using a superpopulation model framework," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 47(3), pages 839-861, September.
    20. Michael Lechner & Ruth Miquel & Conny Wunsch, 2011. "Long‐Run Effects Of Public Sector Sponsored Training In West Germany," Journal of the European Economic Association, European Economic Association, vol. 9(4), pages 742-784, August.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2008.06564. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.