IDEAS home Printed from https://ideas.repec.org/p/hal/journl/hal-04678541.html
   My bibliography  Save this paper

Explicit solutions for the asymptotically optimal bandwidth in cross-validation

Author

Listed:
  • Karim M Abadir

    (Imperial College London)

  • Michel Lubrano

    (AMSE - Aix-Marseille Sciences Economiques - EHESS - École des hautes études en sciences sociales - AMU - Aix Marseille Université - ECM - École Centrale de Marseille - CNRS - Centre National de la Recherche Scientifique, AMU - Aix Marseille Université)

Abstract

We show that least-squares cross-validation methods share a common structure that has an explicit asymptotic solution, when the chosen kernel is asymptotically separable in bandwidth and data. For density estimation with a multivariate Student-t(ν) kernel, the cross-validation criterion becomes asymptotically equivalent to a polynomial of only three terms. Our bandwidth formulae are simple and noniterative, thus leading to very fast computations, their integrated squared-error dominates traditional cross-validation implementations, they alleviate the notorious sample variability of cross-validation and overcome its breakdown in the case of repeated observations. We illustrate our method with univariate and bivariate applications, of density estimation and nonparametric regressions, to a large dataset of Michigan State University academic wages and experience.

Suggested Citation

  • Karim M Abadir & Michel Lubrano, 2024. "Explicit solutions for the asymptotically optimal bandwidth in cross-validation," Post-Print hal-04678541, HAL.
  • Handle: RePEc:hal:journl:hal-04678541
    DOI: 10.1093/biomet/asae007
    Note: View the original document on HAL open archive server: https://hal.science/hal-04678541
    as

    Download full text from publisher

    File URL: https://hal.science/hal-04678541/document
    Download Restriction: no

    File URL: https://libkey.io/10.1093/biomet/asae007?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Kim, W. C. & Park, B. U. & Marron, J. S., 1994. "Asymptotically best bandwidth selectors in kernel density estimation," Statistics & Probability Letters, Elsevier, vol. 19(2), pages 119-127, January.
    2. Mammen, Enno & Martínez Miranda, María Dolores & Nielsen, Jens Perch & Sperlich, Stefan, 2011. "Do-Validation for Kernel Density Estimation," Journal of the American Statistical Association, American Statistical Association, vol. 106(494), pages 651-660.
    3. Savchuk, Olga Y. & Hart, Jeffrey D. & Sheather, Simon J., 2010. "Indirect Cross-Validation for Density Estimation," Journal of the American Statistical Association, American Statistical Association, vol. 105(489), pages 415-423.
    4. Karim Abadir, 1999. "An introduction to hypergeometric functions for economists," Econometric Reviews, Taylor & Francis Journals, vol. 18(3), pages 287-330.
    5. Qi Li & Jeffrey Scott Racine, 2006. "Density Estimation, from Nonparametric Econometrics: Theory and Practice," Introductory Chapters, in: Nonparametric Econometrics: Theory and Practice, Princeton University Press.
    6. Newey, Whitney & West, Kenneth, 2014. "A simple, positive semi-definite, heteroscedasticity and autocorrelation consistent covariance matrix," Applied Econometrics, Russian Presidential Academy of National Economy and Public Administration (RANEPA), vol. 33(1), pages 125-132.
    7. Hall, Peter & Marron, J. S., 1987. "Estimation of integrated squared density derivatives," Statistics & Probability Letters, Elsevier, vol. 6(2), pages 109-115, November.
    8. Qi Li & Jeffrey Scott Racine, 2006. "Nonparametric Econometrics: Theory and Practice," Economics Books, Princeton University Press, edition 1, volume 1, number 8355.
    9. Robinson, P.M., 2005. "Robust Covariance Matrix Estimation: Hac Estimates With Long Memory/Antipersistence Correction," Econometric Theory, Cambridge University Press, vol. 21(1), pages 171-180, February.
    10. Duong, Tarn, 2007. "ks: Kernel Density Estimation and Kernel Discriminant Analysis for Multivariate Data in R," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 21(i07).
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Karim M Abadir & Michel Lubrano, 2023. "Explicit solutions for the asymptotically-optimal bandwidth in cross validation," AMSE Working Papers 2336, Aix-Marseille School of Economics, France.
    2. Nils-Bastian Heidenreich & Anja Schindler & Stefan Sperlich, 2013. "Bandwidth selection for kernel density estimation: a review of fully automatic selectors," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 97(4), pages 403-433, October.
    3. Jianqing Fan & Weining Wang & Yue Zhao, 2024. "Conditional nonparametric variable screening by neural factor regression," Papers 2408.10825, arXiv.org.
    4. Gregory Connor & Lisa R. Goldberg & Robert A. Korajczyk, 2010. "Portfolio Risk Analysis," Economics Books, Princeton University Press, edition 1, number 9224.
    5. Chaohua Dong & Jiti Gao & Oliver Linton & Bin peng, 2020. "On Time Trend of COVID-19: A Panel Data Study," Monash Econometrics and Business Statistics Working Papers 22/20, Monash University, Department of Econometrics and Business Statistics.
    6. Walter Sosa-Escudero & Sergio Petralia, 2011. "Anatomy of Distributive Changes in Argentina," Chapters, in: Werner Baer & David Fleischer (ed.), The Economies of Argentina and Brazil, chapter 10, Edward Elgar Publishing.
    7. Luis Alvarez & Cristine Pinto & Vladimir Ponczek, 2022. "Homophily in preferences or meetings? Identifying and estimating an iterative network formation model," Papers 2201.06694, arXiv.org, revised Mar 2024.
    8. repec:asg:wpaper:1006 is not listed on IDEAS
    9. Eduardo Fé & Bruce Hollingsworth, 2016. "Short- and long-run estimates of the local effects of retirement on health," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 179(4), pages 1051-1067, October.
    10. repec:cte:werepe:we1211 is not listed on IDEAS
    11. Adriano Z. Zambom & Ronaldo Dias, 2013. "A Review of Kernel Density Estimation with Applications to Econometrics," International Econometric Review (IER), Econometric Research Association, vol. 5(1), pages 20-42, April.
    12. Tobias Adrian & Richard K. Crump & Erik Vogt, 2019. "Nonlinearity and Flight‐to‐Safety in the Risk‐Return Trade‐Off for Stocks and Bonds," Journal of Finance, American Finance Association, vol. 74(4), pages 1931-1973, August.
    13. Camelia Minoiu & Sanjay Reddy, 2014. "Kernel density estimation on grouped data: the case of poverty assessment," The Journal of Economic Inequality, Springer;Society for the Study of Economic Inequality, vol. 12(2), pages 163-189, June.
    14. Wenger, Kai & Leschinski, Christian & Sibbertsen, Philipp, 2018. "A simple test on structural change in long-memory time series," Economics Letters, Elsevier, vol. 163(C), pages 90-94.
    15. Kai Wenger & Christian Leschinski & Philipp Sibbertsen, 2019. "Change-in-mean tests in long-memory time series: a review of recent developments," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 103(2), pages 237-256, June.
    16. repec:hal:spmain:info:hdl:2441/4hgajj9cf48dladkd9pn9jcj4p is not listed on IDEAS
    17. Manabu Asai & Michael McAleer, 2017. "A fractionally integrated Wishart stochastic volatility model," Econometric Reviews, Taylor & Francis Journals, vol. 36(1-3), pages 42-59, March.
    18. Cornelius Christian & Lukas Hensel & Christopher Roth, 2019. "Income Shocks and Suicides: Causal Evidence From Indonesia," The Review of Economics and Statistics, MIT Press, vol. 101(5), pages 905-920, December.
    19. Quoc-Anh Do & Kieu-Trang Nguyen & Anh N. Tran, 2017. "One Mandarin Benefits the Whole Clan: Hometown Favoritism in an Authoritarian Regime," American Economic Journal: Applied Economics, American Economic Association, vol. 9(4), pages 1-29, October.
    20. Campante, Filipe R. & Do, Quoc-Anh & Guimaraes, Bernardo, 2012. "Isolated Capital Cities and Misgovernance: Theory and Evidence," Working Paper Series rwp12-058, Harvard University, John F. Kennedy School of Government.
    21. Hugo Bodory & Martin Huber & Michael Lechner, 2022. "The finite sample performance of instrumental variable-based estimators of the Local Average Treatment Effect when controlling for covariates," Papers 2212.07379, arXiv.org.
    22. Manuel Hernandez & Maximo Torero, 2014. "Parametric versus nonparametric methods in risk scoring: an application to microcredit," Empirical Economics, Springer, vol. 46(3), pages 1057-1079, May.
    23. Xiaohong Chen & Zhipeng Liao & Yixiao Sun, 2012. "Sieve Inference on Semi-nonparametric Time Series Models," Cowles Foundation Discussion Papers 1849, Cowles Foundation for Research in Economics, Yale University.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:hal:journl:hal-04678541. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: CCSD (email available below). General contact details of provider: https://hal.archives-ouvertes.fr/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.