IDEAS home Printed from https://ideas.repec.org/a/eee/jmvana/v61y1997i1p38-60.html
   My bibliography  Save this article

A Cross-Validation Bandwidth Choice for Kernel Density Estimates with Selection Biased Data

Author

Listed:
  • Wu, Colin O.

Abstract

This paper studies the risks and bandwidth choices of a kernel estimate of the underlying density when the data are obtained fromsindependent biased samples. The main results of this paper give the asymptotic representation of the integrated squared errors and the mean integrated squared errors of the estimate and establish a cross-validation criterion for bandwidth selection. This kernel density estimate is shown to be asymptotically superior to many other intuitive kernel density estimates. The data-driven cross-validation bandwidth is shown to be asymptotically optimal in the sense of Stone (1984,Ann. Statist.12, 1285-1297). The finite sample properties of the cross-validation bandwidth are investigated through a Monte Carlo simulation.

Suggested Citation

  • Wu, Colin O., 1997. "A Cross-Validation Bandwidth Choice for Kernel Density Estimates with Selection Biased Data," Journal of Multivariate Analysis, Elsevier, vol. 61(1), pages 38-60, April.
  • Handle: RePEc:eee:jmvana:v:61:y:1997:i:1:p:38-60
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0047-259X(97)91659-6
    Download Restriction: Full text for ScienceDirect subscribers only
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Ahmad, Ibrahim A., 1995. "On multivariate kernel estimation for samples from weighted distributions," Statistics & Probability Letters, Elsevier, vol. 22(2), pages 121-129, February.
    2. Morgenthaler, S. & Vardi, Y., 1986. "Choice-based samples : A non-parametric approach," Journal of Econometrics, Elsevier, vol. 32(1), pages 109-125, June.
    3. Colin Wu & Andrew Mao, 1996. "Minimax kernels for density estimation with biased data," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 48(3), pages 451-467, September.
    4. Marron, James Stephen & Härdle, Wolfgang, 1986. "Random approximations to some measures of accuracy in nonparametric curve estimation," Journal of Multivariate Analysis, Elsevier, vol. 20(1), pages 91-113, October.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Yu-Min Huang, 2019. "Binary surrogates with stratified samples when weights are unknown," Computational Statistics, Springer, vol. 34(2), pages 653-682, June.
    2. José Cristóbal & José Alcalá, 2001. "An overview of nonparametric contributions to the problem of functional estimation from biased data," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 10(2), pages 309-332, December.
    3. Dauxois, Jean-Yves & Guilloux, Agathe, 2008. "Nonparametric inference under competing risks and selection-biased sampling," Journal of Multivariate Analysis, Elsevier, vol. 99(4), pages 589-605, April.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Delsol, Laurent & Ferraty, Frédéric & Vieu, Philippe, 2011. "Structural test in regression on functional variables," Journal of Multivariate Analysis, Elsevier, vol. 102(3), pages 422-447, March.
    2. Timmermans, Catherine & Delsol, Laurent & von Sachs, Rainer, 2013. "Using Bagidis in nonparametric functional data analysis: Predicting from curves with sharp local features," Journal of Multivariate Analysis, Elsevier, vol. 115(C), pages 421-444.
    3. Max Köhler & Anja Schindler & Stefan Sperlich, 2014. "A Review and Comparison of Bandwidth Selection Methods for Kernel Regression," International Statistical Review, International Statistical Institute, vol. 82(2), pages 243-274, August.
    4. BOUEZMARNI, Taoufik & ROMBOUTS, Jeroen V.K., 2007. "Nonparametric density estimation for multivariate bounded data," LIDAM Discussion Papers CORE 2007065, Université catholique de Louvain, Center for Operations Research and Econometrics (CORE).
    5. Estévez-Pérez, Graciela, 2002. "On convergence rates for quadratic errors in kernel hazard estimation," Statistics & Probability Letters, Elsevier, vol. 57(3), pages 231-241, April.
    6. J. Vilar, 1995. "Kernel estimation of the regression function with random sampling times," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 4(1), pages 137-178, June.
    7. Christopher Vahl & Qing Kang, 2015. "Analysis of an outcome-dependent enriched sample: hypothesis tests," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 24(3), pages 387-409, September.
    8. Junke Kou & Youming Liu, 2018. "Wavelet regression estimations with strong mixing data," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 27(4), pages 667-688, December.
    9. Sun, Liuquan, 1997. "Bandwidth choice for hazard rate estimators from left truncated and right censored data," Statistics & Probability Letters, Elsevier, vol. 36(2), pages 101-114, December.
    10. Dauxois, Jean-Yves & Guilloux, Agathe, 2008. "Nonparametric inference under competing risks and selection-biased sampling," Journal of Multivariate Analysis, Elsevier, vol. 99(4), pages 589-605, April.
    11. J. Cristóbal & J. Ojeda & J. Alcalá, 2004. "Confidence bands in nonparametric regression with length biased data," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 56(3), pages 475-496, September.
    12. Ying Qing Chen, 2010. "Semiparametric Regression in Size-Biased Sampling," Biometrics, The International Biometric Society, vol. 66(1), pages 149-158, March.
    13. Inoue, Atsushi & Jin, Lu & Rossi, Barbara, 2017. "Rolling window selection for out-of-sample forecasting with time-varying parameters," Journal of Econometrics, Elsevier, vol. 196(1), pages 55-67.
    14. Colin Wu & Andrew Mao, 1996. "Minimax kernels for density estimation with biased data," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 48(3), pages 451-467, September.
    15. Dippon, J. & Fritz, P. & Kohler, M., 2002. "A statistical approach to case based reasoning, with application to breast cancer data," Computational Statistics & Data Analysis, Elsevier, vol. 40(3), pages 579-602, September.
    16. de Uña-Álvarez, Jacobo, 2003. "Large sample results under biased sampling when covariables are present," Statistics & Probability Letters, Elsevier, vol. 63(3), pages 287-293, July.
    17. Timmermans, Catherine & Delsol, Laurent & von Sachs, Rainer, 2011. "Using Bagidis in nonparametric functional data analysis: predicting from curves with sharp local features," LIDAM Discussion Papers ISBA 2011020, Université catholique de Louvain, Institute of Statistics, Biostatistics and Actuarial Sciences (ISBA).
    18. Wu, Colin O., 1997. "The effects of kernel choices in density estimation with biased data," Statistics & Probability Letters, Elsevier, vol. 34(4), pages 373-383, June.
    19. Élie Youndjé & Martin Wells, 2008. "Optimal bandwidth selection for multivariate kernel deconvolution density estimation," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 17(1), pages 138-162, May.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:jmvana:v:61:y:1997:i:1:p:38-60. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/wps/find/journaldescription.cws_home/622892/description#description .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.