IDEAS home Printed from https://ideas.repec.org/a/spr/compst/v32y2017i3d10.1007_s00180-017-0713-7.html

Fully robust one-sided cross-validation for regression functions

Author

Listed:
  • Olga Y. Savchuk

    (University of South Florida)

  • Jeffrey D. Hart

    (Texas A&M University)

Abstract

Fully robust OSCV is a modification of the OSCV method that produces consistent bandwidths in the cases of smooth and nonsmooth regression functions. We propose the practical implementation of the method based on the robust cross-validation kernel $H_I$ in the case when the Gaussian kernel $\phi$ is used in computing the resulting regression estimate. The kernel $H_I$ produces practically unbiased bandwidths in the smooth and nonsmooth cases and performs adequately in the data examples. Negative tails of $H_I$ occasionally result in unacceptably wiggly OSCV curves in the neighborhood of zero. This problem can be resolved by selecting the bandwidth from the largest local minimum of the curve. Further search for the robust kernels with desired properties brought us to consider the quartic kernel for the cross-validation purposes. The quartic kernel is almost robust in the sense that in the nonsmooth case it substantially reduces the asymptotic relative bandwidth bias compared to $\phi$. However, the quartic kernel is found to produce more variable bandwidths compared to $\phi$. Nevertheless, the quartic kernel has an advantage of producing smoother OSCV curves compared to $H_I$. A simplified scale-free version of the OSCV method based on a rescaled one-sided kernel is proposed.
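
The remedy for wiggly curves near zero can be illustrated with a minimal Python sketch. This is not the authors' implementation. It reads "largest local minimum" as the local minimum located at the largest bandwidth, which is an assumption about the abstract's wording. The function names `quartic_kernel` and `largest_local_minimizer`, the bandwidth grid, and the curve values are all hypothetical and chosen only for illustration. The kernel shown is the standard two-sided quartic (biweight) kernel, not the one-sided version that OSCV would use for cross-validation.

```python
import numpy as np


def quartic_kernel(u):
    """Standard quartic (biweight) kernel: (15/16) * (1 - u^2)^2 on [-1, 1], 0 elsewhere."""
    u = np.asarray(u, dtype=float)
    return np.where(np.abs(u) <= 1.0, 15.0 / 16.0 * (1.0 - u**2) ** 2, 0.0)


def largest_local_minimizer(bandwidths, cv_values):
    """Return the largest grid bandwidth at which the curve has a strict interior local minimum.

    Assumes `bandwidths` is sorted in increasing order. If no interior local
    minimum exists, falls back to the global minimizer on the grid.
    """
    b = np.asarray(bandwidths, dtype=float)
    cv = np.asarray(cv_values, dtype=float)
    interior = np.arange(1, len(b) - 1)
    # A grid point is a local minimum if it lies strictly below both neighbours.
    is_min = (cv[interior] < cv[interior - 1]) & (cv[interior] < cv[interior + 1])
    candidates = interior[is_min]
    if candidates.size == 0:
        return float(b[np.argmin(cv)])
    return float(b[candidates[-1]])


# Made-up curve values: a spurious dip at a small bandwidth (0.2) is the
# global minimum, while a second local minimum sits at a larger bandwidth (0.5).
grid = [0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7]
curve = [1.0, 0.4, 0.9, 0.8, 0.5, 0.6, 0.9]

global_choice = float(np.asarray(grid)[np.argmin(curve)])
robust_choice = largest_local_minimizer(grid, curve)
print(global_choice, robust_choice)
```

In this toy example the global minimizer picks the spurious dip at the small bandwidth, whereas the largest local minimizer skips it. On a real OSCV curve the grid resolution would affect which dips register as local minima.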

Suggested Citation

  • Olga Y. Savchuk & Jeffrey D. Hart, 2017. "Fully robust one-sided cross-validation for regression functions," Computational Statistics, Springer, vol. 32(3), pages 1003-1025, September.
  • Handle: RePEc:spr:compst:v:32:y:2017:i:3:d:10.1007_s00180-017-0713-7
    DOI: 10.1007/s00180-017-0713-7

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s00180-017-0713-7
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.


    References listed on IDEAS

    1. Max Köhler & Anja Schindler & Stefan Sperlich, 2014. "A Review and Comparison of Bandwidth Selection Methods for Kernel Regression," International Statistical Review, International Statistical Institute, vol. 82(2), pages 243-274, August.
    2. Hart, Jeffrey D. & Lee, Cherng-Luen, 2005. "Robustness of one-sided cross-validation to autocorrelation," Journal of Multivariate Analysis, Elsevier, vol. 92(1), pages 77-96, January.
    3. Mammen, Enno & Martínez Miranda, María Dolores & Nielsen, Jens Perch & Sperlich, Stefan, 2011. "Do-Validation for Kernel Density Estimation," Journal of the American Statistical Association, American Statistical Association, vol. 106(494), pages 651-660.
    4. Savchuk, Olga Y. & Hart, Jeffrey D. & Sheather, Simon J., 2010. "Indirect Cross-Validation for Density Estimation," Journal of the American Statistical Association, American Statistical Association, vol. 105(489), pages 415-423.
    5. Olga Y. Savchuk & Jeffrey D. Hart & Simon P. Sheather, 2013. "One-sided cross-validation for nonsmooth regression functions," Journal of Nonparametric Statistics, Taylor & Francis Journals, vol. 25(4), pages 889-904, December.

    Citations



    Cited by:

    1. Olga Y. Savchuk, 2020. "One-sided cross-validation for nonsmooth density functions," Computational Statistics, Springer, vol. 35(3), pages 1253-1272, September.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Olga Y. Savchuk, 2020. "One-sided cross-validation for nonsmooth density functions," Computational Statistics, Springer, vol. 35(3), pages 1253-1272, September.
    2. María Luz Gámiz & Enno Mammen & María Dolores Martínez Miranda & Jens Perch Nielsen, 2016. "Double one-sided cross-validation of local linear hazards," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 78(4), pages 755-779, September.
    3. Gámiz Pérez, M. Luz & Martínez Miranda, María Dolores & Nielsen, Jens Perch, 2013. "Smoothing survival densities in practice," Computational Statistics & Data Analysis, Elsevier, vol. 58(C), pages 368-382.
    4. M. Hiabu & E. Mammen & M. D. Martìnez-Miranda & J. P. Nielsen, 2016. "In-sample forecasting with local linear survival densities," Biometrika, Biometrika Trust, vol. 103(4), pages 843-859.
    5. Karim M Abadir & Michel Lubrano, 2024. "Explicit solutions for the asymptotically optimal bandwidth in cross-validation," Post-Print hal-04678541, HAL.
    6. Nils-Bastian Heidenreich & Anja Schindler & Stefan Sperlich, 2013. "Bandwidth selection for kernel density estimation: a review of fully automatic selectors," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 97(4), pages 403-433, October.
    7. Inés Barbeito & Ricardo Cao & Stefan Sperlich, 2023. "Bandwidth selection for statistical matching and prediction," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 32(1), pages 418-446, March.
    8. Adriano Z. Zambom & Ronaldo Dias, 2013. "A Review of Kernel Density Estimation with Applications to Econometrics," International Econometric Review (IER), Econometric Research Association, vol. 5(1), pages 20-42, April.
    9. Max Köhler & Anja Schindler & Stefan Sperlich, 2014. "A Review and Comparison of Bandwidth Selection Methods for Kernel Regression," International Statistical Review, International Statistical Institute, vol. 82(2), pages 243-274, August.
    10. Rahvar, Sepehr & Reihani, Erfan S. & Golestani, Amirhossein N. & Hamounian, Abolfazl & Aghaei, Fatemeh & Sahimi, Muhammad & Manshour, Pouya & Paluš, Milan & Feudel, Ulrike & Freund, Jan A. & Lehnertz,, 2024. "Characterizing time-resolved stochasticity in non-stationary time series," Chaos, Solitons & Fractals, Elsevier, vol. 185(C).
    11. Gámiz, María Luz & Mammen, Enno & Martínez-Miranda, María Dolores & Nielsen, Jens Perch, 2022. "Missing link survival analysis with applications to available pandemic data," Computational Statistics & Data Analysis, Elsevier, vol. 169(C).
    12. Fritz, Marlon, 2019. "Steady state adjusting trends using a data-driven local polynomial regression," Economic Modelling, Elsevier, vol. 83(C), pages 312-325.
    13. José María Sarabia & Faustino Prieto & Vanesa Jordá & Stefan Sperlich, 2020. "A Note on Combining Machine Learning with Statistical Modeling for Financial Data Analysis," Risks, MDPI, vol. 8(2), pages 1-14, April.
    14. Isabel Proença & Stefan Sperlich & Duygu Savaşcı, 2015. "Semi-mixed effects gravity models for bilateral trade," Empirical Economics, Springer, vol. 48(1), pages 361-387, February.
    15. Jan Koláček & Ivana Horová, 2017. "Bandwidth matrix selectors for kernel regression," Computational Statistics, Springer, vol. 32(3), pages 1027-1046, September.
    16. Stefan Sperlich, 2022. "Comments on: hybrid semiparametric Bayesian networks," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 31(2), pages 335-339, June.
    17. Bayer, Sebastian, 2018. "Combining Value-at-Risk forecasts using penalized quantile regressions," Econometrics and Statistics, Elsevier, vol. 8(C), pages 56-77.
    18. Patrick Carmack & Jeffrey Spence & William Schucany, 2012. "Generalised correlated cross-validation," Journal of Nonparametric Statistics, Taylor & Francis Journals, vol. 24(2), pages 269-282.
    19. Wang, Qing & Lindsay, Bruce G., 2015. "Improving cross-validated bandwidth selection using subsampling-extrapolation techniques," Computational Statistics & Data Analysis, Elsevier, vol. 89(C), pages 51-71.
    20. repec:grz:wpaper:2012-10 is not listed on IDEAS
    21. Scholz, Michael & Nielsen, Jens Perch & Sperlich, Stefan, 2015. "Nonparametric prediction of stock returns based on yearly data: The long-term view," Insurance: Mathematics and Economics, Elsevier, vol. 65(C), pages 143-155.
