IDEAS home Printed from https://ideas.repec.org/a/taf/gnstxx/v27y2015i2p167-179.html
   My bibliography  Save this article

A K -fold averaging cross-validation procedure

Author

Listed:
  • Yoonsuh Jung
  • Jianhua Hu

Abstract

Cross-validation (CV) type of methods have been widely used to facilitate model estimation and variable selection. In this work, we suggest a new K -fold CV procedure to select a candidate 'optimal' model from each hold-out fold and average the K candidate 'optimal' models to obtain the ultimate model. Due to the averaging effect, the variance of the proposed estimates can be significantly reduced. This new procedure results in more stable and efficient parameter estimation than the classical K -fold CV procedure. In addition, we show the asymptotic equivalence between the proposed and classical CV procedures in the linear regression setting. We also demonstrate the broad applicability of the proposed procedure via two examples of parameter sparsity regularisation and quantile smoothing splines modelling. We illustrate the promise of the proposed method through simulations and a real data example.

Suggested Citation

  • Yoonsuh Jung & Jianhua Hu, 2015. "A K -fold averaging cross-validation procedure," Journal of Nonparametric Statistics, Taylor & Francis Journals, vol. 27(2), pages 167-179, June.
  • Handle: RePEc:taf:gnstxx:v:27:y:2015:i:2:p:167-179
    DOI: 10.1080/10485252.2015.1010532
    as

    Download full text from publisher

    File URL: http://hdl.handle.net/10.1080/10485252.2015.1010532
    Download Restriction: Access to full text is restricted to subscribers.

    File URL: https://libkey.io/10.1080/10485252.2015.1010532?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Patrick Royston & Douglas G. Altman, 1994. "Regression Using Fractional Polynomials of Continuous Covariates: Parsimonious Parametric Modelling," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 43(3), pages 429-453, September.
    2. Koenker, Roger W & Bassett, Gilbert, Jr, 1978. "Regression Quantiles," Econometrica, Econometric Society, vol. 46(1), pages 33-50, January.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Yingtao Zhang & Tao Wang & Kangkang Liu & Yao Xia & Yi Lu & Qinlong Jing & Zhicong Yang & Wenbiao Hu & Jiahai Lu, 2016. "Developing a Time Series Predictive Model for Dengue in Zhongshan, China Based on Weather and Guangzhou Dengue Surveillance Data," PLOS Neglected Tropical Diseases, Public Library of Science, vol. 10(2), pages 1-17, February.
    2. Yang, Yadong & Shahbeik, Hossein & Shafizadeh, Alireza & Rafiee, Shahin & Hafezi, Amir & Du, Xinyi & Pan, Junting & Tabatabaei, Meisam & Aghbashlo, Mortaza, 2023. "Predicting municipal solid waste gasification using machine learning: A step toward sustainable regional planning," Energy, Elsevier, vol. 278(PB).
    3. Dhan Lord B. Fortela & Armani Travis & Ashley P. Mikolajczyk & Wayne Sharp & Emmanuel Revellame & William Holmes & Rafael Hernandez & Mark E. Zappi, 2023. "Quantitating Wastewater Characteristic Parameters Using Neural Network Regression Modeling on Spectral Reflectance," Clean Technol., MDPI, vol. 5(4), pages 1-17, September.
    4. A. Costa & G. Buffa & D. Palmeri & G. Pollara & L. Fratini, 2022. "Hybrid prediction-optimization approaches for maximizing parts density in SLM of Ti6Al4V titanium alloy," Journal of Intelligent Manufacturing, Springer, vol. 33(7), pages 1967-1989, October.
    5. Yang, Yadong & Shahbeik, Hossein & Shafizadeh, Alireza & Masoudnia, Nima & Rafiee, Shahin & Zhang, Yijia & Pan, Junting & Tabatabaei, Meisam & Aghbashlo, Mortaza, 2022. "Biomass microwave pyrolysis characterization by machine learning for sustainable rural biorefineries," Renewable Energy, Elsevier, vol. 201(P2), pages 70-86.
    6. Li, Guosheng & Ma, Shuaichao & Zhang, Dequan & Yang, Leping & Zhang, Weihua & Wu, Zeping, 2024. "An efficient sequential anisotropic RBF reliability analysis method with fast cross-validation and parallelizability," Reliability Engineering and System Safety, Elsevier, vol. 241(C).
    7. Leonardo Brain García Fernández & Anna Diva Plasencia Lotufo & Carlos Roberto Minussi, 2023. "Development of a Short-Term Electrical Load Forecasting in Disaggregated Levels Using a Hybrid Modified Fuzzy-ARTMAP Strategy," Energies, MDPI, vol. 16(10), pages 1-30, May.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Marcelo Cajias & Philipp Freudenreich & Anna Heller & Wolfgang Schaefers, 2018. "Censored Quantile Regressions and the Determinants of Real Estate Liquidity," ERES eres2018_203, European Real Estate Society (ERES).
    2. Petrella, Lea & Raponi, Valentina, 2019. "Joint estimation of conditional quantiles in multivariate linear regression models with an application to financial distress," Journal of Multivariate Analysis, Elsevier, vol. 173(C), pages 70-84.
    3. Yu, Keming & Moyeed, Rana A., 2001. "Bayesian quantile regression," Statistics & Probability Letters, Elsevier, vol. 54(4), pages 437-447, October.
    4. Kuk, Anthony Y.C., 2017. "Function compositional adjustments of conditional quantile curves," Computational Statistics & Data Analysis, Elsevier, vol. 115(C), pages 281-293.
    5. Akosah, Nana Kwame & Alagidede, Imhotep Paul & Schaling, Eric, 2020. "Testing for asymmetry in monetary policy rule for small-open developing economies: Multiscale Bayesian quantile evidence from Ghana," The Journal of Economic Asymmetries, Elsevier, vol. 22(C).
    6. Paul Hewson & Keming Yu, 2008. "Quantile regression for binary performance indicators," Applied Stochastic Models in Business and Industry, John Wiley & Sons, vol. 24(5), pages 401-418, September.
    7. Salimata Sissoko, 2011. "Working Paper 03-11 - Niveau de décentralisation de la négociation et structure des salaires," Working Papers 1103, Federal Planning Bureau, Belgium.
    8. Korom, Philipp, 2016. "Inherited advantage: The importance of inheritance for private wealth accumulation in Europe," MPIfG Discussion Paper 16/11, Max Planck Institute for the Study of Societies.
    9. Daniele, Vittorio, 2007. "Criminalità e investimenti esteri. Un’analisi per le province italiane [The effect of organized crime on Foreign Investments. An Empirical Analysis for the Italian Provinces]," MPRA Paper 6417, University Library of Munich, Germany.
    10. Ma, Lingjie & Koenker, Roger, 2006. "Quantile regression methods for recursive structural equation models," Journal of Econometrics, Elsevier, vol. 134(2), pages 471-506, October.
    11. Dutta, Anupam & Bouri, Elie & Rothovius, Timo & Uddin, Gazi Salah, 2023. "Climate risk and green investments: New evidence," Energy, Elsevier, vol. 265(C).
    12. Cowling, Marc & Ughetto, Elisa & Lee, Neil, 2018. "The innovation debt penalty: Cost of debt, loan default, and the effects of a public loan guarantee on high-tech firms," Technological Forecasting and Social Change, Elsevier, vol. 127(C), pages 166-176.
    13. Haddou, Samira, 2024. "Determinants of CDS in core and peripheral European countries: A comparative study during crisis and calm periods," The North American Journal of Economics and Finance, Elsevier, vol. 71(C).
    14. Niematallah Elamin & Mototsugu Fukushige, 2016. "A Quantile Regression Model for Electricity Peak Demand Forecasting: An Approach to Avoiding Power Blackouts," Discussion Papers in Economics and Business 16-22, Osaka University, Graduate School of Economics.
    15. Peracchi, Franco, 2002. "On estimating conditional quantiles and distribution functions," Computational Statistics & Data Analysis, Elsevier, vol. 38(4), pages 433-447, February.
    16. Jan Fałkowski & Maciej Jakubowski & Paweł Strawiński, 2014. "Returns from income strategies in rural Poland," The Economics of Transition, The European Bank for Reconstruction and Development, vol. 22(1), pages 139-178, January.
    17. Elena Bárcena-Mart�n & Santiago Budr�a & Ana I. Moro-Egido, 2012. "Skill mismatches and wages among European university graduates," Applied Economics Letters, Taylor & Francis Journals, vol. 19(15), pages 1471-1475, October.
    18. Trojanek, Radoslaw & Huderek-Glapska, Sonia, 2018. "Measuring the noise cost of aviation – The association between the Limited Use Area around Warsaw Chopin Airport and property values," Journal of Air Transport Management, Elsevier, vol. 67(C), pages 103-114.
    19. Charlier, Isabelle & Paindaveine, Davy & Saracco, Jérôme, 2015. "Conditional quantile estimation based on optimal quantization: From theory to practice," Computational Statistics & Data Analysis, Elsevier, vol. 91(C), pages 20-39.
    20. Kato, Kengo & F. Galvao, Antonio & Montes-Rojas, Gabriel V., 2012. "Asymptotics for panel quantile regression models with individual effects," Journal of Econometrics, Elsevier, vol. 170(1), pages 76-91.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:taf:gnstxx:v:27:y:2015:i:2:p:167-179. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Chris Longhurst (email available below). General contact details of provider: http://www.tandfonline.com/GNST20 .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.