IDEAS home Printed from https://ideas.repec.org/a/eee/csdana/v120y2018icp70-83.html
   My bibliography  Save this article

A note on the validity of cross-validation for evaluating autoregressive time series prediction

Author

Listed:
  • Bergmeir, Christoph
  • Hyndman, Rob J.
  • Koo, Bonsoo

Abstract

One of the most widely used standard procedures for model evaluation in classification and regression is K-fold cross-validation (CV). However, when it comes to time series forecasting, because of the inherent serial correlation and potential non-stationarity of the data, its application is not straightforward and often replaced by practitioners in favour of an out-of-sample (OOS) evaluation. It is shown that for purely autoregressive models, the use of standard K-fold CV is possible provided the models considered have uncorrelated errors. Such a setup occurs, for example, when the models nest a more appropriate model. This is very common when Machine Learning methods are used for prediction, and where CV can control for overfitting the data. Theoretical insights supporting these arguments are presented, along with a simulation study and a real-world example. It is shown empirically that K-fold CV performs favourably compared to both OOS evaluation and other time-series-specific techniques such as non-dependent cross-validation.

Suggested Citation

  • Bergmeir, Christoph & Hyndman, Rob J. & Koo, Bonsoo, 2018. "A note on the validity of cross-validation for evaluating autoregressive time series prediction," Computational Statistics & Data Analysis, Elsevier, vol. 120(C), pages 70-83.
  • Handle: RePEc:eee:csdana:v:120:y:2018:i:c:p:70-83
    DOI: 10.1016/j.csda.2017.11.003
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0167947317302384
    Download Restriction: Full text for ScienceDirect subscribers only.

    File URL: https://libkey.io/10.1016/j.csda.2017.11.003?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Mokkadem, Abdelkader, 1988. "Mixing properties of ARMA processes," Stochastic Processes and their Applications, Elsevier, vol. 29(2), pages 309-315, September.
    2. Borra, Simone & Di Ciaccio, Agostino, 2010. "Measuring the prediction error. A comparison of cross-validation, bootstrap and covariance penalty methods," Computational Statistics & Data Analysis, Elsevier, vol. 54(12), pages 2976-2989, December.
    3. Andrews, Donald W K, 1987. "Consistency in Nonlinear Econometric Models: A Generic Uniform Law of Large Numbers [On Unification of the Asymptotic Theory of Nonlinear Econometric Models]," Econometrica, Econometric Society, vol. 55(6), pages 1465-1471, November.
    4. Bergmeir, Christoph & Costantini, Mauro & Benítez, José M., 2014. "On the usefulness of cross-validation for directional forecast evaluation," Computational Statistics & Data Analysis, Elsevier, vol. 76(C), pages 132-143.
    5. Racine, Jeff, 2000. "Consistent cross-validatory model-selection for dependent data: hv-block cross-validation," Journal of Econometrics, Elsevier, vol. 99(1), pages 39-61, November.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Christoph Bergmeir & Rob J Hyndman & Bonsoo Koo, 2015. "A Note on the Validity of Cross-Validation for Evaluating Time Series Prediction," Monash Econometrics and Business Statistics Working Papers 10/15, Monash University, Department of Econometrics and Business Statistics.
    2. Pinto, Jeronymo Marcondes & Marçal, Emerson Fernandes, 2019. "Cross-validation based forecasting method: a machine learning approach," Textos para discussão 498, FGV EESP - Escola de Economia de São Paulo, Fundação Getulio Vargas (Brazil).
    3. Filip Stanek, 2021. "Optimal Out-of-Sample Forecast Evaluation under Stationarity," CERGE-EI Working Papers wp712, The Center for Economic Research and Graduate Education - Economics Institute, Prague.
    4. Mariana Oliveira & Luís Torgo & Vítor Santos Costa, 2021. "Evaluation Procedures for Forecasting with Spatiotemporal Data," Mathematics, MDPI, vol. 9(6), pages 1-27, March.
    5. Bergmeir, Christoph & Costantini, Mauro & Benítez, José M., 2014. "On the usefulness of cross-validation for directional forecast evaluation," Computational Statistics & Data Analysis, Elsevier, vol. 76(C), pages 132-143.
    6. Gary S. Anderson & Alena Audzeyeva, 2019. "A Coherent Framework for Predicting Emerging Market Credit Spreads with Support Vector Regression," Finance and Economics Discussion Series 2019-074, Board of Governors of the Federal Reserve System (U.S.).
    7. M. Hashem Pesaran & Yongcheol Shin, 2002. "Long-Run Structural Modelling," Econometric Reviews, Taylor & Francis Journals, vol. 21(1), pages 49-87.
    8. Sokbae Lee & Myung Hwan Seo & Youngki Shin, 2017. "Correction," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 112(518), pages 883-883, April.
    9. Sergei Koulayev & Marc Rysman & Scott Schuh & Joanna Stavins, 2016. "Explaining adoption and use of payment instruments by US consumers," RAND Journal of Economics, RAND Corporation, vol. 47(2), pages 293-325, May.
    10. Escanciano, Juan Carlos & Jacho-Chávez, David T., 2010. "Approximating the critical values of Cramér-von Mises tests in general parametric conditional specifications," Computational Statistics & Data Analysis, Elsevier, vol. 54(3), pages 625-636, March.
    11. Mayer, Walter J., 1999. "An extension of the maximum score estimator for disequilibrium models," Economics Letters, Elsevier, vol. 64(2), pages 143-149, August.
    12. Kock, Anders Bredahl & Teräsvirta, Timo, 2014. "Forecasting performances of three automated modelling techniques during the economic crisis 2007–2009," International Journal of Forecasting, Elsevier, vol. 30(3), pages 616-631.
    13. Chen, Song Xi & Guo, Bin & Qiu, Yumou, 2023. "Testing and signal identification for two-sample high-dimensional covariances via multi-level thresholding," Journal of Econometrics, Elsevier, vol. 235(2), pages 1337-1354.
    14. Čížek, Pavel & Koo, Chao Hui, 2021. "Jump-preserving varying-coefficient models for nonlinear time series," Econometrics and Statistics, Elsevier, vol. 19(C), pages 58-96.
    15. de Jong, Robert M. & Woutersen, Tiemen, 2011. "Dynamic Time Series Binary Choice," Econometric Theory, Cambridge University Press, vol. 27(4), pages 673-702, August.
    16. Costantini, Mauro & Cuaresma, Jesus Crespo & Hlouskova, Jaroslava, 2014. "Can Macroeconomists Get Rich Forecasting Exchange Rates?," Economics Series 305, Institute for Advanced Studies.
    17. Lei, J., 2013. "Smoothed Spatial Maximum Score Estimation of Spatial Autoregressive Binary Choice Panel Models," Other publications TiSEM d63bf400-7ff2-4a1c-8067-1, Tilburg University, School of Economics and Management.
    18. Evangelos Spiliotis & Fotios Petropoulos & Vassilios Assimakopoulos, 2023. "On the Disagreement of Forecasting Model Selection Criteria," Forecasting, MDPI, vol. 5(2), pages 1-12, June.
    19. Gregory Connor & Matthias Hagmann & Oliver Linton, 2007. "Efficient Estimation of a Semiparametric Characteristic- Based Factor Model of Security Returns," Swiss Finance Institute Research Paper Series 07-26, Swiss Finance Institute.
    20. Zhu, Sha & Dekker, Rommert & van Jaarsveld, Willem & Renjie, Rex Wang & Koning, Alex J., 2017. "An improved method for forecasting spare parts demand using extreme value theory," European Journal of Operational Research, Elsevier, vol. 261(1), pages 169-181.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:csdana:v:120:y:2018:i:c:p:70-83. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/csda .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.