Can out-of-sample forecast comparisons help prevent overfitting?

Author

Todd E. Clark

Abstract

This paper shows that out-of-sample forecast comparisons can help prevent overfitting induced by data mining. The basic results are drawn from simulations of a simple Monte Carlo design and a real data-based design similar to those in Lovell (1983) and Hoover and Perez (1999). In each simulation, a general-to-specific procedure is used to arrive at a model. If the selected specification includes any of the candidate explanatory variables, forecasts from the model are compared to forecasts from a benchmark model that is nested within the selected model. In particular, the competing forecasts are tested for equal MSE and encompassing. The simulations indicate that most of the post-sample tests are roughly correctly sized, as long as only the in-sample portion of the data is used in model selection. Moreover, the tests have relatively good power, although some are consistently more powerful than others. The paper concludes with an application, modeling quarterly U.S. inflation.
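
The procedure outlined in the abstract can be illustrated with a minimal Python sketch. It is not the paper's code or simulation design. It assumes i.i.d. normal data, uses backward elimination on a |t| threshold of 2.0 as a stand-in for a general-to-specific search, estimates each model once on the in-sample portion (a fixed scheme), and takes an intercept-only model as the nested benchmark. All function names and parameters (`ols`, `general_to_specific`, `oos_mse_comparison`, `crit`, `n_in`, `n_out`, `n_cand`, `beta`) are hypothetical. The t-type statistic on the loss differential is reported without critical values, because for nested models its null distribution is nonstandard (see Clark and McCracken, 2001, in the references).

```python
import numpy as np


def ols(X, y):
    """Return OLS coefficients and their conventional t-statistics."""
    beta, *_ = np.linalg.lstsq(X, y, rcond=None)
    resid = y - X @ beta
    dof = X.shape[0] - X.shape[1]
    sigma2 = resid @ resid / dof
    cov = sigma2 * np.linalg.inv(X.T @ X)
    return beta, beta / np.sqrt(np.diag(cov))


def general_to_specific(X, y, crit=2.0):
    """Backward elimination (assumed stand-in for a general-to-specific search).

    Repeatedly drops the candidate with the smallest |t| until every
    remaining candidate has |t| >= crit. Column 0 (intercept) is always kept.
    Returns the list of retained column indices of X.
    """
    keep = list(range(X.shape[1]))
    while len(keep) > 1:
        _, t = ols(X[:, keep], y)
        j = int(np.argmin(np.abs(t[1:]))) + 1  # weakest non-intercept term
        if abs(t[j]) >= crit:
            break
        keep.pop(j)
    return keep


def oos_mse_comparison(n_in=100, n_out=40, n_cand=10, beta=0.0, seed=0):
    """Select a model in-sample, then compare post-sample MSE with a nested benchmark.

    beta is the true coefficient on candidate 0; beta=0 means no candidate
    has any predictive content, so any selected variable is spurious.
    """
    rng = np.random.default_rng(seed)
    n = n_in + n_out
    Z = rng.standard_normal((n, n_cand))
    y = beta * Z[:, 0] + rng.standard_normal(n)
    X = np.column_stack([np.ones(n), Z])

    # Model selection uses only the in-sample portion of the data.
    keep = general_to_specific(X[:n_in], y[:n_in])
    if len(keep) == 1:
        return None  # no candidate survived, so there is nothing to compare

    b_sel, _ = ols(X[:n_in][:, keep], y[:n_in])
    e_sel = y[n_in:] - X[n_in:][:, keep] @ b_sel  # selected-model errors
    e_bench = y[n_in:] - y[:n_in].mean()          # intercept-only benchmark errors

    d = e_bench**2 - e_sel**2  # loss differential; positive favors the selected model
    t_stat = d.mean() / (d.std(ddof=1) / np.sqrt(n_out))
    return {
        "selected": [k - 1 for k in keep[1:]],  # candidate indices, 0-based
        "mse_bench": float(np.mean(e_bench**2)),
        "mse_sel": float(np.mean(e_sel**2)),
        "t_stat": float(t_stat),
    }


if __name__ == "__main__":
    print(oos_mse_comparison(beta=0.0, seed=0))  # pure-noise candidates
    print(oos_mse_comparison(beta=0.5, seed=0))  # one genuinely useful candidate
```

Repeating `oos_mse_comparison` over many seeds with `beta=0.0` gives a rough sense of how often a spuriously selected model also beats the benchmark post-sample; repeating it with a nonzero `beta` gives a rough sense of power. Formal inference would require critical values suited to nested models.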

Suggested Citation

Todd E. Clark, 2000. "Can out-of-sample forecast comparisons help prevent overfitting?," Research Working Paper RWP 00-05, Federal Reserve Bank of Kansas City.

Handle: RePEc:fip:fedkrw:rwp00-05

Other versions of this item:

Todd E. Clark, 2004. "Can out-of-sample forecast comparisons help prevent overfitting?," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 23(2), pages 115-139.

References listed on IDEAS

Kevin D. Hoover & Stephen J. Perez, 1999. "Data mining reconsidered: encompassing and the general-to-specific approach to specification search," Econometrics Journal, Royal Economic Society, vol. 2(2), pages 167-191.
- Kevin D. Hoover & Stephen J. Perez, "undated". "Data Mining Reconsidered: Encompassing And The General-To-Specific Approach To Specification Search," Department of Economics 97-27, California Davis - Department of Economics.
- Kevin Hoover & Stephen J. Perez, 2003. "Data Mining Reconsidered: Encompassing And The General-To-Specific Approach To Specification Search," Working Papers 200, University of California, Davis, Department of Economics.
Martin D.D. Evans & Richard K. Lyons, 2017. "Order Flow and Exchange Rate Dynamics," World Scientific Book Chapters, in: Studies in Foreign Exchange Economics, chapter 6, pages 247-290, World Scientific Publishing Co. Pte. Ltd.
- Martin D. D. Evans & Richard K. Lyons, 2002. "Order Flow and Exchange Rate Dynamics," Journal of Political Economy, University of Chicago Press, vol. 110(1), pages 170-180, February.
- Martin D.D. Evans & Richard K. Lyons, 1999. "Order Flow and Exchange Rate Dynamics," NBER Working Papers 7317, National Bureau of Economic Research, Inc.
- Martin D. D. Evans & Richard K. Lyons, 1999. "Order Flow and Exchange Rate Dynamics," Research Program in Finance Working Papers RPF-288, University of California at Berkeley.
- Evans, Martin D. & Lyons, Richard K., 1999. "Order Flow and Exchange Rate Dynamics," Research Program in Finance, Working Paper Series qt0dh1c16w, Research Program in Finance, Institute for Business and Economic Research, UC Berkeley.
West, Kenneth D, 1996. "Asymptotic Inference about Predictive Ability," Econometrica, Econometric Society, vol. 64(5), pages 1067-1084, September.
- West, K.D., 1994. "Asymptotic Inference About Predictive Ability," Working papers 9417, Wisconsin Madison - Social Systems.
- Kenneth D. West, 1994. "Asymptotic Inference About Predictive Ability," Macroeconomics 9410002, University Library of Munich, Germany.
Thomas Knox & James H. Stock & Mark W. Watson, 2000. "Empirical Bayes Forecasts of One Time Series Using Many Predictors," Econometric Society World Congress 2000 Contributed Papers 1421, Econometric Society.
- Thomas Knox & James H. Stock & Mark W. Watson, 2001. "Empirical Bayes Forecasts of One Time Series Using Many Predictors," NBER Technical Working Papers 0269, National Bureau of Economic Research, Inc.
Krolzig, Hans-Martin & Hendry, David F., 2001. "Computer automation of general-to-specific model selection procedures," Journal of Economic Dynamics and Control, Elsevier, vol. 25(6-7), pages 831-866, June.
- Hans-Martin Krolzig & David Hendry, 1999. "Computer Automation of General-to-Specific Model Selection Procedures," Computing in Economics and Finance 1999 314, Society for Computational Economics.
- David Hendry & Hans-Martin Krolzig, 2000. "Computer Automation of General-to-Specific Model Selection Procedures," Economics Series Working Papers 3, University of Oxford, Department of Economics.
- Hans-Martin Krolzig, 2000. "Computer Automation of General-to-Specific Model Selection Procedures," Econometric Society World Congress 2000 Contributed Papers 0411, Econometric Society.
Denton, Frank T, 1985. "Data Mining as an Industry," The Review of Economics and Statistics, MIT Press, vol. 67(1), pages 124-127, February.
Stock, James H. & Watson, Mark W., 1999. "Forecasting inflation," Journal of Monetary Economics, Elsevier, vol. 44(2), pages 293-335, October.
- James H. Stock & Mark W. Watson, 1999. "Forecasting Inflation," NBER Working Papers 7023, National Bureau of Economic Research, Inc.
Martin Lettau & Sydney Ludvigson, 2001. "Consumption, Aggregate Wealth, and Expected Stock Returns," Journal of Finance, American Finance Association, vol. 56(3), pages 815-849, June.
- Lettau, Martin & Ludvigson, Sydney, 1999. "Consumption, Aggregate Wealth and Expected Stock Returns," CEPR Discussion Papers 2223, C.E.P.R. Discussion Papers.
- Martin Lettau & Sydney C. Ludvigson, 1999. "Consumption, aggregate wealth and expected stock returns," Staff Reports 77, Federal Reserve Bank of New York.
West, Kenneth D & McCracken, Michael W, 1998. "Regression-Based Tests of Predictive Ability," International Economic Review, Department of Economics, University of Pennsylvania and Osaka University Institute of Social and Economic Research Association, vol. 39(4), pages 817-840, November.
- West, K.D. & McCracken, M.W., 1997. "Regression-Based Tests of Predictive Ability," Working papers 9710, Wisconsin Madison - Social Systems.
- Kenneth D. West & Michael W. McCracken, 1998. "Regression-Based Tests of Predictive Ability," NBER Technical Working Papers 0226, National Bureau of Economic Research, Inc.
Julia Campos & Neil R. Ericsson, 1999. "Constructive data mining: modeling consumers' expenditure in Venezuela," Econometrics Journal, Royal Economic Society, vol. 2(2), pages 226-240.
- Julia Campos & Neil R. Ericsson, 2000. "Constructive data mining: modeling consumers' expenditure in Venezuela," International Finance Discussion Papers 663, Board of Governors of the Federal Reserve System (U.S.).
Atsushi Inoue & Lutz Kilian, 2005. "In-Sample or Out-of-Sample Tests of Predictability: Which One Should We Use?," Econometric Reviews, Taylor & Francis Journals, vol. 23(4), pages 371-402.
- Kilian, Lutz & Inoue, Atsushi, 2002. "In-Sample or Out-of-Sample Tests of Predictability: Which One Should We Use?," CEPR Discussion Papers 3671, C.E.P.R. Discussion Papers.
- Inoue, Atsushi & Kilian, Lutz, 2002. "In-sample or out-of-sample tests of predictability: which one should we use?," Working Paper Series 195, European Central Bank.
Amano, Robert A. & van Norden, Simon, 1995. "Terms of trade and real exchange rates: the Canadian evidence," Journal of International Money and Finance, Elsevier, vol. 14(1), pages 83-104, February.
Chao, John & Corradi, Valentina & Swanson, Norman R., 2001. "Out-Of-Sample Tests For Granger Causality," Macroeconomic Dynamics, Cambridge University Press, vol. 5(4), pages 598-620, September.
- Norman R. Swanson, 2000. "An Out of Sample Test for Granger Causality," Econometric Society World Congress 2000 Contributed Papers 0362, Econometric Society.
Harvey, David I & Leybourne, Stephen J & Newbold, Paul, 1998. "Tests for Forecast Encompassing," Journal of Business & Economic Statistics, American Statistical Association, vol. 16(2), pages 254-259, April.
Ashley, R & Granger, C W J & Schmalensee, R, 1980. "Advertising and Aggregate Consumption: An Analysis of Causality," Econometrica, Econometric Society, vol. 48(5), pages 1149-1167, July.
Pesaran, M Hashem & Timmermann, Allan, 2000. "A Recursive Modelling Approach to Predicting UK Stock Returns," Economic Journal, Royal Economic Society, vol. 110(460), pages 159-191, January.
- Pesaran, M. H. & Timmermann, A., 1996. "A Recursive Modelling Approach to Predicting UK Stock Returns," Cambridge Working Papers in Economics 9625, Faculty of Economics, University of Cambridge.
- Allan Timmermann & M. Hashem Pesaran, 1999. "A Recursive Modelling Approach to Predicting UK Stock Returns," FMG Discussion Papers dp322, Financial Markets Group.
Hendry, David F., 1995. "Dynamic Econometrics," OUP Catalogue, Oxford University Press, number 9780198283164, December.
James H. Stock & Mark W. Watson, 1998. "Diffusion Indexes," NBER Working Papers 6702, National Bureau of Economic Research, Inc.
Granger, Clive W. J. & King, Maxwell L. & White, Halbert, 1995. "Comments on testing economic theories and the use of model selection criteria," Journal of Econometrics, Elsevier, vol. 67(1), pages 173-187, May.
Halbert White, 2000. "A Reality Check for Data Snooping," Econometrica, Econometric Society, vol. 68(5), pages 1097-1126, September.
Meese, R. & Rogoff, K., 1988. "Was It Real? The Exchange Rate-Interest Differential Relation Over The Modern Floating-Rate Period," Working papers 368, Wisconsin Madison - Social Systems.
Ericsson, Neil R., 1992. "Parameter constancy, mean square forecast errors, and measuring forecast performance: An exposition, extensions, and illustration," Journal of Policy Modeling, Elsevier, vol. 14(4), pages 465-495, August.
- Neil R. Ericsson, 1991. "Parameter constancy, mean square forecast errors, and measuring forecast performance: an exposition, extensions, and illustration," International Finance Discussion Papers 412, Board of Governors of the Federal Reserve System (U.S.).
Clark, Todd E. & McCracken, Michael W., 2001. "Tests of equal forecast accuracy and encompassing for nested models," Journal of Econometrics, Elsevier, vol. 105(1), pages 85-110, November.
- Todd E. Clark & Michael W. McCracken, 1999. "Tests of equal forecast accuracy and encompassing for nested models," Research Working Paper 99-11, Federal Reserve Bank of Kansas City.
- Todd E. Clark & Michael W. McCracken, 2000. "Tests of Equal Forecast Accuracy and Encompassing for Nested Models," Econometric Society World Congress 2000 Contributed Papers 0319, Econometric Society.
- Todd E. Clark & Michael McCracken, 1999. "Tests of Equal Forecast Accuracy and Encompassing for Nested Models," Computing in Economics and Finance 1999 1241, Society for Computational Economics.
Diebold, Francis X & Mariano, Roberto S, 2002. "Comparing Predictive Accuracy," Journal of Business & Economic Statistics, American Statistical Association, vol. 20(1), pages 134-144, January.
- Diebold, Francis X & Mariano, Roberto S, 1995. "Comparing Predictive Accuracy," Journal of Business & Economic Statistics, American Statistical Association, vol. 13(3), pages 253-263, July.
- Francis X. Diebold & Roberto S. Mariano, 1994. "Comparing Predictive Accuracy," NBER Technical Working Papers 0169, National Bureau of Economic Research, Inc.
Cogley, Timothy, 2002. "A Simple Adaptive Measure of Core Inflation," Journal of Money, Credit and Banking, Blackwell Publishing, vol. 34(1), pages 94-113, February.
- Timothy Cogley, 1998. "A simple adaptive measure of core inflation," Working Papers in Applied Economic Theory 98-06, Federal Reserve Bank of San Francisco.
Lo, Andrew W & MacKinlay, A Craig, 1990. "Data-Snooping Biases in Tests of Financial Asset Pricing Models," The Review of Financial Studies, Society for Financial Studies, vol. 3(3), pages 431-467.
- Andrew W. Lo & A. Craig MacKinlay, 1989. "Data-Snooping Biases in Tests of Financial Asset Pricing Models," NBER Working Papers 3001, National Bureau of Economic Research, Inc.
- Lo, Andrew W. & MacKinlay, A. Craig, 1989. "Data-snooping biases in tests of financial asset pricing models," Working papers 3020-89, Massachusetts Institute of Technology (MIT), Sloan School of Management.
Chris Chatfield, 1995. "Model Uncertainty, Data Mining and Statistical Inference," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 158(3), pages 419-444, May.
Lovell, Michael C, 1983. "Data Mining," The Review of Economics and Statistics, MIT Press, vol. 65(1), pages 1-12, February.
Bossaerts, Peter & Hillion, Pierre, 1999. "Implementing Statistical Criteria to Select Return Forecasting Models: What Do We Learn?," The Review of Financial Studies, Society for Financial Studies, vol. 12(2), pages 405-428.
Meese, Richard A. & Rogoff, Kenneth, 1983. "Empirical exchange rate models of the seventies : Do they fit out of sample?," Journal of International Economics, Elsevier, vol. 14(1-2), pages 3-24, February.
Bruce E. Hansen, 1999. "Discussion of 'Data mining reconsidered'," Econometrics Journal, Royal Economic Society, vol. 2(2), pages 192-201.
David F. Hendry & Hans-Martin Krolzig, 1999. "Improving on 'Data mining reconsidered' by K.D. Hoover and S.J. Perez," Econometrics Journal, Royal Economic Society, vol. 2(2), pages 202-219.
Stock, James H & Watson, Mark W, 2002. "Macroeconomic Forecasting Using Diffusion Indexes," Journal of Business & Economic Statistics, American Statistical Association, vol. 20(2), pages 147-162, April.
Yoshihisa Baba & David F. Hendry & Ross M. Starr, 1992. "The Demand for M1 in the U.S.A., 1960–1988," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 59(1), pages 25-61.
Chinn, Menzie D. & Meese, Richard A., 1995. "Banking on currency forecasts: How predictable is change in money?," Journal of International Economics, Elsevier, vol. 38(1-2), pages 161-178, February.
David J. Hand, 1999. "Discussion contribution on 'Data mining reconsidered: encompassing and the general-to-specific approach to specification search' by Hoover and Perez," Econometrics Journal, Royal Economic Society, vol. 2(2), pages 241-243.

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Clark, Todd E. & McCracken, Michael W., 2001. "Tests of equal forecast accuracy and encompassing for nested models," Journal of Econometrics, Elsevier, vol. 105(1), pages 85-110, November.
- Todd E. Clark & Michael McCracken, 1999. "Tests of Equal Forecast Accuracy and Encompassing for Nested Models," Computing in Economics and Finance 1999 1241, Society for Computational Economics.
- Todd E. Clark & Michael W. McCracken, 2000. "Tests of Equal Forecast Accuracy and Encompassing for Nested Models," Econometric Society World Congress 2000 Contributed Papers 0319, Econometric Society.
- Todd E. Clark & Michael W. McCracken, 1999. "Tests of equal forecast accuracy and encompassing for nested models," Research Working Paper 99-11, Federal Reserve Bank of Kansas City.
West, Kenneth D., 2006. "Forecast Evaluation," Handbook of Economic Forecasting, in: G. Elliott & C. Granger & A. Timmermann (ed.), Handbook of Economic Forecasting, edition 1, volume 1, chapter 3, pages 99-134, Elsevier.
Clark, Todd & McCracken, Michael, 2013. "Advances in Forecast Evaluation," Handbook of Economic Forecasting, in: G. Elliott & C. Granger & A. Timmermann (ed.), Handbook of Economic Forecasting, edition 1, volume 2, chapter 0, pages 1107-1201, Elsevier.
- Todd E. Clark & Michael W. McCracken, 2011. "Advances in forecast evaluation," Working Papers (Old Series) 1120, Federal Reserve Bank of Cleveland.
- Todd E. Clark & Michael W. McCracken, 2011. "Advances in forecast evaluation," Working Papers 2011-025, Federal Reserve Bank of St. Louis.
McCracken,M.W. & West,K.D., 2001. "Inference about predictive ability," Working papers 14, Wisconsin Madison - Social Systems.
McCracken, Michael W., 2007. "Asymptotics for out of sample tests of Granger causality," Journal of Econometrics, Elsevier, vol. 140(2), pages 719-752, October.
Atsushi Inoue & Lutz Kilian, 2005. "In-Sample or Out-of-Sample Tests of Predictability: Which One Should We Use?," Econometric Reviews, Taylor & Francis Journals, vol. 23(4), pages 371-402.
- Inoue, Atsushi & Kilian, Lutz, 2002. "In-sample or out-of-sample tests of predictability: which one should we use?," Working Paper Series 195, European Central Bank.
- Kilian, Lutz & Inoue, Atsushi, 2002. "In-Sample or Out-of-Sample Tests of Predictability: Which One Should We Use?," CEPR Discussion Papers 3671, C.E.P.R. Discussion Papers.
Todd E. Clark & Michael W. McCracken, 2002. "Forecast-based model selection in the presence of structural breaks," Research Working Paper RWP 02-05, Federal Reserve Bank of Kansas City.
Granziera, Eleonora & Hubrich, Kirstin & Moon, Hyungsik Roger, 2014. "A predictability test for a small number of nested models," Journal of Econometrics, Elsevier, vol. 182(1), pages 174-185.
- Hubrich, Kirstin & Granziera, Eleonora & Moon, Hyungsik Roger, 2013. "A predictability test for a small number of nested models," Working Paper Series 1580, European Central Bank.
Rossi, Barbara, 2013. "Advances in Forecasting under Instability," Handbook of Economic Forecasting, in: G. Elliott & C. Granger & A. Timmermann (ed.), Handbook of Economic Forecasting, edition 1, volume 2, chapter 0, pages 1203-1324, Elsevier.
- Barbara Rossi, 2011. "Advances in Forecasting Under Instability," Working Papers 11-20, Duke University, Department of Economics.
Todd E. Clark & Michael W. McCracken, 2001. "Evaluating long-horizon forecasts," Research Working Paper RWP 01-14, Federal Reserve Bank of Kansas City.
Raffaella Giacomini & Halbert White, 2006. "Tests of Conditional Predictive Ability," Econometrica, Econometric Society, vol. 74(6), pages 1545-1578, November.
- Giacomini, Raffaella & White, Halbert, 2003. "Tests of Conditional Predictive Ability," University of California at San Diego, Economics Working Paper Series qt5jk0j5jh, Department of Economics, UC San Diego.
- Raffaella Giacomini & Halbert White, 2003. "Tests of conditional predictive ability," Boston College Working Papers in Economics 572, Boston College Department of Economics.
- Raffaella Giacomini & Halbert White, 2003. "Tests of Conditional Predictive Ability," Econometrics 0308001, University Library of Munich, Germany.
Todd E. Clark & Kenneth D. West, 2005. "Using Out-of-Sample Mean Squared Prediction Errors to Test the Martingale Difference," NBER Technical Working Papers 0305, National Bureau of Economic Research, Inc.
Jin, Sainan & Corradi, Valentina & Swanson, Norman R., 2017. "Robust Forecast Comparison," Econometric Theory, Cambridge University Press, vol. 33(6), pages 1306-1351, December.
- Sainan Jin & Valentina Corradi & Norman Swanson, 2015. "Robust Forecast Comparison," Departmental Working Papers 201502, Rutgers University, Department of Economics.
Raffaella Giacomini & Barbara Rossi, 2013. "Forecasting in macroeconomics," Chapters, in: Nigar Hashimzade & Michael A. Thornton (ed.), Handbook of Research Methods and Applications in Empirical Macroeconomics, chapter 17, pages 381-408, Edward Elgar Publishing.
Clark, Todd E. & West, Kenneth D., 2007. "Approximately normal tests for equal predictive accuracy in nested models," Journal of Econometrics, Elsevier, vol. 138(1), pages 291-311, May.
- Todd E. Clark & Kenneth D. West, 2005. "Approximately normal tests for equal predictive accuracy in nested models," Research Working Paper RWP 05-05, Federal Reserve Bank of Kansas City.
- Kenneth D. West & Todd Clark, 2006. "Approximately Normal Tests for Equal Predictive Accuracy in Nested Models," NBER Technical Working Papers 0326, National Bureau of Economic Research, Inc.
Clark, Todd E. & West, Kenneth D., 2006. "Using out-of-sample mean squared prediction errors to test the martingale difference hypothesis," Journal of Econometrics, Elsevier, vol. 135(1-2), pages 155-186.
- Todd E. Clark & Kenneth D. West, 2004. "Using out-of-sample mean squared prediction errors to test the Martingale difference hypothesis," Research Working Paper RWP 04-03, Federal Reserve Bank of Kansas City.
Todd E. Clark & Michael W. McCracken, 2010. "Reality checks and nested forecast model comparisons," Working Papers 2010-032, Federal Reserve Bank of St. Louis.
Rapach, David E. & Wohar, Mark E. & Rangvid, Jesper, 2005. "Macro variables and international stock return predictability," International Journal of Forecasting, Elsevier, vol. 21(1), pages 137-166.
O. De Bandt & E. Michaux & C. Bruneau & A. Flageollet, 2007. "Forecasting inflation using economic indicators: the case of France," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 26(1), pages 1-22.
- Bruneau, C. & De Bandt, O. & Flageollet, A. & Michaux, E., 2003. "Forecasting Inflation using Economic Indicators: the Case of France," Working papers 101, Banque de France.
Busetti, Fabio & Marcucci, Juri, 2013. "Comparing forecast accuracy: A Monte Carlo investigation," International Journal of Forecasting, Elsevier, vol. 29(1), pages 13-27.
- Fabio Busetti & Juri Marcucci & Giovanni Veronese, 2009. "Comparing forecast accuracy: A Monte Carlo investigation," Temi di discussione (Economic working papers) 723, Bank of Italy, Economic Research and International Relations Area.

More about this item

Keywords

Forecasting

NEP fields

This paper has been announced in the following NEP Reports:

NEP-ECM-2001-04-02 (Econometrics)
