
Optimally adjusted last cluster for prediction based on balancing the bias and variance by bootstrapping

Author

Listed:
  • Jeongwoo Kim

Abstract

Estimating a predictive model from a dataset ideally starts from an unbiased estimator. However, because the unbiased estimator is generally unknown, a bias-variance tradeoff arises. Rather than searching for an unbiased estimator, a convenient approach to this tradeoff is to use clustering. Within a cluster, which is smaller than the whole sample, a simple form of the estimator can be expected to suffice for prediction and to avoid overfitting. In this paper, we propose a new method for finding the optimal cluster for prediction. Based on the previous literature, this cluster is considered to lie somewhere between the whole dataset and the typical cluster determined by partitioning the data. To obtain a reliable cluster size, we use the bootstrap method. Through experiments with simulated and real-world data, we show that the prediction error can be reduced by applying this method. We believe the proposed method will be useful in the many applications that rely on a clustering algorithm and need stable prediction performance.
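
The idea of picking a cluster size between the typical cluster and the whole dataset can be illustrated with a small sketch. This is not the paper's algorithm: it assumes a time-ordered series, treats the most recent `last_cluster_size` observations as the "last cluster", uses the sample mean as the simple estimator, and scores each candidate window by a bootstrap estimate of squared bias plus variance, with the bias measured against the last-cluster mean. The names `choose_window`, `bootstrap_window_score`, `last_cluster_size`, and `step` are hypothetical and chosen only for illustration.

```python
import random
import statistics


def bootstrap_window_score(window, reference, n_boot, rng):
    """Bootstrap estimate of bias^2 + variance for the mean of `window`.

    Assumption: `reference` (the last-cluster mean) stands in for the
    unknown target, so the "bias" here is only a proxy.
    """
    boot_means = []
    for _ in range(n_boot):
        # Resample the window with replacement, keeping its size.
        resample = [rng.choice(window) for _ in window]
        boot_means.append(statistics.fmean(resample))
    variance = statistics.pvariance(boot_means)
    bias = statistics.fmean(boot_means) - reference
    return bias ** 2 + variance


def choose_window(series, last_cluster_size, step=10, n_boot=200, seed=0):
    """Pick how many of the most recent observations to use for prediction.

    Candidates run from the last cluster up to the whole series.
    Returns (best_size, scores), where scores maps size -> bias^2 + variance.
    """
    if not 1 <= last_cluster_size <= len(series):
        raise ValueError("last_cluster_size must be between 1 and len(series)")
    rng = random.Random(seed)
    reference = statistics.fmean(series[-last_cluster_size:])
    sizes = list(range(last_cluster_size, len(series) + 1, step))
    if sizes[-1] != len(series):
        sizes.append(len(series))  # always consider the whole dataset
    scores = {
        n: bootstrap_window_score(series[-n:], reference, n_boot, rng)
        for n in sizes
    }
    best = min(scores, key=scores.get)
    return best, scores


if __name__ == "__main__":
    # Simulated series with a level shift: 80 points near 0, then 20 near 5.
    data_rng = random.Random(0)
    series = [data_rng.gauss(0, 1) for _ in range(80)]
    series += [data_rng.gauss(5, 1) for _ in range(20)]
    best, scores = choose_window(series, last_cluster_size=10)
    print("chosen window size:", best)
```

On a series with a level shift like the simulated one, windows that reach back past the shift pay a large bias penalty, so a short window is favored. On a stable series the variance term would push the choice toward larger windows, although the noisy reference mean makes this proxy only a rough guide.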

Suggested Citation

  • Jeongwoo Kim, 2019. "Optimally adjusted last cluster for prediction based on balancing the bias and variance by bootstrapping," PLOS ONE, Public Library of Science, vol. 14(11), pages 1-31, November.
  • Handle: RePEc:plo:pone00:0223529
    DOI: 10.1371/journal.pone.0223529

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0223529
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0223529&type=printable
    Download Restriction: no



    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Cunha, Ronan & Pereira, Pedro L. Valls, 2015. "Automatic model selection for forecasting Brazilian stock returns," Textos para discussão 398, FGV EESP - Escola de Economia de São Paulo, Fundação Getulio Vargas (Brazil).
    2. Sun, Yuying & Hong, Yongmiao & Wang, Shouyang & Zhang, Xinyu, 2023. "Penalized time-varying model averaging," Journal of Econometrics, Elsevier, vol. 235(2), pages 1355-1377.
    3. Philippe Goulet Coulombe & Maxime Leroux & Dalibor Stevanovic & Stéphane Surprenant, 2022. "How is machine learning useful for macroeconomic forecasting?," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 37(5), pages 920-964, August.
    4. Mehmet Sahiner, 2022. "Forecasting volatility in Asian financial markets: evidence from recursive and rolling window methods," SN Business & Economics, Springer, vol. 2(10), pages 1-74, October.
    5. Rossi, Barbara, 2013. "Advances in Forecasting under Instability," Handbook of Economic Forecasting, in: G. Elliott & C. Granger & A. Timmermann (ed.), Handbook of Economic Forecasting, edition 1, volume 2, chapter 0, pages 1203-1324, Elsevier.
    6. Christopher J. Neely & David E. Rapach & Jun Tu & Guofu Zhou, 2014. "Forecasting the Equity Risk Premium: The Role of Technical Indicators," Management Science, INFORMS, vol. 60(7), pages 1772-1791, July.
    7. Andersen, Torben G. & Varneskov, Rasmus T., 2022. "Testing for parameter instability and structural change in persistent predictive regressions," Journal of Econometrics, Elsevier, vol. 231(2), pages 361-386.
    8. Zhao, Albert Bo & Cheng, Tingting, 2022. "Stock return prediction: Stacking a variety of models," Journal of Empirical Finance, Elsevier, vol. 67(C), pages 288-317.
    9. Pablo Guerrón‐Quintana & Molin Zhong, 2023. "Macroeconomic forecasting in times of crises," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 38(3), pages 295-320, April.
    10. Barbara Rossi, 2019. "Forecasting in the presence of instabilities: How do we know whether models predict well and how to improve them," Economics Working Papers 1711, Department of Economics and Business, Universitat Pompeu Fabra, revised Jul 2021.
    11. Jana Eklund & George Kapetanios & Simon Price, 2013. "Robust Forecast Methods and Monitoring during Structural Change," Manchester School, University of Manchester, vol. 81, pages 3-27, October.
    12. Pesaran, M.H. & Pick, A., 2008. "Forecasting Random Walks Under Drift Instability," Cambridge Working Papers in Economics 0814, Faculty of Economics, University of Cambridge.
    13. Kim, Hyun Hak & Swanson, Norman R., 2014. "Forecasting financial and macroeconomic variables using data reduction methods: New empirical evidence," Journal of Econometrics, Elsevier, vol. 178(P2), pages 352-367.
    14. Barbara Rossi & Atsushi Inoue, 2012. "Out-of-Sample Forecast Tests Robust to the Choice of Window Size," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 30(3), pages 432-453, April.
    15. Tae‐Hwy Lee & Shahnaz Parsaeian & Aman Ullah, 2022. "Forecasting Under Structural Breaks Using Improved Weighted Estimation," Oxford Bulletin of Economics and Statistics, Department of Economics, University of Oxford, vol. 84(6), pages 1485-1501, December.
    16. M. Hashem Pesaran & Andreas Pick, 2008. "Forecasting Random Walks Under Drift Instability," CESifo Working Paper Series 2293, CESifo.
    17. Morales-Arias, Leonardo & Moura, Guilherme V., 2013. "Adaptive forecasting of exchange rates with panel data," International Journal of Forecasting, Elsevier, vol. 29(3), pages 493-509.
    18. Tarassow, Artur, 2019. "Forecasting U.S. money growth using economic uncertainty measures and regularisation techniques," International Journal of Forecasting, Elsevier, vol. 35(2), pages 443-457.
    19. Salisu, Afees A. & Swaray, Raymond & Oloko, Tirimisiyu F., 2019. "Improving the predictability of the oil–US stock nexus: The role of macroeconomic variables," Economic Modelling, Elsevier, vol. 76(C), pages 153-171.
    20. Rapach, David & Zhou, Guofu, 2013. "Forecasting Stock Returns," Handbook of Economic Forecasting, in: G. Elliott & C. Granger & A. Timmermann (ed.), Handbook of Economic Forecasting, edition 1, volume 2, chapter 0, pages 328-383, Elsevier.
