IDEAS home Printed from https://ideas.repec.org/a/eee/phsmap/v637y2024ics0378437124001201.html
   My bibliography  Save this article

A hybrid forecasting framework based on MCS and machine learning for higher dimensional and unbalanced systems

Author

Listed:
  • Yang, Guo-Hui
  • Zhong, Guang-Yan
  • Wang, Li-Ya
  • Xie, Zu-Guang
  • Li, Jiang-Cheng

Abstract

Forecasting methods and theories have been widely researched and applied in complex systems and fields such as statistical physics, econophysics, material crystals, etc. However, challenges persist in applying these methods to complex systems characterized by high dimensionality, data imbalance, and single prediction evaluation. To address these issues, we propose a novel hybrid forecasting approach that integrates the model confidence set (MCS) with machine learning (ML) models. We introduce Principal Component Analysis (PCA) to reduce dimensionality of the data, reduce data imbalance through a combination of random undersampling and oversampling, and introduce several metrics to evaluate the machine learning model set. We also introduce the MCS to select the optimal model from the set of ML models and propose a new combinatorial approach, the MCS-ML combinatorial model. An empirical study is conducted using the example of abnormal transactions in the Bitcoin blockchain. The empirical results show that the proposed MCS-ML combinatorial model has better predictive performance than the models in the ML model set under different data structures.

Suggested Citation

  • Yang, Guo-Hui & Zhong, Guang-Yan & Wang, Li-Ya & Xie, Zu-Guang & Li, Jiang-Cheng, 2024. "A hybrid forecasting framework based on MCS and machine learning for higher dimensional and unbalanced systems," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 637(C).
  • Handle: RePEc:eee:phsmap:v:637:y:2024:i:c:s0378437124001201
    DOI: 10.1016/j.physa.2024.129612
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0378437124001201
    Download Restriction: Full text for ScienceDirect subscribers only. Journal offers the option of making the article available online on Science direct for a fee of $3,000

    File URL: https://libkey.io/10.1016/j.physa.2024.129612?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Romi Kher & Siri Terjesen & Chen Liu, 2021. "Blockchain, Bitcoin, and ICOs: a review and research agenda," Small Business Economics, Springer, vol. 56(4), pages 1699-1720, April.
    2. Ben Moews & Gbenga Ibikunle, 2020. "Predictive intraday correlations in stable and volatile market environments: Evidence from deep learning," Papers 2002.10385, arXiv.org.
    3. Hansen, Peter Reinhard, 2005. "A Test for Superior Predictive Ability," Journal of Business & Economic Statistics, American Statistical Association, vol. 23, pages 365-380, October.
    4. Mauro Bernardi & Leopoldo Catania, 2018. "The model confidence set package for R," International Journal of Computational Economics and Econometrics, Inderscience Enterprises Ltd, vol. 8(2), pages 144-158.
    5. Li, Jiang-Cheng & Xu, Ming-Zhe & Han, Xu & Tao, Chen, 2022. "Dynamic risk resonance between crude oil and stock market by econophysics and machine learning," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 607(C).
    6. Ricardo P. Masini & Marcelo C. Medeiros & Eduardo F. Mendes, 2023. "Machine learning advances for time series forecasting," Journal of Economic Surveys, Wiley Blackwell, vol. 37(1), pages 76-111, February.
    7. Peter R. Hansen & Asger Lunde & James M. Nason, 2011. "The Model Confidence Set," Econometrica, Econometric Society, vol. 79(2), pages 453-497, March.
    8. Jia Li & Zhipeng Liao & Rogier Quaedvlieg, 2022. "Conditional Superior Predictive Ability," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 89(2), pages 843-875.
    9. Nowotarski, Jakub & Weron, Rafał, 2018. "Recent advances in electricity price forecasting: A review of probabilistic forecasting," Renewable and Sustainable Energy Reviews, Elsevier, vol. 81(P1), pages 1548-1568.
    10. Chen, Hongtao & Liu, Li & Li, Xiaolei, 2018. "The predictive content of CBOE crude oil volatility index," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 492(C), pages 837-850.
    11. Plerou, Vasiliki & Gopikrishnan, Parameswaran & Rosenow, Bernd & Amaral, Luis A.N. & Stanley, H.Eugene, 2000. "Econophysics: financial time series from a statistical physics point of view," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 279(1), pages 443-456.
    12. Liang, Chao & Umar, Muhammad & Ma, Feng & Huynh, Toan L.D., 2022. "Climate policy uncertainty and world renewable energy index volatility forecasting," Technological Forecasting and Social Change, Elsevier, vol. 182(C).
    13. Mark Weber & Giacomo Domeniconi & Jie Chen & Daniel Karl I. Weidele & Claudio Bellei & Tom Robinson & Charles E. Leiserson, 2019. "Anti-Money Laundering in Bitcoin: Experimenting with Graph Convolutional Networks for Financial Forensics," Papers 1908.02591, arXiv.org.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Billé, Anna Gloria & Gianfreda, Angelica & Del Grosso, Filippo & Ravazzolo, Francesco, 2023. "Forecasting electricity prices with expert, linear, and nonlinear models," International Journal of Forecasting, Elsevier, vol. 39(2), pages 570-586.
    2. Xie, Nan & Wang, Zongrun & Chen, Sicen & Gong, Xu, 2019. "Forecasting downside risk in China’s stock market based on high-frequency data," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 517(C), pages 530-541.
    3. Zhu, Haibin & Bai, Lu & He, Lidan & Liu, Zhi, 2023. "Forecasting realized volatility with machine learning: Panel data perspective," Journal of Empirical Finance, Elsevier, vol. 73(C), pages 251-271.
    4. Firat Melih Yilmaz & Engin Yildiztepe, 2024. "Statistical Evaluation of Deep Learning Models for Stock Return Forecasting," Computational Economics, Springer;Society for Computational Economics, vol. 63(1), pages 221-244, January.
    5. Zhu, Sha & Liu, Qiuhong & Wang, Yan & Wei, Yu & Wei, Guiwu, 2019. "Which fear index matters for predicting US stock market volatilities: Text-counts or option based measurement?," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 536(C).
    6. Ding, Jing & Jiang, Lei & Liu, Xiaohui & Peng, Liang, 2023. "Nonparametric tests for market timing ability using daily mutual fund returns," Journal of Economic Dynamics and Control, Elsevier, vol. 150(C).
    7. Gary S. Anderson & Alena Audzeyeva, 2019. "A Coherent Framework for Predicting Emerging Market Credit Spreads with Support Vector Regression," Finance and Economics Discussion Series 2019-074, Board of Governors of the Federal Reserve System (U.S.).
    8. Köchling, Gerrit & Schmidtke, Philipp & Posch, Peter N., 2020. "Volatility forecasting accuracy for Bitcoin," Economics Letters, Elsevier, vol. 191(C).
    9. Peter Malec, 2016. "A Semiparametric Intraday GARCH Model," Cambridge Working Papers in Economics 1633, Faculty of Economics, University of Cambridge.
    10. Štefan Lyócsa & Petra Vašaničová & Branka Hadji Misheva & Marko Dávid Vateha, 2022. "Default or profit scoring credit systems? Evidence from European and US peer-to-peer lending markets," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 8(1), pages 1-21, December.
    11. repec:bny:wpaper:0083 is not listed on IDEAS
    12. Degiannakis, Stavros & Filis, George, 2018. "Forecasting oil prices: High-frequency financial data are indeed useful," Energy Economics, Elsevier, vol. 76(C), pages 388-402.
    13. Drachal, Krzysztof, 2021. "Forecasting crude oil real prices with averaging time-varying VAR models," Resources Policy, Elsevier, vol. 74(C).
    14. Mark F. J. Steel, 2020. "Model Averaging and Its Use in Economics," Journal of Economic Literature, American Economic Association, vol. 58(3), pages 644-719, September.
    15. Heitham Al-Hajieh & Hashem AlNemer & Timothy Rodgers & Jacek Niklewski, 2015. "Forecasting the Jordanian stock index: modelling asymmetric volatility and distribution effects within a GARCH framework," Copernican Journal of Finance & Accounting, Uniwersytet Mikolaja Kopernika, vol. 4(2), pages 9-26.
    16. Eo, Yunjong & Kang, Kyu Ho, 2020. "The effects of conventional and unconventional monetary policy on forecasting the yield curve," Journal of Economic Dynamics and Control, Elsevier, vol. 111(C).
    17. Stavros Degiannakis, 2023. "The D-model for GDP nowcasting," Swiss Journal of Economics and Statistics, Springer;Swiss Society of Economics and Statistics, vol. 159(1), pages 1-33, December.
    18. Wei, Yu & Cao, Yang, 2017. "Forecasting house prices using dynamic model averaging approach: Evidence from China," Economic Modelling, Elsevier, vol. 61(C), pages 147-155.
    19. Anwen Yin, 2024. "Predictive model averaging with parameter instability and heteroskedasticity," Bulletin of Economic Research, Wiley Blackwell, vol. 76(2), pages 418-442, April.
    20. Marchese, Malvina & Kyriakou, Ioannis & Tamvakis, Michael & Di Iorio, Francesca, 2020. "Forecasting crude oil and refined products volatilities and correlations: New evidence from fractionally integrated multivariate GARCH models," Energy Economics, Elsevier, vol. 88(C).
    21. Davide De Gaetano, 2018. "Forecast Combinations for Structural Breaks in Volatility: Evidence from BRICS Countries," JRFM, MDPI, vol. 11(4), pages 1-13, October.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:phsmap:v:637:y:2024:i:c:s0378437124001201. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.journals.elsevier.com/physica-a-statistical-mechpplications/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.