IDEAS home Printed from https://ideas.repec.org/a/gam/jmathe/v12y2024i23p3778-d1533310.html
   My bibliography  Save this article

A Hybrid Forecasting System Based on Comprehensive Feature Selection and Intelligent Optimization for Stock Price Index Forecasting

Author

Listed:
  • Xuecheng He

    (School of Management Science and Engineering, Nanjing University of Information Science and Technology, Nanjing 210044, China
    School of Politics, Economics and International Relations, University of Reading, Whiteknights RG6 6UR, UK)

  • Jujie Wang

    (School of Management Science and Engineering, Nanjing University of Information Science and Technology, Nanjing 210044, China)

Abstract

Accurate forecasts of stock indexes can not only provide reference information for investors to formulate relevant strategies but also provide effective channels for the government to regulate the market. However, due to its volatility and complexity, predicting the stock price index has always been a challenging task. This paper proposes a hybrid forecasting system based on comprehensive feature selection and intelligent optimization for stock price index forecasting. First, a recursive feature elimination with a cross-validation (RFECV) algorithm is designed to filter variables that have a significant impact on the target data from multiple datasets. Then, the stack autoencoder (SAE) algorithm is constructed to compress the feature variables. At last, an enhanced least squares support vector machine (LSSVM) algorithm is established to obtain high-precision point prediction results, and the Gaussian process regression (GPR) algorithm is used to obtain reasonable interval prediction results. Taking the Shanghai Stock Exchange (SSE) as an example, the root mean square error (RMSE) and mean absolute percentage error (MAPE) of the model were 6.989 and 0.158%, respectively. In addition, the prediction interval coverage probability (PICP) is 99.792%. Through experimental comparison, the model shows high prediction accuracy and generalization ability.

Suggested Citation

  • Xuecheng He & Jujie Wang, 2024. "A Hybrid Forecasting System Based on Comprehensive Feature Selection and Intelligent Optimization for Stock Price Index Forecasting," Mathematics, MDPI, vol. 12(23), pages 1-27, November.
  • Handle: RePEc:gam:jmathe:v:12:y:2024:i:23:p:3778-:d:1533310
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/12/23/3778/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/12/23/3778/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Tang, Ling & Zhang, Chengyuan & Li, Ling & Wang, Shouyang, 2020. "A multi-scale method for forecasting oil price with multi-factor search engine data," Applied Energy, Elsevier, vol. 257(C).
    2. Escobar-Anel, Marcos & Rastegari, Javad & Stentoft, Lars, 2021. "Option pricing with conditional GARCH models," European Journal of Operational Research, Elsevier, vol. 289(1), pages 350-363.
    3. Mojtaba Kadkhodazadeh & Saeed Farzin, 2021. "A Novel LSSVM Model Integrated with GBO Algorithm to Assessment of Water Quality Parameters," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 35(12), pages 3939-3968, September.
    4. Kizilaslan, Recep & Freund, Steven & Iseri, Ali, 2016. "A data analytic approach to forecasting daily stock returns in an emerging marketAuthor-Name: Oztekin, Asil," European Journal of Operational Research, Elsevier, vol. 253(3), pages 697-710.
    5. Akshit Kurani & Pavan Doshi & Aarya Vakharia & Manan Shah, 2023. "A Comprehensive Comparative Study of Artificial Neural Network (ANN) and Support Vector Machines (SVM) on Stock Forecasting," Annals of Data Science, Springer, vol. 10(1), pages 183-208, February.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jahangoshai Rezaee, Mustafa & Jozmaleki, Mehrdad & Valipour, Mahsa, 2018. "Integrating dynamic fuzzy C-means, data envelopment analysis and artificial neural network to online prediction performance of companies in stock exchange," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 489(C), pages 78-93.
    2. Kamaladdin Fataliyev & Aneesh Chivukula & Mukesh Prasad & Wei Liu, 2021. "Stock Market Analysis with Text Data: A Review," Papers 2106.12985, arXiv.org, revised Jul 2021.
    3. Ling Tang & Chengyuan Zhang & Tingfei Li & Ling Li, 2021. "A novel BEMD-based method for forecasting tourist volume with search engine data," Tourism Economics, , vol. 27(5), pages 1015-1038, August.
    4. Saima Akhtar & Sulman Shahzad & Asad Zaheer & Hafiz Sami Ullah & Heybet Kilic & Radomir Gono & Michał Jasiński & Zbigniew Leonowicz, 2023. "Short-Term Load Forecasting Models: A Review of Challenges, Progress, and the Road Ahead," Energies, MDPI, vol. 16(10), pages 1-29, May.
    5. Xiao, Jihong & Wen, Fenghua & He, Zhifang, 2023. "Impact of geopolitical risks on investor attention and speculation in the oil market: Evidence from nonlinear and time-varying analysis," Energy, Elsevier, vol. 267(C).
    6. Gupta, Priya & Singh, Rhythm, 2023. "Combining simple and less time complex ML models with multivariate empirical mode decomposition to obtain accurate GHI forecast," Energy, Elsevier, vol. 263(PC).
    7. Meng, Anbo & Zhu, Zibin & Deng, Weisi & Ou, Zuhong & Lin, Shan & Wang, Chenen & Xu, Xuancong & Wang, Xiaolin & Yin, Hao & Luo, Jianqiang, 2022. "A novel wind power prediction approach using multivariate variational mode decomposition and multi-objective crisscross optimization based deep extreme learning machine," Energy, Elsevier, vol. 260(C).
    8. Zhao, Lu-Tao & Zheng, Zhi-Yi & Wei, Yi-Ming, 2023. "Forecasting oil inventory changes with Google trends: A hybrid wavelet decomposer and ARDL-SVR ensemble model," Energy Economics, Elsevier, vol. 120(C).
    9. Kriebel, Johannes & Stitz, Lennart, 2022. "Credit default prediction from user-generated text in peer-to-peer lending using deep learning," European Journal of Operational Research, Elsevier, vol. 302(1), pages 309-323.
    10. Zehai Gao & Yang Liu & Nan Li & Kangjie Ma, 2022. "An Enhanced Beetle Antennae Search Algorithm Based Comprehensive Water Quality Index for Urban River Water Quality Assessment," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 36(8), pages 2685-2702, June.
    11. Xiao, Jihong & Wang, Yudong, 2021. "Investor attention and oil market volatility: Does economic policy uncertainty matter?," Energy Economics, Elsevier, vol. 97(C).
    12. Lyócsa, Štefan & Todorova, Neda & Výrost, Tomáš, 2021. "Predicting risk in energy markets: Low-frequency data still matter," Applied Energy, Elsevier, vol. 282(PA).
    13. Mustafa Al-Mukhtar & Aman Srivastava & Leena Khadke & Tariq Al-Musawi & Ahmed Elbeltagi, 2024. "Prediction of Irrigation Water Quality Indices Using Random Committee, Discretization Regression, REPTree, and Additive Regression," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 38(1), pages 343-368, January.
    14. Antonios K. Alexandridis & Ekaterini Panopoulou & Ioannis Souropanis, 2024. "Forecasting exchange rates: An iterated combination constrained predictor approach," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 43(4), pages 983-1017, July.
    15. Mojtaba Kadkhodazadeh & Mahdi Valikhan Anaraki & Amirreza Morshed-Bozorgdel & Saeed Farzin, 2022. "A New Methodology for Reference Evapotranspiration Prediction and Uncertainty Analysis under Climate Change Conditions Based on Machine Learning, Multi Criteria Decision Making and Monte Carlo Methods," Sustainability, MDPI, vol. 14(5), pages 1-37, February.
    16. Stefan Tsokov & Milena Lazarova & Adelina Aleksieva-Petrova, 2022. "A Hybrid Spatiotemporal Deep Model Based on CNN and LSTM for Air Pollution Prediction," Sustainability, MDPI, vol. 14(9), pages 1-38, April.
    17. Chengyuan Zhang & Fuxin Jiang & Shouyang Wang & Shaolong Sun, 2020. "A New Decomposition Ensemble Approach for Tourism Demand Forecasting: Evidence from Major Source Countries," Papers 2002.09201, arXiv.org.
    18. Chin Soon Ku & Jiale Xiong & Yen-Lin Chen & Shing Dhee Cheah & Hoong Cheng Soong & Lip Yee Por, 2023. "Improving Stock Market Predictions: An Equity Forecasting Scanner Using Long Short-Term Memory Method with Dynamic Indicators for Malaysia Stock Market," Mathematics, MDPI, vol. 11(11), pages 1-20, May.
    19. Qin, Fuli & Tong, Mingyu & Huang, Ying & Zhang, Yubo, 2024. "Modeling, prediction and analysis of natural gas consumption in China using a novel dynamic nonlinear multivariable grey delay model," Energy, Elsevier, vol. 305(C).
    20. Hao, Jun & Feng, Qianqian & Yuan, Jiaxin & Sun, Xiaolei & Li, Jianping, 2022. "A dynamic ensemble learning with multi-objective optimization for oil prices prediction," Resources Policy, Elsevier, vol. 79(C).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:12:y:2024:i:23:p:3778-:d:1533310. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.