IDEAS home Printed from https://ideas.repec.org/a/gam/jmathe/v12y2024i10p1572-d1396989.html
   My bibliography  Save this article

Incorporating Multi-Source Market Sentiment and Price Data for Stock Price Prediction

Author

Listed:
  • Kui Fu

    (School of Economics, Wuhan University of Technology, Wuhan 430070, China)

  • Yanbin Zhang

    (School of Economics, Wuhan University of Technology, Wuhan 430070, China)

Abstract

The problem of stock price prediction has been a hot research issue. Stock price is influenced by various factors at the same time, and market sentiment is one of the most critical factors. Financial texts such as news and investor comments reflect investor sentiment in the stock market and influence market movements. Previous research models have struggled to accurately mine multiple sources of market sentiment information originating from the Internet and traditional sentiment analysis models are challenging to quantify and combine indicator data from market data and multi-source sentiment data. Therefore, we propose a BERT-LLA stock price prediction model incorporating multi-source market sentiment and technical analysis. In the sentiment analysis module, we propose a semantic similarity and sector heat-based model to screen for related sectors and use fine-tuned BERT models to calculate the text sentiment index, transforming the text data into sentiment index time series data. In the technical indicator calculation module, technical indicator time series are calculated using market data. Finally, in the prediction module, we combine the sentiment index time series and technical indicator time series and employ a two-layer LSTM network prediction model with an integrated attention mechanism to predict stock close price. Our experiment results show that the BERT-LLA model can accurately capture market sentiment and has a strong practicality and forecasting ability in analyzing market sentiment and stock price prediction.

Suggested Citation

  • Kui Fu & Yanbin Zhang, 2024. "Incorporating Multi-Source Market Sentiment and Price Data for Stock Price Prediction," Mathematics, MDPI, vol. 12(10), pages 1-26, May.
  • Handle: RePEc:gam:jmathe:v:12:y:2024:i:10:p:1572-:d:1396989
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/12/10/1572/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/12/10/1572/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Song, Yu & Akagi, Fumio, 2016. "Application of artificial neural network for the prediction of stock market returns: The case of the Japanese stock marketAuthor-Name: Qiu, Mingyue," Chaos, Solitons & Fractals, Elsevier, vol. 85(C), pages 1-7.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Ehsan Hoseinzade & Saman Haratizadeh & Arash Khoeini, 2019. "U-CNNpred: A Universal CNN-based Predictor for Stock Markets," Papers 1911.12540, arXiv.org.
    2. M. Mallikarjuna & R. Prabhakara Rao, 2019. "Evaluation of forecasting methods from selected stock market returns," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 5(1), pages 1-16, December.
    3. Xuan-Hong Dang & Syed Yousaf Shah & Petros Zerfos, 2019. ""The Squawk Bot": Joint Learning of Time Series and Text Data Modalities for Automated Financial Information Filtering," Papers 1912.10858, arXiv.org.
    4. Erdinc Akyildirim & Aurelio F. Bariviera & Duc Khuong Nguyen & Ahmet Sensoy, 2022. "Forecasting high-frequency stock returns: a comparison of alternative methods," Annals of Operations Research, Springer, vol. 313(2), pages 639-690, June.
    5. Ehsan Hoseinzade & Saman Haratizadeh, 2018. "CNNPred: CNN-based stock market prediction using several data sources," Papers 1810.08923, arXiv.org.
    6. Manel Hamdi & Walid Chkili, 2019. "An artificial neural network augmented GARCH model for Islamic stock market volatility: Do asymmetry and long memory matter?," Working Papers 13, Economic Research Forum, revised 21 Aug 2019.
    7. Sujin Pyo & Jaewook Lee & Mincheol Cha & Huisu Jang, 2017. "Predictability of machine learning techniques to forecast the trends of market index prices: Hypothesis testing for the Korean stock markets," PLOS ONE, Public Library of Science, vol. 12(11), pages 1-17, November.
    8. Eva DEZSI & Ioan Alin NISTOR, 2016. "Can Deep Machine Learning Outsmart The Market? A Comparison Between Econometric Modelling And Long- Short Term Memory," Romanian Economic Business Review, Romanian-American University, vol. 11(4.1), pages 54-73, december.
    9. Chia-Cheng Chen & Yisheng Liu & Ting-Hsin Hsu, 2019. "An Analysis on Investment Performance of Machine Learning: An Empirical Examination on Taiwan Stock Market," International Journal of Economics and Financial Issues, Econjournals, vol. 9(4), pages 1-10.
    10. Chia-Cheng Chen & Chun-Hung Chen & Ting-Yin Liu, 2020. "Investment Performance of Machine Learning: Analysis of S&P 500 Index," International Journal of Economics and Financial Issues, Econjournals, vol. 10(1), pages 59-66.
    11. Wei Liu & Yoshihisa Suzuki & Shuyi Du, 2024. "Forecasting the Stock Price of Listed Innovative SMEs Using Machine Learning Methods Based on Bayesian optimization: Evidence from China," Computational Economics, Springer;Society for Computational Economics, vol. 63(5), pages 2035-2068, May.
    12. Jun Zhang & Lan Li & Wei Chen, 2021. "Predicting Stock Price Using Two-Stage Machine Learning Techniques," Computational Economics, Springer;Society for Computational Economics, vol. 57(4), pages 1237-1261, April.
    13. Duan, Yunlong & Mu, Chang & Yang, Meng & Deng, Zhiqing & Chin, Tachia & Zhou, Li & Fang, Qifeng, 2021. "Study on early warnings of strategic risk during the process of firms’ sustainable innovation based on an optimized genetic BP neural networks model: Evidence from Chinese manufacturing firms," International Journal of Production Economics, Elsevier, vol. 242(C).
    14. Chen, Wei & Zhang, Haoyu & Jia, Lifen, 2022. "A novel two-stage method for well-diversified portfolio construction based on stock return prediction using machine learning," The North American Journal of Economics and Finance, Elsevier, vol. 63(C).
    15. Sun, Ying & Zhang, Luying & Yao, Minghui, 2023. "Chaotic time series prediction of nonlinear systems based on various neural network models," Chaos, Solitons & Fractals, Elsevier, vol. 175(P1).
    16. Rohitash Chandra & Yixuan He, 2021. "Bayesian neural networks for stock price forecasting before and during COVID-19 pandemic," PLOS ONE, Public Library of Science, vol. 16(7), pages 1-32, July.
    17. Jujie Wang & Yinan Liao & Zhenzhen Zhuang & Dongming Gao, 2021. "An Optimal Weighted Combined Model Coupled with Feature Reconstruction and Deep Learning for Multivariate Stock Index Forecasting," Mathematics, MDPI, vol. 9(21), pages 1-20, October.
    18. Gourav Kumar & Uday Pratap Singh & Sanjeev Jain, 2022. "Swarm Intelligence Based Hybrid Neural Network Approach for Stock Price Forecasting," Computational Economics, Springer;Society for Computational Economics, vol. 60(3), pages 991-1039, October.
    19. Faraz Sasani & Ramin Mousa & Ali Karkehabadi & Samin Dehbashi & Ali Mohammadi, 2023. "TM-vector: A Novel Forecasting Approach for Market stock movement with a Rich Representation of Twitter and Market data," Papers 2304.02094, arXiv.org.
    20. Ku, Seungmo & Lee, Changju & Chang, Woojin & Wook Song, Jae, 2020. "Fractal structure in the S&P500: A correlation-based threshold network approach," Chaos, Solitons & Fractals, Elsevier, vol. 137(C).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:12:y:2024:i:10:p:1572-:d:1396989. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.