IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2410.07143.html
   My bibliography  Save this paper

SARF: Enhancing Stock Market Prediction with Sentiment-Augmented Random Forest

Author

Listed:
  • Saber Talazadeh
  • Dragan Perakovic

Abstract

Stock trend forecasting, a challenging problem in the financial domain, involves ex-tensive data and related indicators. Relying solely on empirical analysis often yields unsustainable and ineffective results. Machine learning researchers have demonstrated that the application of random forest algorithm can enhance predictions in this context, playing a crucial auxiliary role in forecasting stock trends. This study introduces a new approach to stock market prediction by integrating sentiment analysis using FinGPT generative AI model with the traditional Random Forest model. The proposed technique aims to optimize the accuracy of stock price forecasts by leveraging the nuanced understanding of financial sentiments provided by FinGPT. We present a new methodology called "Sentiment-Augmented Random Forest" (SARF), which in-corporates sentiment features into the Random Forest framework. Our experiments demonstrate that SARF outperforms conventional Random Forest and LSTM models with an average accuracy improvement of 9.23% and lower prediction errors in pre-dicting stock market movements.

Suggested Citation

  • Saber Talazadeh & Dragan Perakovic, 2024. "SARF: Enhancing Stock Market Prediction with Sentiment-Augmented Random Forest," Papers 2410.07143, arXiv.org.
  • Handle: RePEc:arx:papers:2410.07143
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2410.07143
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Heyman, Dries & Lescrauwaet, Michiel & Stieperaere, Hannes, 2019. "Investor attention and short-term return reversals," Finance Research Letters, Elsevier, vol. 29(C), pages 1-6.
    2. Xiao-Yang Liu & Guoxuan Wang & Hongyang Yang & Daochen Zha, 2023. "FinGPT: Democratizing Internet-scale Data for Financial Large Language Models," Papers 2307.10485, arXiv.org, revised Nov 2023.
    3. Feuerriegel, Stefan & Gordon, Julius, 2019. "News-based forecasts of macroeconomic indicators: A semantic path model for interpretable predictions," European Journal of Operational Research, Elsevier, vol. 272(1), pages 162-175.
    4. Lohrmann, Christoph & Luukka, Pasi, 2019. "Classification of intraday S&P500 returns with a Random Forest," International Journal of Forecasting, Elsevier, vol. 35(1), pages 390-407.
    5. Sidra Mehtab & Jaydip Sen, 2019. "A Robust Predictive Model for Stock Price Prediction Using Deep Learning and Natural Language Processing," Papers 1912.07700, arXiv.org.
    6. Basak, Suryoday & Kar, Saibal & Saha, Snehanshu & Khaidem, Luckyson & Dey, Sudeepa Roy, 2019. "Predicting the direction of stock market prices using tree-based classifiers," The North American Journal of Economics and Finance, Elsevier, vol. 47(C), pages 552-567.
    7. Narayana Darapaneni & Anwesh Reddy Paduri & Himank Sharma & Milind Manjrekar & Nutan Hindlekar & Pranali Bhagat & Usha Aiyer & Yogesh Agarwal, 2022. "Stock Price Prediction using Sentiment Analysis and Deep Learning for Indian Markets," Papers 2204.05783, arXiv.org.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Henriques, Irene & Sadorsky, Perry, 2023. "Forecasting rare earth stock prices with machine learning," Resources Policy, Elsevier, vol. 86(PA).
    2. Bharat Kumar Meher & Abhishek Anand & Sunil Kumar & Ramona Birau & Manohar Sing, 2024. "Effectiveness of Random Forest Model in Predicting Stock Prices of Solar Energy Companies in India," International Journal of Energy Economics and Policy, Econjournals, vol. 14(2), pages 426-434, March.
    3. Perry Sadorsky, 2021. "Predicting Gold and Silver Price Direction Using Tree-Based Classifiers," JRFM, MDPI, vol. 14(5), pages 1-21, April.
    4. Syed Abul, Basher & Perry, Sadorsky, 2022. "Forecasting Bitcoin price direction with random forests: How important are interest rates, inflation, and market volatility?," MPRA Paper 113293, University Library of Munich, Germany.
    5. Sadorsky, Perry, 2022. "Forecasting solar stock prices using tree-based machine learning classification: How important are silver prices?," The North American Journal of Economics and Finance, Elsevier, vol. 61(C).
    6. Perry Sadorsky, 2021. "A Random Forests Approach to Predicting Clean Energy Stock Prices," JRFM, MDPI, vol. 14(2), pages 1-20, January.
    7. Ghosh, Pushpendu & Neufeld, Ariel & Sahoo, Jajati Keshari, 2022. "Forecasting directional movements of stock prices for intraday trading using LSTM and random forests," Finance Research Letters, Elsevier, vol. 46(PA).
    8. Baoqiang Zhan & Shu Zhang & Helen S. Du & Xiaoguang Yang, 2022. "Exploring Statistical Arbitrage Opportunities Using Machine Learning Strategy," Computational Economics, Springer;Society for Computational Economics, vol. 60(3), pages 861-882, October.
    9. Wang, Jianzhou & Lv, Mengzheng & Wang, Shuai & Gao, Jialu & Zhao, Yang & Wang, Qiangqiang, 2024. "Can multi-period auto-portfolio systems improve returns? Evidence from Chinese and U.S. stock markets," International Review of Financial Analysis, Elsevier, vol. 95(PB).
    10. Jaydip Sen & Sidra Mehtab & Abhishek Dutta & Saikat Mondal, 2022. "Precise Stock Price Prediction for Optimized Portfolio Design Using an LSTM Model," Papers 2203.01326, arXiv.org.
    11. Sidra Mehtab & Jaydip Sen & Subhasis Dasgupta, 2020. "Robust Analysis of Stock Price Time Series Using CNN and LSTM-Based Deep Learning Models," Papers 2011.08011, arXiv.org, revised Jan 2021.
    12. Jaydip Sen & Sidra Mehtab, 2021. "Design and Analysis of Robust Deep Learning Models for Stock Price Prediction," Papers 2106.09664, arXiv.org.
    13. Wang, Delu & Gan, Jun & Mao, Jinqi & Chen, Fan & Yu, Lan, 2023. "Forecasting power demand in China with a CNN-LSTM model including multimodal information," Energy, Elsevier, vol. 263(PE).
    14. Saqib Farid & Rubeena Tashfeen & Tahseen Mohsan & Arsal Burhan, 2023. "Forecasting stock prices using a data mining method: Evidence from emerging market," International Journal of Finance & Economics, John Wiley & Sons, Ltd., vol. 28(2), pages 1911-1917, April.
    15. Anh Duy Nguyen, 2020. "Alternative reversal variable," Post-Print hal-02388743, HAL.
    16. Doumpos, Michalis & Zopounidis, Constantin & Gounopoulos, Dimitrios & Platanakis, Emmanouil & Zhang, Wenke, 2023. "Operational research and artificial intelligence methods in banking," European Journal of Operational Research, Elsevier, vol. 306(1), pages 1-16.
    17. Zhou, Zhongbao & Gao, Meng & Liu, Qing & Xiao, Helu, 2020. "Forecasting stock price movements with multiple data sources: Evidence from stock market in China," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 542(C).
    18. Barboza, Flavio & Altman, Edward, 2024. "Predicting financial distress in Latin American companies: A comparative analysis of logistic regression and random forest models," The North American Journal of Economics and Finance, Elsevier, vol. 72(C).
    19. Thanos Konstantinidis & Giorgos Iacovides & Mingxue Xu & Tony G. Constantinides & Danilo Mandic, 2024. "FinLlama: Financial Sentiment Classification for Algorithmic Trading Applications," Papers 2403.12285, arXiv.org.
    20. Jaydip Sen, 2022. "Designing Efficient Pair-Trading Strategies Using Cointegration for the Indian Stock Market," Papers 2211.07080, arXiv.org.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2410.07143. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.