IDEAS home Printed from https://ideas.repec.org/a/gam/jecnmx/v12y2024i2p16-d1412218.html
   My bibliography  Save this article

Predicting the Direction of NEPSE Index Movement with News Headlines Using Machine Learning

Author

Listed:
  • Keshab Raj Dahal

    (Department of Mathematics, State University of New York Cortland, Cortland, NY 13045, USA)

  • Ankrit Gupta

    (Department of Computer Science, Central Michigan University, Mt Pleasant, MI 48859, USA)

  • Nawa Raj Pokhrel

    (Department of Physics and Computer Science, Xavier University of Louisiana, New Orleans, LA 70125, USA)

Abstract

Predicting stock market movement direction is a challenging task due to its fuzzy, chaotic, volatile, nonlinear, and complex nature. However, with advancements in artificial intelligence, abundant data availability, and improved computational capabilities, creating robust models capable of accurately predicting stock market movement is now feasible. This study aims to construct a predictive model using news headlines to predict stock market movement direction. It conducts a comparative analysis of five supervised classification machine learning algorithms—logistic regression (LR), support vector machine (SVM), random forest (RF), extreme gradient boosting (XGBoost), and artificial neural network (ANN)—to predict the next day’s movement direction of the close price of the Nepal Stock Exchange (NEPSE) index. Sentiment scores from news headlines are computed using the Valence Aware Dictionary for Sentiment Reasoning (VADER) and TextBlob sentiment analyzer. The models’ performance is evaluated based on sensitivity, specificity, accuracy, and the area under the receiver operating characteristic (ROC) curve (AUC). Experimental results reveal that all five models perform equally well when using sentiment scores from the TextBlob analyzer. Similarly, all models exhibit almost identical performance when using sentiment scores from the VADER analyzer, except for minor variations in AUC in SVM vs. LR and SVM vs. ANN. Moreover, models perform relatively better when using sentiment scores from the TextBlob analyzer compared to the VADER analyzer. These findings are further validated through statistical tests.

Suggested Citation

  • Keshab Raj Dahal & Ankrit Gupta & Nawa Raj Pokhrel, 2024. "Predicting the Direction of NEPSE Index Movement with News Headlines Using Machine Learning," Econometrics, MDPI, vol. 12(2), pages 1-26, June.
  • Handle: RePEc:gam:jecnmx:v:12:y:2024:i:2:p:16-:d:1412218
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2225-1146/12/2/16/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2225-1146/12/2/16/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Li, Qiong & Meng, Qinglin & Cai, Jiejin & Yoshino, Hiroshi & Mochida, Akashi, 2009. "Applying support vector machine to predict hourly cooling load in the building," Applied Energy, Elsevier, vol. 86(10), pages 2249-2256, October.
    2. Shun Chen & Lei Ge, 2019. "Exploring the attention mechanism in LSTM-based Hong Kong stock price movement prediction," Quantitative Finance, Taylor & Francis Journals, vol. 19(9), pages 1507-1515, September.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Tian, Wei & Song, Jitian & Li, Zhanyong & de Wilde, Pieter, 2014. "Bootstrap techniques for sensitivity analysis and model selection in building thermal performance analysis," Applied Energy, Elsevier, vol. 135(C), pages 320-328.
    2. Ahmad, Muhammad Waseem & Mourshed, Monjur & Rezgui, Yacine, 2018. "Tree-based ensemble methods for predicting PV power generation and their comparison with support vector regression," Energy, Elsevier, vol. 164(C), pages 465-474.
    3. Amasyali, Kadir & El-Gohary, Nora M., 2018. "A review of data-driven building energy consumption prediction studies," Renewable and Sustainable Energy Reviews, Elsevier, vol. 81(P1), pages 1192-1205.
    4. Ling, Jihong & Zhang, Bingyang & Dai, Na & Xing, Jincheng, 2023. "Coupling input feature construction methods and machine learning algorithms for hourly secondary supply temperature prediction," Energy, Elsevier, vol. 278(C).
    5. Wang, Chen & Shen, Dehua & Li, Youwei, 2022. "Aggregate Investor Attention and Bitcoin Return: The Long Short-term Memory Networks Perspective," Finance Research Letters, Elsevier, vol. 49(C).
    6. Wang, Ran & Lu, Shilei & Feng, Wei, 2020. "A novel improved model for building energy consumption prediction based on model integration," Applied Energy, Elsevier, vol. 262(C).
    7. Afroz, Zakia & Urmee, Tania & Shafiullah, G.M. & Higgins, Gary, 2018. "Real-time prediction model for indoor temperature in a commercial building," Applied Energy, Elsevier, vol. 231(C), pages 29-53.
    8. Luca Grilli & Domenico Santoro, 2022. "Forecasting financial time series with Boltzmann entropy through neural networks," Computational Management Science, Springer, vol. 19(4), pages 665-681, October.
    9. Yi Fu & Shuai Cao & Tao Pang, 2020. "A Sustainable Quantitative Stock Selection Strategy Based on Dynamic Factor Adjustment," Sustainability, MDPI, vol. 12(10), pages 1-12, May.
    10. Mohammad Nikoo & Akbar Karimi & Reza Kerachian & Hamed Poorsepahy-Samian & Farhang Daneshmand, 2013. "Rules for Optimal Operation of Reservoir-River-Groundwater Systems Considering Water Quality Targets: Application of M5P Model," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 27(8), pages 2771-2784, June.
    11. Qinkai Chen, 2021. "Stock Movement Prediction with Financial News using Contextualized Embedding from BERT," Papers 2107.08721, arXiv.org.
    12. Sun, Chunhua & Zhang, Haixiang & Cao, Shanshan & Xia, Guoqiang & Zhong, Jian & Wu, Xiangdong, 2023. "A hierarchical classifying and two-step training strategy for detection and diagnosis of anormal temperature in district heating system," Applied Energy, Elsevier, vol. 349(C).
    13. Muhammad Waseem Ahmad & Anthony Mouraud & Yacine Rezgui & Monjur Mourshed, 2018. "Deep Highway Networks and Tree-Based Ensemble for Predicting Short-Term Building Energy Consumption," Energies, MDPI, vol. 11(12), pages 1-21, December.
    14. Luo, Na & Hong, Tianzhen & Li, Hui & Jia, Ruoxi & Weng, Wenguo, 2017. "Data analytics and optimization of an ice-based energy storage system for commercial buildings," Applied Energy, Elsevier, vol. 204(C), pages 459-475.
    15. Kapp, Sean & Choi, Jun-Ki & Hong, Taehoon, 2023. "Predicting industrial building energy consumption with statistical and machine-learning models informed by physical system parameters," Renewable and Sustainable Energy Reviews, Elsevier, vol. 172(C).
    16. Ahmed Salih Mohammed & Panagiotis G. Asteris & Mohammadreza Koopialipoor & Dimitrios E. Alexakis & Minas E. Lemonis & Danial Jahed Armaghani, 2021. "Stacking Ensemble Tree Models to Predict Energy Performance in Residential Buildings," Sustainability, MDPI, vol. 13(15), pages 1-22, July.
    17. Venkatraj, V. & Dixit, M.K., 2022. "Challenges in implementing data-driven approaches for building life cycle energy assessment: A review," Renewable and Sustainable Energy Reviews, Elsevier, vol. 160(C).
    18. Wang, Zeyu & Srinivasan, Ravi S., 2017. "A review of artificial intelligence based building energy use prediction: Contrasting the capabilities of single and ensemble prediction models," Renewable and Sustainable Energy Reviews, Elsevier, vol. 75(C), pages 796-808.
    19. Lara Ramadan & Isam Shahrour & Hussein Mroueh & Fadi Hage Chehade, 2021. "Use of Machine Learning Methods for Indoor Temperature Forecasting," Future Internet, MDPI, vol. 13(10), pages 1-18, September.
    20. Cameron Francis Assadian & Francis Assadian, 2023. "Data-Driven Modeling of Appliance Energy Usage," Energies, MDPI, vol. 16(22), pages 1-12, November.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jecnmx:v:12:y:2024:i:2:p:16-:d:1412218. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.