IDEAS home Printed from https://ideas.repec.org/a/gam/jecnmx/v12y2024i2p16-d1412218.html
   My bibliography  Save this article

Predicting the Direction of NEPSE Index Movement with News Headlines Using Machine Learning

Author

Listed:
  • Keshab Raj Dahal

    (Department of Mathematics, State University of New York Cortland, Cortland, NY 13045, USA)

  • Ankrit Gupta

    (Department of Computer Science, Central Michigan University, Mt Pleasant, MI 48859, USA)

  • Nawa Raj Pokhrel

    (Department of Physics and Computer Science, Xavier University of Louisiana, New Orleans, LA 70125, USA)

Abstract

Predicting stock market movement direction is a challenging task due to its fuzzy, chaotic, volatile, nonlinear, and complex nature. However, with advancements in artificial intelligence, abundant data availability, and improved computational capabilities, creating robust models capable of accurately predicting stock market movement is now feasible. This study aims to construct a predictive model using news headlines to predict stock market movement direction. It conducts a comparative analysis of five supervised classification machine learning algorithms—logistic regression (LR), support vector machine (SVM), random forest (RF), extreme gradient boosting (XGBoost), and artificial neural network (ANN)—to predict the next day’s movement direction of the close price of the Nepal Stock Exchange (NEPSE) index. Sentiment scores from news headlines are computed using the Valence Aware Dictionary for Sentiment Reasoning (VADER) and TextBlob sentiment analyzer. The models’ performance is evaluated based on sensitivity, specificity, accuracy, and the area under the receiver operating characteristic (ROC) curve (AUC). Experimental results reveal that all five models perform equally well when using sentiment scores from the TextBlob analyzer. Similarly, all models exhibit almost identical performance when using sentiment scores from the VADER analyzer, except for minor variations in AUC in SVM vs. LR and SVM vs. ANN. Moreover, models perform relatively better when using sentiment scores from the TextBlob analyzer compared to the VADER analyzer. These findings are further validated through statistical tests.

Suggested Citation

  • Keshab Raj Dahal & Ankrit Gupta & Nawa Raj Pokhrel, 2024. "Predicting the Direction of NEPSE Index Movement with News Headlines Using Machine Learning," Econometrics, MDPI, vol. 12(2), pages 1-26, June.
  • Handle: RePEc:gam:jecnmx:v:12:y:2024:i:2:p:16-:d:1412218
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2225-1146/12/2/16/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2225-1146/12/2/16/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Li, Qiong & Meng, Qinglin & Cai, Jiejin & Yoshino, Hiroshi & Mochida, Akashi, 2009. "Applying support vector machine to predict hourly cooling load in the building," Applied Energy, Elsevier, vol. 86(10), pages 2249-2256, October.
    2. Shun Chen & Lei Ge, 2019. "Exploring the attention mechanism in LSTM-based Hong Kong stock price movement prediction," Quantitative Finance, Taylor & Francis Journals, vol. 19(9), pages 1507-1515, September.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jinrong Wu & Su Nguyen & Damminda Alahakoon & Daswin De Silva & Nishan Mills & Prabod Rathnayaka & Harsha Moraliyage & Andrew Jennings, 2024. "A Comparative Analysis of Machine Learning-Based Energy Baseline Models across Multiple Building Types," Energies, MDPI, vol. 17(6), pages 1-18, March.
    2. Hyo-Jun Kim & Young-Hum Cho, 2021. "Optimal Control Method of Variable Air Volume Terminal Unit System," Energies, MDPI, vol. 14(22), pages 1-15, November.
    3. Tian, Wei & Song, Jitian & Li, Zhanyong & de Wilde, Pieter, 2014. "Bootstrap techniques for sensitivity analysis and model selection in building thermal performance analysis," Applied Energy, Elsevier, vol. 135(C), pages 320-328.
    4. Ahmad, Muhammad Waseem & Mourshed, Monjur & Rezgui, Yacine, 2018. "Tree-based ensemble methods for predicting PV power generation and their comparison with support vector regression," Energy, Elsevier, vol. 164(C), pages 465-474.
    5. Amasyali, Kadir & El-Gohary, Nora M., 2018. "A review of data-driven building energy consumption prediction studies," Renewable and Sustainable Energy Reviews, Elsevier, vol. 81(P1), pages 1192-1205.
    6. Muideen Adegoke & Alaka Hafiz & Saheed Ajayi & Razak Olu-Ajayi, 2022. "Application of Multilayer Extreme Learning Machine for Efficient Building Energy Prediction," Energies, MDPI, vol. 15(24), pages 1-21, December.
    7. Yuwen You & Zhonghua Wang & Zhihao Liu & Chunmei Guo & Bin Yang, 2024. "Load Prediction of Regional Heat Exchange Station Based on Fuzzy Clustering Based on Fourier Distance and Convolutional Neural Network–Bidirectional Long Short-Term Memory Network," Energies, MDPI, vol. 17(16), pages 1-19, August.
    8. Ling, Jihong & Zhang, Bingyang & Dai, Na & Xing, Jincheng, 2023. "Coupling input feature construction methods and machine learning algorithms for hourly secondary supply temperature prediction," Energy, Elsevier, vol. 278(C).
    9. Sun, Chuting & Wu, Qi & Yan, Xing, 2024. "Dynamic CVaR portfolio construction with attention-powered generative factor learning," Journal of Economic Dynamics and Control, Elsevier, vol. 160(C).
    10. Wang, Chen & Shen, Dehua & Li, Youwei, 2022. "Aggregate Investor Attention and Bitcoin Return: The Long Short-term Memory Networks Perspective," Finance Research Letters, Elsevier, vol. 49(C).
    11. Zhang, Xingxing & Lovati, Marco & Vigna, Ilaria & Widén, Joakim & Han, Mengjie & Gal, Csilla & Feng, Tao, 2018. "A review of urban energy systems at building cluster level incorporating renewable-energy-source (RES) envelope solutions," Applied Energy, Elsevier, vol. 230(C), pages 1034-1056.
    12. Wang, Ran & Lu, Shilei & Feng, Wei, 2020. "A novel improved model for building energy consumption prediction based on model integration," Applied Energy, Elsevier, vol. 262(C).
    13. Jiang, Ben & Li, Yu & Rezgui, Yacine & Zhang, Chengyu & Wang, Peng & Zhao, Tianyi, 2024. "Multi-source domain generalization deep neural network model for predicting energy consumption in multiple office buildings," Energy, Elsevier, vol. 299(C).
    14. Jiang, Lai & Yao, Runming & Liu, Kecheng & McCrindle, Rachel, 2017. "An Epistemic-Deontic-Axiologic (EDA) agent-based energy management system in office buildings," Applied Energy, Elsevier, vol. 205(C), pages 440-452.
    15. Afroz, Zakia & Urmee, Tania & Shafiullah, G.M. & Higgins, Gary, 2018. "Real-time prediction model for indoor temperature in a commercial building," Applied Energy, Elsevier, vol. 231(C), pages 29-53.
    16. Yu, Binbin & Li, Jianjing & Liu, Che & Sun, Bo, 2022. "A novel short-term electrical load forecasting framework with intelligent feature engineering," Applied Energy, Elsevier, vol. 327(C).
    17. Luca Grilli & Domenico Santoro, 2022. "Forecasting financial time series with Boltzmann entropy through neural networks," Computational Management Science, Springer, vol. 19(4), pages 665-681, October.
    18. Yi Fu & Shuai Cao & Tao Pang, 2020. "A Sustainable Quantitative Stock Selection Strategy Based on Dynamic Factor Adjustment," Sustainability, MDPI, vol. 12(10), pages 1-12, May.
    19. Mohammad Nikoo & Akbar Karimi & Reza Kerachian & Hamed Poorsepahy-Samian & Farhang Daneshmand, 2013. "Rules for Optimal Operation of Reservoir-River-Groundwater Systems Considering Water Quality Targets: Application of M5P Model," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 27(8), pages 2771-2784, June.
    20. Jawed Mustafa & Fahad Awjah Almehmadi & Saeed Alqaed & Mohsen Sharifpur, 2022. "Building a Sustainable Energy Community: Design and Integrate Variable Renewable Energy Systems for Rural Communities," Sustainability, MDPI, vol. 14(21), pages 1-21, October.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jecnmx:v:12:y:2024:i:2:p:16-:d:1412218. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.