IDEAS home Printed from https://ideas.repec.org/a/spr/jcsosc/v2y2019i1d10.1007_s42001-019-00035-x.html
   My bibliography  Save this article

Enhanced news sentiment analysis using deep learning methods

Author

Listed:
  • Wataru Souma

    (Nihon University)

  • Irena Vodenska

    (Boston University
    Boston University)

  • Hideaki Aoyama

    (Kyoto University
    Trade and Industry)

Abstract

We explore the predictive power of historical news sentiments based on financial market performance to forecast financial news sentiments. We define news sentiments based on stock price returns averaged over one minute right after a news article has been released. If the stock price exhibits positive (negative) return, we classify the news article released just prior to the observed stock return as positive (negative). We use Wikipedia and Gigaword five corpus articles from 2014 and we apply the global vectors for word representation method to this corpus to create word vectors to use as inputs into the deep learning TensorFlow network. We analyze high-frequency (intraday) Thompson Reuters News Archive as well as the high-frequency price tick history of the Dow Jones Industrial Average (DJIA 30) Index individual stocks for the period between 1/1/2003 and 12/30/2013. We apply a combination of deep learning methodologies of recurrent neural network with long short-term memory units to train the Thompson Reuters News Archive Data from 2003 to 2012, and we test the forecasting power of our method on 2013 News Archive data. We find that the forecasting accuracy of our methodology improves when we switch from random selection of positive and negative news to selecting the news with highest positive scores as positive news and news with highest negative scores as negative news to create our training data set.

Suggested Citation

  • Wataru Souma & Irena Vodenska & Hideaki Aoyama, 2019. "Enhanced news sentiment analysis using deep learning methods," Journal of Computational Social Science, Springer, vol. 2(1), pages 33-46, January.
  • Handle: RePEc:spr:jcsosc:v:2:y:2019:i:1:d:10.1007_s42001-019-00035-x
    DOI: 10.1007/s42001-019-00035-x
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s42001-019-00035-x
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s42001-019-00035-x?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Wei Bao & Jun Yue & Yulei Rao, 2017. "A deep learning framework for financial time series using stacked autoencoders and long-short term memory," PLOS ONE, Public Library of Science, vol. 12(7), pages 1-24, July.
    2. Luigi Troiano & Elena Mejuto & Pravesh Kriplani, 2017. "On Feature Reduction using Deep Learning for Trend Prediction in Finance," Papers 1704.03205, arXiv.org.
    3. Linda Ponta & Silvano Cincotti, 2018. "Traders’ Networks of Interactions and Structural Properties of Financial Markets: An Agent-Based Approach," Complexity, Hindawi, vol. 2018, pages 1-9, January.
    4. J. B. Heaton & N. G. Polson & J. H. Witte, 2017. "Deep learning for finance: deep portfolios," Applied Stochastic Models in Business and Industry, John Wiley & Sons, vol. 33(1), pages 3-12, January.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Xu Gong & Mengjie Li & Keqin Guan & Chuanwang Sun, 2023. "Climate change attention and carbon futures return prediction," Journal of Futures Markets, John Wiley & Sons, Ltd., vol. 43(9), pages 1261-1288, September.
    2. Ali Lashgari, 2023. "Assessing Text Mining and Technical Analyses on Forecasting Financial Time Series," Papers 2304.14544, arXiv.org.
    3. Amartya Chakraborty & Sunanda Bose, 2020. "Around the world in 60 days: an exploratory study of impact of COVID-19 on online global news sentiment," Journal of Computational Social Science, Springer, vol. 3(2), pages 367-400, November.
    4. Park, Jeongeun & Yang, Donguk & Kim, Ha Young, 2023. "Text mining-based four-step framework for smart speaker product improvement and sales planning," Journal of Retailing and Consumer Services, Elsevier, vol. 71(C).
    5. Yupeng Cao & Zhi Chen & Qingyun Pei & Fabrizio Dimino & Lorenzo Ausiello & Prashant Kumar & K. P. Subbalakshmi & Papa Momar Ndiaye, 2024. "RiskLabs: Predicting Financial Risk Using Large Language Model Based on Multi-Sources Data," Papers 2404.07452, arXiv.org.
    6. Zexin Hu & Yiqi Zhao & Matloob Khushi, 2021. "A Survey of Forex and Stock Price Prediction Using Deep Learning," Papers 2103.09750, arXiv.org.
    7. Aaryan Gupta & Vinya Dengre & Hamza Abubakar Kheruwala & Manan Shah, 2020. "Comprehensive review of text-mining applications in finance," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 6(1), pages 1-25, December.
    8. Kao, Yu-Sheng & Day, Min-Yuh & Chou, Ke-Hsin, 2024. "A comparison of bitcoin futures return and return volatility based on news sentiment contemporaneously or lead-lag," The North American Journal of Economics and Finance, Elsevier, vol. 72(C).
    9. Ryosuke Gotoh, 2024. "Analyzing the influence of web search behavior on electricity market price: a case study of Japan electric power exchange," Journal of Computational Social Science, Springer, vol. 7(1), pages 837-876, April.
    10. Joao Vitor Matos Goncalves & Michel Alexandre & Gilberto Tadeu Lima, 2023. "ARIMA and LSTM: A Comparative Analysis of Financial Time Series Forecasting," Working Papers, Department of Economics 2023_13, University of São Paulo (FEA-USP).
    11. Prajwal Eachempati & Praveen Ranjan Srivastava, 2021. "Accounting for unadjusted news sentiment for asset pricing," Qualitative Research in Financial Markets, Emerald Group Publishing Limited, vol. 13(3), pages 383-422, May.
    12. Bledar Fazlija & Pedro Harder, 2022. "Using Financial News Sentiment for Stock Price Direction Prediction," Mathematics, MDPI, vol. 10(13), pages 1-20, June.
    13. Gianluca Anese & Marco Corazza & Michele Costola & Loriana Pelizzon, 2023. "Impact of public news sentiment on stock market index return and volatility," Computational Management Science, Springer, vol. 20(1), pages 1-36, December.
    14. Anna Ruelens, 2022. "Analyzing user-generated content using natural language processing: a case study of public satisfaction with healthcare systems," Journal of Computational Social Science, Springer, vol. 5(1), pages 731-749, May.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Flori, Andrea & Regoli, Daniele, 2021. "Revealing Pairs-trading opportunities with long short-term memory networks," European Journal of Operational Research, Elsevier, vol. 295(2), pages 772-791.
    2. Kim, A. & Yang, Y. & Lessmann, S. & Ma, T. & Sung, M.-C. & Johnson, J.E.V., 2020. "Can deep learning predict risky retail investors? A case study in financial risk behavior forecasting," European Journal of Operational Research, Elsevier, vol. 283(1), pages 217-234.
    3. Axelsson, Birger & Song, Han-Suck, 2023. "Univariate Forecasting for REITs with Deep Learning: A Comparative Analysis with an ARIMA Model," Working Paper Series 23/10, Royal Institute of Technology, Department of Real Estate and Construction Management & Banking and Finance, revised 14 Nov 2023.
    4. Sang Il Lee & Seong Joon Yoo, 2019. "Multimodal Deep Learning for Finance: Integrating and Forecasting International Stock Markets," Papers 1903.06478, arXiv.org, revised Sep 2019.
    5. Li, Weiping & Mei, Feng, 2020. "Asset returns in deep learning methods: An empirical analysis on SSE 50 and CSI 300," Research in International Business and Finance, Elsevier, vol. 54(C).
    6. Andrea Bucci, 2020. "Realized Volatility Forecasting with Neural Networks," Journal of Financial Econometrics, Oxford University Press, vol. 18(3), pages 502-531.
    7. Eric Benhamou & David Saltiel & Sandrine Ungari & Abhishek Mukhopadhyay & Jamal Atif, 2020. "AAMDRL: Augmented Asset Management with Deep Reinforcement Learning," Papers 2010.08497, arXiv.org.
    8. Paskalis Glabadanidis, 2020. "Portfolio Strategies to Track and Outperform a Benchmark," JRFM, MDPI, vol. 13(8), pages 1-26, August.
    9. Jaydip Sen & Sidra Mehtab & Abhishek Dutta & Saikat Mondal, 2022. "Precise Stock Price Prediction for Optimized Portfolio Design Using an LSTM Model," Papers 2203.01326, arXiv.org.
    10. Eleni Kosta, 2022. "Algorithmic state surveillance: Challenging the notion of agency in human rights," Regulation & Governance, John Wiley & Sons, vol. 16(1), pages 212-224, January.
    11. Jaydip Sen & Sidra Mehtab, 2021. "Design and Analysis of Robust Deep Learning Models for Stock Price Prediction," Papers 2106.09664, arXiv.org.
    12. Yue Chen & Xiaojian Niu & Yan Zhang, 2019. "Exploring Contrarian Degree in the Trading Behavior of China's Stock Market," Complexity, Hindawi, vol. 2019, pages 1-12, April.
    13. Umut Ugurlu & Ilkay Oksuz & Oktay Tas, 2018. "Electricity Price Forecasting Using Recurrent Neural Networks," Energies, MDPI, vol. 11(5), pages 1-23, May.
    14. Jeonggyu Huh, 2018. "Measuring Systematic Risk with Neural Network Factor Model," Papers 1809.04925, arXiv.org.
    15. Amir Mosavi & Pedram Ghamisi & Yaser Faghan & Puhong Duan, 2020. "Comprehensive Review of Deep Reinforcement Learning Methods and Applications in Economics," Papers 2004.01509, arXiv.org.
    16. Ponta, Linda & Puliga, Gloria & Lazzarotti, Valentina & Manzini, Raffaella & Cincotti, Silvano, 2023. "To copatent or not to copatent: An agent-based model for firms facing this dilemma," European Journal of Operational Research, Elsevier, vol. 306(3), pages 1349-1363.
    17. Adebayo Oshingbesan & Eniola Ajiboye & Peruth Kamashazi & Timothy Mbaka, 2022. "Model-Free Reinforcement Learning for Asset Allocation," Papers 2209.10458, arXiv.org.
    18. Rian Dolphin & Barry Smyth & Ruihai Dong, 2024. "Contrastive Learning of Asset Embeddings from Financial Time Series," Papers 2407.18645, arXiv.org.
    19. Tomoshiro Ochiai & Jose C. Nacher, 2020. "Unveiling the directional network behind the financial statements data using volatility constraint correlation," Papers 2008.07836, arXiv.org, revised Jun 2023.
    20. Junyi Li & Xitong Wang & Yaoyang Lin & Arunesh Sinha & Micheal P. Wellman, 2020. "Generating Realistic Stock Market Order Streams," Papers 2006.04212, arXiv.org.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:jcsosc:v:2:y:2019:i:1:d:10.1007_s42001-019-00035-x. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.