
Detection of Temporality at Discourse Level on Financial News by Combining Natural Language Processing and Machine Learning

Author

Listed:
  • Silvia García-Méndez
  • Francisco de Arriba-Pérez
  • Ana Barros-Vila
  • Francisco J. González-Castaño

Abstract

Finance-related news outlets such as Bloomberg News, CNN Business and Forbes are valuable sources of real data for market screening systems. In news articles, experts share opinions that go beyond plain technical analysis and include context such as political, sociological and cultural factors. Within the same text, an expert often discusses the performance of several assets. Some key statements merely describe past events, while others are predictions. Understanding the temporality of the key statements in a text is therefore essential to separate context information from valuable predictions. We propose a novel system to detect the temporality of finance-related news at discourse level. It combines Natural Language Processing and Machine Learning techniques and exploits sophisticated features such as syntactic and semantic dependencies. More specifically, we seek to extract the dominant tenses of the main statements, which may be either explicit or implicit. We tested our system on a labelled dataset of finance-related news annotated by researchers with knowledge of the field. Experimental results reveal high detection precision compared to an alternative rule-based baseline approach. Ultimately, this research contributes to the state of the art in market screening by identifying predictive knowledge for financial decision making.
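
To make the idea of a "dominant tense" concrete, the following is a minimal sketch of what a simple rule-based detector might look like. It is not the authors' system or their baseline: the cue lists, the function names `sentence_tense` and `dominant_tense`, and the majority-vote rule are all assumptions chosen for illustration, and the sketch only handles explicit surface cues, not the implicit temporality the abstract mentions.

```python
import re
from collections import Counter

# Hypothetical cue lists chosen for illustration; they are NOT taken from the paper.
FUTURE_CUES = re.compile(
    r"\b(will|shall|going to|is expected to|are expected to)\b", re.IGNORECASE
)
PAST_CUES = re.compile(
    r"\b(was|were|had|did|fell|rose|gained|lost|closed|reported)\b", re.IGNORECASE
)


def sentence_tense(sentence: str) -> str:
    """Label one sentence as 'future', 'past' or 'present' from surface cues."""
    if FUTURE_CUES.search(sentence):
        return "future"
    if PAST_CUES.search(sentence):
        return "past"
    # No cue matched: fall back to 'present' (an assumption of this sketch).
    return "present"


def dominant_tense(text: str) -> str:
    """Return the most frequent sentence-level label, or 'unknown' for empty text."""
    # Naive sentence split on whitespace that follows '.', '!' or '?'.
    sentences = [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]
    if not sentences:
        return "unknown"
    counts = Counter(sentence_tense(s) for s in sentences)
    return counts.most_common(1)[0][0]


if __name__ == "__main__":
    news = (
        "Shares fell 3% on Monday. "
        "Analysts say the stock will recover. "
        "The company will raise its dividend next year."
    )
    print(dominant_tense(news))
```

A keyword approach like this fails on statements whose temporality is implicit, which is presumably why the abstract emphasises syntactic and semantic dependency features and a learned model.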

Suggested Citation

  • Silvia García-Méndez & Francisco de Arriba-Pérez & Ana Barros-Vila & Francisco J. González-Castaño, 2024. "Detection of Temporality at Discourse Level on Financial News by Combining Natural Language Processing and Machine Learning," Papers 2404.01337, arXiv.org.
  • Handle: RePEc:arx:papers:2404.01337

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2404.01337
    File Function: Latest version
    Download Restriction: no

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Francisco de Arriba-Pérez & Silvia García-Méndez & José A. Regueiro-Janeiro & Francisco J. González-Castaño, 2024. "Detection of financial opportunities in micro-blogging data with a stacked classification system," Papers 2404.07224, arXiv.org.
    2. Jung, Sang Hoon & Jeong, Yong Jin, 2021. "Examining stock markets and societal mood using Internet memes," Journal of Behavioral and Experimental Finance, Elsevier, vol. 32(C).
    3. Daniele Ballinari & Simon Behrendt, 2021. "How to gauge investor behavior? A comparison of online investor sentiment measures," Digital Finance, Springer, vol. 3(2), pages 169-204, June.
    4. Audrino, Francesco & Sigrist, Fabio & Ballinari, Daniele, 2020. "The impact of sentiment and attention measures on stock market volatility," International Journal of Forecasting, Elsevier, vol. 36(2), pages 334-357.
    5. Zhen-Hua Yang & Jian-Guo Liu & Chang-Rui Yu & Jing-Ti Han, 2017. "Quantifying the effect of investors’ attention on stock market," PLOS ONE, Public Library of Science, vol. 12(5), pages 1-16, May.
    6. Tihana Škrinjarić, 2019. "Time Varying Spillovers between the Online Search Volume and Stock Returns: Case of CESEE Markets," IJFS, MDPI, vol. 7(4), pages 1-30, October.
    7. Yongqiang Meng & Dehua Shen & Xiong Xiong & Jorgen Vitting Andersen, 2020. "A Socio-Finance Model: The Case of Bitcoin," Documents de travail du Centre d'Economie de la Sorbonne 20031, Université Panthéon-Sorbonne (Paris 1), Centre d'Economie de la Sorbonne.
    8. Bianconi, Marcelo & Hua, Xiaxin & Tan, Chih Ming, 2015. "Determinants of systemic risk and information dissemination," International Review of Economics & Finance, Elsevier, vol. 38(C), pages 352-368.
    9. Geng, Yuedan & Ye, Qiang & Jin, Yu & Shi, Wen, 2022. "Crowd wisdom and internet searches: What happens when investors search for stocks?," International Review of Financial Analysis, Elsevier, vol. 82(C).
    10. Chen, Hongtao & Fang, Xiumei & Xiang, Erwei & Ji, Xiaojia & An, Maolin, 2023. "Do online media and investor attention affect corporate environmental information disclosure? Evidence from Chinese listed companies," International Review of Economics & Finance, Elsevier, vol. 86(C), pages 1022-1040.
    11. Papadamou, Stephanos & Fassas, Athanasios P. & Kenourgios, Dimitris & Dimitriou, Dimitrios, 2023. "Effects of the first wave of COVID-19 pandemic on implied stock market volatility: International evidence using a google trend measure," The Journal of Economic Asymmetries, Elsevier, vol. 28(C).
    12. Liu, Yuanyuan & Niu, Zibo & Suleman, Muhammad Tahir & Yin, Libo & Zhang, Hongwei, 2022. "Forecasting the volatility of crude oil futures: The role of oil investor attention and its regime switching characteristics under a high-frequency framework," Energy, Elsevier, vol. 238(PA).
    13. Gao, Yang & Wang, Yaojun & Wang, Chao & Liu, Chao, 2018. "Internet attention and information asymmetry: Evidence from Qihoo 360 search data on the Chinese stock market," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 510(C), pages 802-811.
    14. Hailiang Huang & Yanhong Li & Yingying Zhang, 2018. "Investors’ attention and overpricing of IPO: an empirical study on China’s growth enterprise market," Information Systems and e-Business Management, Springer, vol. 16(4), pages 761-774, November.
    15. Matija Piškorec & Nino Antulov-Fantulin & Petra Kralj Novak & Igor Mozetič & Miha Grčar & Irena Vodenska & Tomislav Šmuc, 2014. "News Cohesiveness: an Indicator of Systemic Risk in Financial Markets," Papers 1402.3483, arXiv.org.
    16. Cai, Wenwu & Lu, Jing, 2019. "Investors’ financial attention frequency and trading activity," Pacific-Basin Finance Journal, Elsevier, vol. 58(C).
    17. repec:ipg:wpaper:2014-405 is not listed on IDEAS
    18. Sophie Cockcroft & Mark Russell, 2018. "Big Data Opportunities for Accounting and Finance Practice and Research," Australian Accounting Review, CPA Australia, vol. 28(3), pages 323-333, September.
    19. Aharon, David Y. & Qadan, Mahmoud, 2020. "When do retail investors pay attention to their trading platforms?," The North American Journal of Economics and Finance, Elsevier, vol. 53(C).
    20. Jain, Anshul & Biswal, Pratap Chandra, 2019. "Does internet search interest for gold move the gold spot, stock and exchange rate markets? A study from India," Resources Policy, Elsevier, vol. 61(C), pages 501-507.
    21. Xiao, Jihong & Wang, Yudong, 2021. "Investor attention and oil market volatility: Does economic policy uncertainty matter?," Energy Economics, Elsevier, vol. 97(C).

