IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2404.07224.html
   My bibliography  Save this paper

Detection of financial opportunities in micro-blogging data with a stacked classification system

Author

Listed:
  • Francisco de Arriba-P'erez
  • Silvia Garc'ia-M'endez
  • Jos'e A. Regueiro-Janeiro
  • Francisco J. Gonz'alez-Casta~no

Abstract

Micro-blogging sources such as the Twitter social network provide valuable real-time data for market prediction models. Investors' opinions in this network follow the fluctuations of the stock markets and often include educated speculations on market opportunities that may have impact on the actions of other investors. In view of this, we propose a novel system to detect positive predictions in tweets, a type of financial emotions which we term "opportunities" that are akin to "anticipation" in Plutchik's theory. Specifically, we seek a high detection precision to present a financial operator a substantial amount of such tweets while differentiating them from the rest of financial emotions in our system. We achieve it with a three-layer stacked Machine Learning classification system with sophisticated features that result from applying Natural Language Processing techniques to extract valuable linguistic information. Experimental results on a dataset that has been manually annotated with financial emotion and ticker occurrence tags demonstrate that our system yields satisfactory and competitive performance in financial opportunity detection, with precision values up to 83%. This promising outcome endorses the usability of our system to support investors' decision making.

Suggested Citation

  • Francisco de Arriba-P'erez & Silvia Garc'ia-M'endez & Jos'e A. Regueiro-Janeiro & Francisco J. Gonz'alez-Casta~no, 2024. "Detection of financial opportunities in micro-blogging data with a stacked classification system," Papers 2404.07224, arXiv.org.
  • Handle: RePEc:arx:papers:2404.07224
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2404.07224
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Thomas Dimpfl & Stephan Jank, 2016. "Can Internet Search Queries Help to Predict Stock Market Volatility?," European Financial Management, European Financial Management Association, vol. 22(2), pages 171-192, March.
    2. Jitendra Kumar Rout & Kim-Kwang Raymond Choo & Amiya Kumar Dash & Sambit Bakshi & Sanjay Kumar Jena & Karen L. Williams, 2018. "A model for sentiment and emotion analysis of unstructured social media text," Electronic Commerce Research, Springer, vol. 18(1), pages 181-199, March.
    3. Sun, Andrew & Lachanski, Michael & Fabozzi, Frank J., 2016. "Trade the tweet: Social media text mining and sparse matrix factorization for stock market prediction," International Review of Financial Analysis, Elsevier, vol. 48(C), pages 272-281.
    4. Nofer, Michael & Hinz, Oliver, 2015. "Using Twitter to Predict the Stock Market: Where is the Mood Effect?," Publications of Darmstadt Technical University, Institute for Business Studies (BWL) 77140, Darmstadt Technical University, Department of Business Administration, Economics and Law, Institute for Business Studies (BWL).
    5. Pengnate, Supavich (Fone) & Riggins, Frederick J., 2020. "The role of emotion in P2P microfinance funding: A sentiment analysis approach," International Journal of Information Management, Elsevier, vol. 54(C).
    6. Laura K. Rickett, 2016. "Do financial blogs serve an infomediary role in capital markets?," American Journal of Business, Emerald Group Publishing Limited, vol. 31(1), pages 17-40, April.
    7. Xi Zhang & Yunjia Zhang & Senzhang Wang & Yuntao Yao & Binxing Fang & Philip S. Yu, 2018. "Improving Stock Market Prediction via Heterogeneous Information Fusion," Papers 1801.00588, arXiv.org.
    8. Yufeng Wang & Shuangrong Liu & Songqian Li & Jidong Duan & Zhihao Hou & Jia Yu & Kun Ma, 2019. "Stacking-Based Ensemble Learning of Self-Media Data for Marketing Intention Detection," Future Internet, MDPI, vol. 11(7), pages 1-12, July.
    9. Darren Duxbury & Tommy Gärling & Amelie Gamble & Vian Klass, 2020. "How emotions influence behavior in financial markets: a conceptual analysis and emotion-based account of buy-sell preferences," The European Journal of Finance, Taylor & Francis Journals, vol. 26(14), pages 1417-1438, September.
    10. Hui Yuan & Wei Xu & Qian Li & Raymond Lau, 2018. "Topic sentiment mining for sales performance prediction in e-commerce," Annals of Operations Research, Springer, vol. 270(1), pages 553-576, November.
    11. Michael Nofer & Oliver Hinz, 2015. "Using Twitter to Predict the Stock Market," Business & Information Systems Engineering: The International Journal of WIRTSCHAFTSINFORMATIK, Springer;Gesellschaft für Informatik e.V. (GI), vol. 57(4), pages 229-242, August.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Silvia Garc'ia-M'endez & Francisco de Arriba-P'erez & Ana Barros-Vila & Francisco J. Gonz'alez-Casta~no, 2024. "Targeted aspect-based emotion analysis to detect opportunities and precaution in financial Twitter messages," Papers 2404.08665, arXiv.org.
    2. Silvia Garc'ia-M'endez & Francisco de Arriba-P'erez & Ana Barros-Vila & Francisco J. Gonz'alez-Casta~no, 2024. "Detection of Temporality at Discourse Level on Financial News by Combining Natural Language Processing and Machine Learning," Papers 2404.01337, arXiv.org.
    3. Liang, Chao & Wang, Lu & Duong, Duy, 2024. "More attention and better volatility forecast accuracy: How does war attention affect stock volatility predictability?," Journal of Economic Behavior & Organization, Elsevier, vol. 218(C), pages 1-19.
    4. Heba Ali, 2018. "Twitter, Investor Sentiment and Capital Markets: What Do We Know?," International Journal of Economics and Finance, Canadian Center of Science and Education, vol. 10(8), pages 158-158, August.
    5. Jung, Sang Hoon & Jeong, Yong Jin, 2021. "Examining stock markets and societal mood using Internet memes," Journal of Behavioral and Experimental Finance, Elsevier, vol. 32(C).
    6. Daniele Ballinari & Simon Behrendt, 2021. "How to gauge investor behavior? A comparison of online investor sentiment measures," Digital Finance, Springer, vol. 3(2), pages 169-204, June.
    7. Teti, Emanuele & Dallocchio, Maurizio & Aniasi, Alberto, 2019. "The relationship between twitter and stock prices. Evidence from the US technology industry," Technological Forecasting and Social Change, Elsevier, vol. 149(C).
    8. Audrino, Francesco & Sigrist, Fabio & Ballinari, Daniele, 2020. "The impact of sentiment and attention measures on stock market volatility," International Journal of Forecasting, Elsevier, vol. 36(2), pages 334-357.
    9. Bouteska, Ahmed & Ha, Le Thanh & Bhuiyan, Faruk & Sharif, Taimur & Abedin, Mohammad Zoynul, 2024. "Contagion between investor sentiment and green bonds in China during the global uncertainties," International Review of Economics & Finance, Elsevier, vol. 93(PA), pages 469-484.
    10. María José Ayala & Nicolás Gonzálvez-Gallego & Rocío Arteaga-Sánchez, 2024. "Google search volume index and investor attention in stock market: a systematic review," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 10(1), pages 1-29, December.
    11. Sophie Cockcroft & Mark Russell, 2018. "Big Data Opportunities for Accounting and Finance Practice and Research," Australian Accounting Review, CPA Australia, vol. 28(3), pages 323-333, September.
    12. Qadan, Mahmoud & Aharon, David Y. & Cohen, Gil, 2020. "Everybody likes shopping, including the US capital market," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 551(C).
    13. Lehrer, Steven & Xie, Tian & Zhang, Xinyu, 2021. "Social media sentiment, model uncertainty, and volatility forecasting," Economic Modelling, Elsevier, vol. 102(C).
    14. Stefan Stieglitz & Christian Meske & Björn Ross & Milad Mirbabaie, 2020. "Going Back in Time to Predict the Future - The Complex Role of the Data Collection Period in Social Media Analytics," Information Systems Frontiers, Springer, vol. 22(2), pages 395-409, April.
    15. Frank Z. Xing & Erik Cambria & Lorenzo Malandri & Carlo Vercellis, 2018. "Discovering Bayesian Market Views for Intelligent Asset Allocation," Papers 1802.09911, arXiv.org, revised Jun 2018.
    16. Costola, Michele & Hinz, Oliver & Nofer, Michael & Pelizzon, Loriana, 2023. "Machine learning sentiment analysis, COVID-19 news and stock market reactions," Research in International Business and Finance, Elsevier, vol. 64(C).
    17. Fang Wang & Marko Gacesa, 2024. "Semi-strong Efficient Market of Bitcoin and Twitter: an Analysis of Semantic Vector Spaces of Extracted Keywords and Light Gradient Boosting Machine Models," Papers 2409.15988, arXiv.org.
    18. Zhen-Hua Yang & Jian-Guo Liu & Chang-Rui Yu & Jing-Ti Han, 2017. "Quantifying the effect of investors’ attention on stock market," PLOS ONE, Public Library of Science, vol. 12(5), pages 1-16, May.
    19. Youzhu Li & Xianghui Gao & Mingying Du & Rui He & Shanshan Yang & Jason Xiong, 2020. "What Causes Different Sentiment Classification on Social Network Services? Evidence from Weibo with Genetically Modified Food in China," Sustainability, MDPI, vol. 12(4), pages 1-15, February.
    20. K Shiljas & Dilip Kumar & Hajam Abid Bashir, 2023. "Nexus between Twitter-based sentiment and tourism sector performance amid COVID-19 pandemic," Tourism Economics, , vol. 29(8), pages 2200-2205, December.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2404.07224. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.