IDEAS home Printed from https://ideas.repec.org/p/iim/iimawp/14607.html
   My bibliography  Save this paper

Buy, Sell or Hold: Entity-Aware Classification of Business News

Author

Listed:
  • Sinha, Ankur
  • Kedas, Satishwar
  • Kumar, Rishu
  • Malo, Pekka

Abstract

Financial sector is expected to be at the forefront of the adoption of machine learning methods, driven by the superior performance of the data-driven approaches over traditional modelling approaches. There has been a widespread interest in automatically extracting information from financial news flow as the signals might be useful for investment decisions. While quantitative finance focuses on analysis of structured financial data for investment decisions, the potential of utilizing unstructured news flow in decision making is not fully tapped. Research in financial news analytics tries to address this gap by detecting events and aspects that provide buy, sell or hold information in news, commonly interpreted as financial sentiments. In this paper, we develop a framework utilizing information theoretic concepts and machine learning methods that understands the context and is capable of extracting buy, sell or hold information contained within news headlines. The proposed framework is also capable of detecting conflicting sentiments on multiple companies within the same news headline, which to our best knowledge has not been studied earlier. Further, we develop an information system which analyzes the news flow in real-time, allowing users to track financial sentiments by company, sector and index via a dashboard. Through this study we make three dataset related contributions - firstly, a training dataset consisting of more than 12,000 news headlines annotated for entities and their relevant financial sentiments by multiple annotators, secondly, an entity database of over 1,000 financial and economic entities relevant to Indian economy and their forms of appearance in news media amounting to over 5,000 phrases and thirdly, make improvements in existing financial dictionaries. Using the proposed system, we study the effect of the information derived from daily news flow in the years 2012 to 2017, over the Indian broad market equity index NSE 500, and show that the information has predictive value.

Suggested Citation

  • Sinha, Ankur & Kedas, Satishwar & Kumar, Rishu & Malo, Pekka, 2019. "Buy, Sell or Hold: Entity-Aware Classification of Business News," IIMA Working Papers WP 2019-04-02, Indian Institute of Management Ahmedabad, Research and Publication Department.
  • Handle: RePEc:iim:iimawp:14607
    as

    Download full text from publisher

    File URL: https://www.iima.ac.in/sites/default/files/rnpfiles/18844866432019-04-02.pdf
    File Function: English Version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Warner, Jerold B. & Watts, Ross L. & Wruck, Karen H., 1988. "Stock prices and top management changes," Journal of Financial Economics, Elsevier, vol. 20(1-2), pages 461-492, January.
    2. Plakandaras, Vasilios & Gupta, Rangan & Gogas, Periklis & Papadimitriou, Theophilos, 2015. "Forecasting the U.S. real house price index," Economic Modelling, Elsevier, vol. 45(C), pages 259-267.
    3. Chambers, Ae & Penman, Sh, 1984. "Timeliness Of Reporting And The Stock-Price Reaction To Earnings Announcements," Journal of Accounting Research, Wiley Blackwell, vol. 22(1), pages 21-47.
    4. Khandani, Amir E. & Kim, Adlar J. & Lo, Andrew W., 2010. "Consumer credit-risk models via machine-learning algorithms," Journal of Banking & Finance, Elsevier, vol. 34(11), pages 2767-2787, November.
    5. Rangan Gupta & Anandamayee Majumdar, 2015. "Forecasting US real house price returns over 1831-2013: evidence from copula models," Applied Economics, Taylor & Francis Journals, vol. 47(48), pages 5204-5213, October.
    6. Gabriele Ranco & Darko Aleksovski & Guido Caldarelli & Miha Grčar & Igor Mozetič, 2015. "The Effects of Twitter Sentiment on Stock Price Returns," PLOS ONE, Public Library of Science, vol. 10(9), pages 1-21, September.
    7. Pekka Malo & Ankur Sinha & Pekka Korhonen & Jyrki Wallenius & Pyry Takala, 2014. "Good debt or bad debt: Detecting semantic orientations in economic texts," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 65(4), pages 782-796, April.
    8. Paul C. Tetlock, 2007. "Giving Content to Investor Sentiment: The Role of Media in the Stock Market," Journal of Finance, American Finance Association, vol. 62(3), pages 1139-1168, June.
    9. repec:bla:jfinan:v:59:y:2004:i:3:p:1259-1294 is not listed on IDEAS
    10. JaeHong Park & Prabhudev Konana & Bin Gu & Alok Kumar & Rajagopal Raghunathan, 2013. "Information Valuation and Confirmation Bias in Virtual Communities: Evidence from Stock Message Boards," Information Systems Research, INFORMS, vol. 24(4), pages 1050-1067, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Ankur Sinha & Satishwar Kedas & Rishu Kumar & Pekka Malo, 2022. "SEntFiN 1.0: Entity‐aware sentiment analysis for financial news," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 73(9), pages 1314-1335, September.
    2. Femg, Xunan & Johansson, Anders C., 2019. "News or Noise? The Information Content of Social Media in China," Stockholm School of Economics Asia Working Paper Series 2019-52, Stockholm School of Economics, Stockholm China Economic Research Institute.
    3. Kirtac, Kemal & Germano, Guido, 2024. "Sentiment trading with large language models," Finance Research Letters, Elsevier, vol. 62(PB).
    4. Chen, Cathy Yi-Hsuan & Fengler, Matthias R. & Härdle, Wolfgang Karl & Liu, Yanchu, 2022. "Media-expressed tone, option characteristics, and stock return predictability," Journal of Economic Dynamics and Control, Elsevier, vol. 134(C).
    5. Thomas Renault, 2020. "Sentiment analysis and machine learning in finance: a comparison of methods and models on one million messages," Digital Finance, Springer, vol. 2(1), pages 1-13, September.
    6. Marlene Amstad & Leonardo Gambacorta & Chao He & Dora Xia, 2021. "Trade sentiment and the stock market: new evidence based on big data textual analysis of Chinese media," BIS Working Papers 917, Bank for International Settlements.
    7. Plakandaras, Vasilios & Papadimitriou, Theophilos & Gogas, Periklis, 2019. "Forecasting transportation demand for the U.S. market," Transportation Research Part A: Policy and Practice, Elsevier, vol. 126(C), pages 195-214.
    8. Paola Cerchiello & Giancarlo Nicola, 2018. "Assessing News Contagion in Finance," Econometrics, MDPI, vol. 6(1), pages 1-19, February.
    9. Vasilios Plakandaras & Elie Bouri & Rangan Gupta, 2019. "Forecasting Bitcoin Returns: Is there a Role for the U.S. – China Trade War?," Working Papers 201980, University of Pretoria, Department of Economics.
    10. Travis Adams & Andrea Ajello & Diego Silva & Francisco Vazquez-Grande, 2023. "More than Words: Twitter Chatter and Financial Market Sentiment," Papers 2305.16164, arXiv.org.
    11. Steven Heston & Nitish R. Sinha, 2016. "News versus Sentiment : Predicting Stock Returns from News Stories," Finance and Economics Discussion Series 2016-048, Board of Governors of the Federal Reserve System (U.S.).
    12. Ahelegbey, Daniel Felix & Cerchiello, Paola & Scaramozzino, Roberta, 2022. "Network based evidence of the financial impact of Covid-19 pandemic," International Review of Financial Analysis, Elsevier, vol. 81(C).
    13. Arcuri, Maria Cristina & Gandolfi, Gino & Russo, Ivan, 2023. "Does fake news impact stock returns? Evidence from US and EU stock markets," Journal of Economics and Business, Elsevier, vol. 125.
    14. David M. Goldberg & Nohel Zaman & Arin Brahma & Mariano Aloiso, 2022. "Are mortgage loan closing delay risks predictable? A predictive analysis using text mining on discussion threads," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 73(3), pages 419-437, March.
    15. Ti-Ching Peng, 2021. "The effect of hazard shock and disclosure information on property and land prices: a machine-learning assessment in the case of Japan," Review of Regional Research: Jahrbuch für Regionalwissenschaft, Springer;Gesellschaft für Regionalforschung (GfR), vol. 41(1), pages 1-32, February.
    16. Asier Guti'errez-Fandi~no & Miquel Noguer i Alonso & Petter Kolm & Jordi Armengol-Estap'e, 2021. "FinEAS: Financial Embedding Analysis of Sentiment," Papers 2111.00526, arXiv.org, revised Nov 2021.
    17. Ingrid E. Fisher & Margaret R. Garnsey & Mark E. Hughes, 2016. "Natural Language Processing in Accounting, Auditing and Finance: A Synthesis of the Literature with a Roadmap for Future Research," Intelligent Systems in Accounting, Finance and Management, John Wiley & Sons, Ltd., vol. 23(3), pages 157-214, July.
    18. Kirtac, Kemal & Germano, Guido, 2024. "Sentiment trading with large language models," LSE Research Online Documents on Economics 122592, London School of Economics and Political Science, LSE Library.
    19. Xu Gong & Keqin Guan & Qiyang Chen, 2022. "The role of textual analysis in oil futures price forecasting based on machine learning approach," Journal of Futures Markets, John Wiley & Sons, Ltd., vol. 42(10), pages 1987-2017, October.
    20. Costola, Michele & Hinz, Oliver & Nofer, Michael & Pelizzon, Loriana, 2023. "Machine learning sentiment analysis, COVID-19 news and stock market reactions," Research in International Business and Finance, Elsevier, vol. 64(C).

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:iim:iimawp:14607. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: the person in charge (email available below). General contact details of provider: https://edirc.repec.org/data/eciimin.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.