IDEAS home Printed from https://ideas.repec.org/a/kap/compec/v64y2024i1d10.1007_s10614-023-10443-x.html
   My bibliography  Save this article

Predicting Firm Financial Performance from SEC Filing Changes Using Automatically Generated Dictionary

Author

Listed:
  • Aparna Gupta

    (Rensselaer Polytechnic Institute)

  • Vipula Rawte

    (Rensselaer Polytechnic Institute)

  • Mohammed J. Zaki

    (Rensselaer Polytechnic Institute)

Abstract

Textual data are increasingly used to predict firm performance, however extracting useful signals towards serving this goal with a continuously growing repository of financial reports and documents is challenging, even by the state-of-the-art machine learning and natural language processing (NLP) techniques. We propose a novel approach to automatically create a word list from SEC filings (10-K and 8-K reports) using advanced deep learning and NLP techniques and compare their performance against the widely used Loughran–McDonald sentiment dictionaries. We additionally analyze a corpus of 8-K and 10-K documents to evaluate their relative informativeness for firm performance prediction. Since 8-K filings provide corporate updates along a fiscal year, we compare their content against changes in 10-Ks between consecutive years to assess the incremental value of information provided in these regulatory filings. Information effectiveness is examined by predicting six key financial indicators for a set of US banks using ridge regression. Our results positively support sentiment dictionaries expansion by automatically extracting meaning from text and highlight the benefits obtainable from utilizing update filings.

Suggested Citation

  • Aparna Gupta & Vipula Rawte & Mohammed J. Zaki, 2024. "Predicting Firm Financial Performance from SEC Filing Changes Using Automatically Generated Dictionary," Computational Economics, Springer;Society for Computational Economics, vol. 64(1), pages 307-334, July.
  • Handle: RePEc:kap:compec:v:64:y:2024:i:1:d:10.1007_s10614-023-10443-x
    DOI: 10.1007/s10614-023-10443-x
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10614-023-10443-x
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s10614-023-10443-x?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Paul C. Tetlock & Maytal Saar‐Tsechansky & Sofus Macskassy, 2008. "More Than Words: Quantifying Language to Measure Firms' Fundamentals," Journal of Finance, American Finance Association, vol. 63(3), pages 1437-1467, June.
    2. Tim Loughran & Bill Mcdonald, 2016. "Textual Analysis in Accounting and Finance: A Survey," Journal of Accounting Research, Wiley Blackwell, vol. 54(4), pages 1187-1230, September.
    3. Faisal Khalil & Gordon Pipa, 2022. "Is Deep-Learning and Natural Language Processing Transcending the Financial Forecasting? Investigation Through Lens of News Analytic Process," Computational Economics, Springer;Society for Computational Economics, vol. 60(1), pages 147-171, June.
    4. Juvenal José Duarte & Sahudy Montenegro González & José César Cruz, 2021. "Predicting Stock Price Falls Using News Data: Evidence from the Brazilian Market," Computational Economics, Springer;Society for Computational Economics, vol. 57(1), pages 311-340, January.
    5. Edward I. Altman, 1968. "Financial Ratios, Discriminant Analysis And The Prediction Of Corporate Bankruptcy," Journal of Finance, American Finance Association, vol. 23(4), pages 589-609, September.
    6. Tim Loughran & Bill Mcdonald, 2011. "When Is a Liability Not a Liability? Textual Analysis, Dictionaries, and 10‐Ks," Journal of Finance, American Finance Association, vol. 66(1), pages 35-65, February.
    7. Tsai, Ming-Feng & Wang, Chuan-Ju, 2017. "On the risk prediction and analysis of soft information in finance reports," European Journal of Operational Research, Elsevier, vol. 257(1), pages 243-250.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Kumar, Rahul & Deb, Soumya Guha & Mukherjee, Shubhadeep, 2020. "Do words reveal the latent truth? Identifying communication patterns of corporate losers," Journal of Behavioral and Experimental Finance, Elsevier, vol. 26(C).
    2. Leilane de Freitas Rocha Cambara & Roberto Meurer, 2023. "News sentiment and foreign portfolio investment in Brazil," International Journal of Finance & Economics, John Wiley & Sons, Ltd., vol. 28(3), pages 3332-3348, July.
    3. Jana P. Fidrmuc & Tereza Tykvova, 2023. "Are Acquirer Shareholders Happier when Their Industries Are Unhappy?," Swiss Finance Institute Research Paper Series 23-52, Swiss Finance Institute.
    4. Bannier, Christina E. & Pauls, Thomas & Walter, Andreas, 2017. "CEO-speeches and stock returns," VfS Annual Conference 2017 (Vienna): Alternative Structures for Money and Banking 168192, Verein für Socialpolitik / German Economic Association.
    5. Doshi, Hitesh & Patel, Saurin & Ramani, Srikanth & Sooy, Matthew, 2023. "Uncertain tone, asset volatility and credit default swap spreads," Journal of Contemporary Accounting and Economics, Elsevier, vol. 19(3).
    6. Jie Hao & Viet T. Pham, 2022. "COVID‐19 Disclosures and Market Uncertainty: Evidence from 10‐Q Filings," Australian Accounting Review, CPA Australia, vol. 32(2), pages 238-266, June.
    7. An, Suwei, 2023. "Essays on incentive contracts, M&As, and firm risk," Other publications TiSEM dd97d2f5-1c9d-47c5-ba62-f, Tilburg University, School of Economics and Management.
    8. D. G. DeBoskey & Yan Luo & Linying Zhou, 2019. "CEO power, board oversight, and earnings announcement tone," Review of Quantitative Finance and Accounting, Springer, vol. 52(2), pages 657-680, February.
    9. Renato Camodeca & Alex Almici & Umberto Sagliaschi, 2018. "Sustainability Disclosure in Integrated Reporting: Does It Matter to Investors? A Cheap Talk Approach," Sustainability, MDPI, vol. 10(12), pages 1-34, November.
    10. Vegard Høghaug Larsen & Leif Anders Thorsrud, 2022. "Asset returns, news topics, and media effects," Scandinavian Journal of Economics, Wiley Blackwell, vol. 124(3), pages 838-868, July.
    11. Koulikidou, Kleopatra & Chantziaras, Antonios & Dedoulis, Emmanouil & Leventis, Stergios, 2023. "Regulatory enforcement, foreignness, and language negativity: Evidence from SEC comment letters," Journal of International Accounting, Auditing and Taxation, Elsevier, vol. 51(C).
    12. Jiang, Cuiqing & Lyu, Ximei & Yuan, Yufei & Wang, Zhao & Ding, Yong, 2022. "Mining semantic features in current reports for financial distress prediction: Empirical evidence from unlisted public firms in China," International Journal of Forecasting, Elsevier, vol. 38(3), pages 1086-1099.
    13. Jun Qi & Qinwei Chi & Ni Yang & Junyan Ouyang, 2023. "The Impact of the Tone of a Prospectus on IPO Underpricing: Evidence from China," Australian Accounting Review, CPA Australia, vol. 33(4), pages 375-390, December.
    14. Goodell, John W. & Kumar, Satish & Lim, Weng Marc & Pattnaik, Debidutta, 2021. "Artificial intelligence and machine learning in finance: Identifying foundations, themes, and research clusters from bibliometric analysis," Journal of Behavioral and Experimental Finance, Elsevier, vol. 32(C).
    15. Durnev, Art & Mangen, Claudine, 2020. "The spillover effects of MD&A disclosures for real investment: The role of industry competition," Journal of Accounting and Economics, Elsevier, vol. 70(1).
    16. Fang, Yiwei & Fiordelisi, Franco & Hasan, Iftekhar & Leung, Woon Sau & Wong, Gabriel, 2023. "Corporate culture and firm value: Evidence from crisis," Journal of Banking & Finance, Elsevier, vol. 146(C).
    17. Loughran, Tim & McDonald, Bill & Pragidis, Ioannis, 2019. "Assimilation of oil news into prices," International Review of Financial Analysis, Elsevier, vol. 63(C), pages 105-118.
    18. Renault, Thomas, 2017. "Intraday online investor sentiment and return patterns in the U.S. stock market," Journal of Banking & Finance, Elsevier, vol. 84(C), pages 25-40.
    19. Anand, Abhinav & Basu, Sankarshan & Pathak, Jalaj & Thampy, Ashok, 2021. "The impact of sentiment on emerging stock markets," International Review of Economics & Finance, Elsevier, vol. 75(C), pages 161-177.
    20. Matteo Cinelli & Valerio Ficcadenti & Jessica Riccioni, 2020. "The interconnectedness of the economic content in the speeches of the US Presidents," Papers 2002.07880, arXiv.org.

    More about this item

    Keywords

    Regulatory filings; Text analytics; Bank risk; Performance; Prediction; Attention score;
    All these keywords.

    JEL classification:

    • C18 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - Methodolical Issues: General
    • C45 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods: Special Topics - - - Neural Networks and Related Topics
    • C53 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Forecasting and Prediction Models; Simulation Methods

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:kap:compec:v:64:y:2024:i:1:d:10.1007_s10614-023-10443-x. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.