IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2407.17866.html
   My bibliography  Save this paper

Financial Statement Analysis with Large Language Models

Author

Listed:
  • Alex Kim
  • Maximilian Muhn
  • Valeri Nikolaev

Abstract

We investigate whether large language models (LLMs) can successfully perform financial statement analysis in a way similar to a professional human analyst. We provide standardized and anonymous financial statements to GPT4 and instruct the model to analyze them to determine the direction of firms' future earnings. Even without narrative or industry-specific information, the LLM outperforms financial analysts in its ability to predict earnings changes directionally. The LLM exhibits a relative advantage over human analysts in situations when the analysts tend to struggle. Furthermore, we find that the prediction accuracy of the LLM is on par with a narrowly trained state-of-the-art ML model. LLM prediction does not stem from its training memory. Instead, we find that the LLM generates useful narrative insights about a company's future performance. Lastly, our trading strategies based on GPT's predictions yield a higher Sharpe ratio and alphas than strategies based on other models. Our results suggest that LLMs may take a central role in analysis and decision-making.

Suggested Citation

  • Alex Kim & Maximilian Muhn & Valeri Nikolaev, 2024. "Financial Statement Analysis with Large Language Models," Papers 2407.17866, arXiv.org, revised Nov 2024.
  • Handle: RePEc:arx:papers:2407.17866
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2407.17866
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Stephen H. Penman, 2001. "On Comparing Cash Flow and Accrual Accounting Models for Use in Equity Valuation: A Response to Lundholm and O'Keefe (CAR, Summer 2001)," Contemporary Accounting Research, John Wiley & Sons, vol. 18(4), pages 681-692, December.
    2. Ray Ball & Gil Sadka & Ayung Tseng, 2022. "Using accounting earnings and aggregate economic indicators to estimate firm-level systematic risk," Review of Accounting Studies, Springer, vol. 27(2), pages 607-646, June.
    3. Costello, Anna M. & Down, Andrea K. & Mehta, Mihir N., 2020. "Machine + man: A field experiment on the role of discretion in augmenting AI-based lending models," Journal of Accounting and Economics, Elsevier, vol. 70(2).
    4. Piotroski, JD, 2000. "Value investing: The use of historical financial statement information to separate winners from losers," Journal of Accounting Research, Wiley Blackwell, vol. 38, pages 1-41.
    5. Novy-Marx, Robert, 2013. "The other side of value: The gross profitability premium," Journal of Financial Economics, Elsevier, vol. 108(1), pages 1-28.
    6. S.P. Kothari & Eric So & Rodrigo Verdi, 2016. "Analysts’ Forecasts and Asset Pricing: A Survey," Annual Review of Financial Economics, Annual Reviews, vol. 8(1), pages 197-219, October.
    7. Fama, Eugene F. & French, Kenneth R., 2015. "A five-factor asset pricing model," Journal of Financial Economics, Elsevier, vol. 116(1), pages 1-22.
    8. Alejandro Lopez-Lira & Yuehua Tang, 2023. "Can ChatGPT Forecast Stock Price Movements? Return Predictability and Large Language Models," Papers 2304.07619, arXiv.org, revised Sep 2024.
    9. Ou, Jane A. & Penman, Stephen H., 1989. "Financial statement analysis and the prediction of stock returns," Journal of Accounting and Economics, Elsevier, vol. 11(4), pages 295-329, November.
    10. Elizabeth Blankespoor & Ed Dehaan & John Wertz & Christina Zhu, 2019. "Why Do Individual Investors Disregard Accounting Information? The Roles of Information Awareness and Acquisition Costs," Journal of Accounting Research, Wiley Blackwell, vol. 57(1), pages 53-84, March.
    11. Ray Ball & Gil Sadka & Ayung Tseng, 2022. "Correction to: using accounting earnings and aggregate economic indicators to estimate firm-level systematic risk," Review of Accounting Studies, Springer, vol. 27(2), pages 647-648, June.
    12. Ball, R & Foster, G, 1982. "Corporate Financial-Reporting - A Methodological Review Of Empirical-Research - Reply," Journal of Accounting Research, Wiley Blackwell, vol. 20, pages 245-248.
    13. Ball, Ray & Kothari, S. P. & Robin, Ashok, 2000. "The effect of international institutional factors on properties of accounting earnings," Journal of Accounting and Economics, Elsevier, vol. 29(1), pages 1-51, February.
    14. Bouwman, Marinus J. & Frishkoff, Patricia A. & Frishkoff, Paul, 1987. "How do financial analysts make decisions? A process model of the investment screening decision," Accounting, Organizations and Society, Elsevier, vol. 12(1), pages 1-29, January.
    15. Basu, Sudipta & Markov, Stanimir, 2004. "Loss function assumptions in rational expectations tests on financial analysts' earnings forecasts," Journal of Accounting and Economics, Elsevier, vol. 38(1), pages 171-203, December.
    16. Xi Chen & Yang Ha (Tony) Cho & Yiwei Dou & Baruch Lev, 2022. "Predicting Future Earnings Changes Using Machine Learning and Detailed Financial Data," Journal of Accounting Research, Wiley Blackwell, vol. 60(2), pages 467-515, May.
    17. Paul Glasserman & Caden Lin, 2023. "Assessing Look-Ahead Bias in Stock Return Predictions Generated By GPT Sentiment Analysis," Papers 2309.17322, arXiv.org.
    18. Ball, R & Foster, G, 1982. "Corporate Financial-Reporting - A Methodological Review Of Empirical-Research," Journal of Accounting Research, Wiley Blackwell, vol. 20, pages 161-234.
    19. De Bondt, Werner F M & Thaler, Richard H, 1990. "Do Security Analysts Overreact?," American Economic Review, American Economic Association, vol. 80(2), pages 52-57, May.
    20. Pedro Bordalo & Nicola Gennaioli & Rafael La Porta & Andrei Shleifer, 2019. "Diagnostic Expectations and Stock Returns," Journal of Finance, American Finance Association, vol. 74(6), pages 2839-2874, December.
    21. Miao Liu, 2022. "Assessing Human Information Processing in Lending Decisions: A Machine Learning Approach," Journal of Accounting Research, Wiley Blackwell, vol. 60(2), pages 607-651, May.
    22. Cao, Sean & Jiang, Wei & Wang, Junbo & Yang, Baozhong, 2024. "From Man vs. Machine to Man + Machine: The art and AI of stock analyses," Journal of Financial Economics, Elsevier, vol. 160(C).
    23. Stephen H. Penman & Theodore Sougiannis, 1998. "A Comparison of Dividend, Cash Flow, and Earnings Approaches to Equity Valuation," Contemporary Accounting Research, John Wiley & Sons, vol. 15(3), pages 343-383, September.
    24. Amy P. Hutton & Lian Fen Lee & Susan Z. Shu, 2012. "Do Managers Always Know Better? The Relative Accuracy of Management and Analyst Forecasts," Journal of Accounting Research, Wiley Blackwell, vol. 50(5), pages 1217-1244, December.
    25. Abarbanell, Jeffrey S & Bernard, Victor L, 1992. "Tests of Analysts' Overreaction/Underreaction to Earnings Information as an Explanation for Anomalous Stock Price Behavior," Journal of Finance, American Finance Association, vol. 47(3), pages 1181-1207, July.
    26. Jeremiah Green & John R. M. Hand & X. Frank Zhang, 2016. "Errors and questionable judgments in analysts’ DCF models," Review of Accounting Studies, Springer, vol. 21(2), pages 596-632, June.
    27. Ashbaugh-Skaife, Hollis & Collins, Daniel W. & Kinney Jr., William R., 2007. "The discovery and reporting of internal control deficiencies prior to SOX-mandated audits," Journal of Accounting and Economics, Elsevier, vol. 44(1-2), pages 166-192, September.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Caterina Giannetti & Maria Saveria Mavillonio, 2024. "Crowdfunding Success: Human Insights vs Algorithmic Textual Extraction," Discussion Papers 2024/315, Dipartimento di Economia e Management (DEM), University of Pisa, Pisa, Italy.
    2. Yuzhe Yang & Yifei Zhang & Yan Hu & Yilin Guo & Ruoli Gan & Yueru He & Mingcong Lei & Xiao Zhang & Haining Wang & Qianqian Xie & Jimin Huang & Honghai Yu & Benyou Wang, 2024. "UCFE: A User-Centric Financial Expertise Benchmark for Large Language Models," Papers 2410.14059, arXiv.org, revised Oct 2024.
    3. Xuewen Han & Neng Wang & Shangkun Che & Hongyang Yang & Kunpeng Zhang & Sean Xin Xu, 2024. "Enhancing Investment Analysis: Optimizing AI-Agent Collaboration in Financial Research," Papers 2411.04788, arXiv.org.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Kothari, S. P., 2001. "Capital markets research in accounting," Journal of Accounting and Economics, Elsevier, vol. 31(1-3), pages 105-231, September.
    2. Monika Chopra & Abhishek Miglani, 2018. "Do stock markets acceptably reflect earnings manipulation? Analysis of Indian manufacturing firms," DECISION: Official Journal of the Indian Institute of Management Calcutta, Springer;Indian Institute of Management Calcutta, vol. 45(3), pages 271-280, September.
    3. Dechow, Patricia & Ge, Weili & Schrand, Catherine, 2010. "Understanding earnings quality: A review of the proxies, their determinants and their consequences," Journal of Accounting and Economics, Elsevier, vol. 50(2-3), pages 344-401, December.
    4. Cao, Sean & Jiang, Wei & Wang, Junbo & Yang, Baozhong, 2024. "From Man vs. Machine to Man + Machine: The art and AI of stock analyses," Journal of Financial Economics, Elsevier, vol. 160(C).
    5. Geertsema, Paul & Lu, Helen, 2020. "The correlation structure of anomaly strategies," Journal of Banking & Finance, Elsevier, vol. 119(C).
    6. Lu Zhang, 2017. "The Investment CAPM," European Financial Management, European Financial Management Association, vol. 23(4), pages 545-603, September.
    7. Cong, Lin William & George, Nathan Darden & Wang, Guojun, 2023. "RIM-based value premium and factor pricing using value-price divergence," Journal of Banking & Finance, Elsevier, vol. 149(C).
    8. Jiaju Miao & Pawel Polak, 2023. "Online Ensemble of Models for Optimal Predictive Performance with Applications to Sector Rotation Strategy," Papers 2304.09947, arXiv.org.
    9. Tran, Vu Le, 2023. "Sentiment and covariance characteristics," International Review of Financial Analysis, Elsevier, vol. 86(C).
    10. Doron Avramov & Guy Kaplanski & Avanidhar Subrahmanyam, 2022. "Postfundamentals Price Drift in Capital Markets: A Regression Regularization Perspective," Management Science, INFORMS, vol. 68(10), pages 7658-7681, October.
    11. De Nard, Gianluca & Zhao, Zhao, 2022. "A large-dimensional test for cross-sectional anomalies:Efficient sorting revisited," International Review of Economics & Finance, Elsevier, vol. 80(C), pages 654-676.
    12. Balakrishnan, Karthik & Shivakumar, Lakshmanan & Taori, Peeyush, 2021. "Analysts’ estimates of the cost of equity capital," Journal of Accounting and Economics, Elsevier, vol. 71(2).
    13. S. P. Kothari & Charles Wasley, 2019. "Commemorating the 50‐Year Anniversary of Ball and Brown (1968): The Evolution of Capital Market Research over the Past 50 Years," Journal of Accounting Research, Wiley Blackwell, vol. 57(5), pages 1117-1159, December.
    14. Le, Cao Hoang Anh & Shan, Yaowen & Taylor, Stephen, 2024. "International economic policy uncertainty and analysts' earnings forecasts," Pacific-Basin Finance Journal, Elsevier, vol. 85(C).
    15. Krauss, Christopher & Krüger, Tom & Beerstecher, Daniel, 2015. "The Piotroski F-Score: A fundamental value strategy revisited from an investor's perspective," FAU Discussion Papers in Economics 13/2015, Friedrich-Alexander University Erlangen-Nuremberg, Institute for Economics.
    16. Hoang, Khoa & Cannavan, Damien & Gaunt, Clive & Huang, Ronghong, 2019. "Is that factor just lucky? Australian evidence," Pacific-Basin Finance Journal, Elsevier, vol. 57(C).
    17. Weichuan Deng & Pawel Polak & Abolfazl Safikhani & Ronakdilip Shah, 2023. "A Unified Framework for Fast Large-Scale Portfolio Optimization," Papers 2303.12751, arXiv.org, revised Nov 2023.
    18. H. J. Turtle & Kainan Wang, 2017. "The Value In Fundamental Accounting Information," Journal of Financial Research, Southern Finance Association;Southwestern Finance Association, vol. 40(1), pages 113-140, March.
    19. Hediger, Simon & Michel, Loris & Näf, Jeffrey, 2022. "On the use of random forest for two-sample testing," Computational Statistics & Data Analysis, Elsevier, vol. 170(C).
    20. Edward Li & Zhiyuan Tu & Dexin Zhou, 2024. "The Promise and Peril of Generative AI: Evidence from GPT-4 as Sell-Side Analysts," Papers 2412.01069, arXiv.org.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2407.17866. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.