IDEAS home Printed from https://ideas.repec.org/a/bla/joares/v60y2022i2p467-515.html
   My bibliography  Save this article

Predicting Future Earnings Changes Using Machine Learning and Detailed Financial Data

Author

Listed:
  • XI CHEN
  • YANG HA (TONY) CHO
  • YIWEI DOU
  • BARUCH LEV

Abstract

We use machine learning methods and high‐dimensional detailed financial data to predict the direction of one‐year‐ahead earnings changes. Our models show significant out‐of‐sample predictive power: the area under the receiver operating characteristics curve ranges from 67.52% to 68.66%, significantly higher than the 50% of a random guess. The annual size‐adjusted returns to hedge portfolios formed based on the prediction of our models range from 5.02% to 9.74%. Our models outperform two conventional models that use logistic regressions and small sets of accounting variables, and professional analysts’ forecasts. Analyses suggest that the outperformance relative to the conventional models stems from both nonlinear predictor interactions missed by regressions and the use of more detailed financial data by machine learning.

Suggested Citation

  • Xi Chen & Yang Ha (Tony) Cho & Yiwei Dou & Baruch Lev, 2022. "Predicting Future Earnings Changes Using Machine Learning and Detailed Financial Data," Journal of Accounting Research, Wiley Blackwell, vol. 60(2), pages 467-515, May.
  • Handle: RePEc:bla:joares:v:60:y:2022:i:2:p:467-515
    DOI: 10.1111/1475-679X.12429
    as

    Download full text from publisher

    File URL: https://doi.org/10.1111/1475-679X.12429
    Download Restriction: no

    File URL: https://libkey.io/10.1111/1475-679X.12429?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Frankel, Richard & Jennings, Jared & Lee, Joshua, 2016. "Using unstructured and qualitative disclosures to explain accruals," Journal of Accounting and Economics, Elsevier, vol. 62(2), pages 209-227.
    2. Yang Bao & Bin Ke & Bin Li & Y. Julia Yu & Jie Zhang, 2020. "Detecting Accounting Fraud in Publicly Traded U.S. Firms Using a Machine Learning Approach," Journal of Accounting Research, Wiley Blackwell, vol. 58(1), pages 199-235, March.
    3. Holthausen, Robert W. & Larcker, David F., 1992. "The prediction of stock returns using financial statement information," Journal of Accounting and Economics, Elsevier, vol. 15(2-3), pages 373-411, August.
    4. Shihao Gu & Bryan Kelly & Dacheng Xiu, 2020. "Empirical Asset Pricing via Machine Learning," Review of Finance, European Finance Association, vol. 33(5), pages 2223-2273.
    5. Shihao Gu & Bryan Kelly & Dacheng Xiu, 2020. "Empirical Asset Pricing via Machine Learning," The Review of Financial Studies, Society for Financial Studies, vol. 33(5), pages 2223-2273.
    6. Sendhil Mullainathan & Jann Spiess, 2017. "Machine Learning: An Applied Econometric Approach," Journal of Economic Perspectives, American Economic Association, vol. 31(2), pages 87-106, Spring.
    7. Kothari, S. P., 2001. "Capital markets research in accounting," Journal of Accounting and Economics, Elsevier, vol. 31(1-3), pages 105-231, September.
    8. Kewei Hou & Chen Xue & Lu Zhang, 2020. "Replicating Anomalies," The Review of Financial Studies, Society for Financial Studies, vol. 33(5), pages 2019-2133.
    9. Elizabeth Blankespoor, 2019. "The Impact of Information Processing Costs on Firm Disclosure Choice: Evidence from the XBRL Mandate," Journal of Accounting Research, Wiley Blackwell, vol. 57(4), pages 919-967, September.
    10. Ou, Jane A. & Penman, Stephen H., 1989. "Financial statement analysis and the prediction of stock returns," Journal of Accounting and Economics, Elsevier, vol. 11(4), pages 295-329, November.
    11. Jap Efendi & Jin Dong Park & Chandra Subramaniam, 2016. "Does the XBRL Reporting Format Provide Incremental Information Value? A Study Using XBRL Disclosures During the Voluntary Filing Program," Abacus, Accounting Foundation, University of Sydney, vol. 52(2), pages 259-285, June.
    12. Jeremy Bertomeu & Edwige Cheynel & Eric Floyd & Wenqiang Pan, 2021. "Using machine learning to detect misstatements," Review of Accounting Studies, Springer, vol. 26(2), pages 468-519, June.
    13. Jacob Thomas & Frank X. Zhang, 2011. "Tax Expense Momentum," Journal of Accounting Research, Wiley Blackwell, vol. 49(3), pages 791-821, June.
    14. Richardson, Scott & Tuna, Irem & Wysocki, Peter, 2010. "Accounting anomalies and fundamental analysis: A review of recent research advances," Journal of Accounting and Economics, Elsevier, vol. 50(2-3), pages 410-454, December.
    15. Tyler Shumway & Vincent A. Warther, 1999. "The Delisting Bias in CRSP's Nasdaq Data and Its Implications for the Size Effect," Journal of Finance, American Finance Association, vol. 54(6), pages 2361-2379, December.
    16. Joseph D. Piotroski & Eric C. So, 2012. "Identifying Expectation Errors in Value/Glamour Strategies: A Fundamental Analysis Approach," The Review of Financial Studies, Society for Financial Studies, vol. 25(9), pages 2841-2875.
    17. Debreceny, Roger & Farewell, Stephanie & Piechocki, Maciej & Felden, Carsten & Gräning, André, 2010. "Does it add up? Early evidence on the data quality of XBRL filings to the SEC," Journal of Accounting and Public Policy, Elsevier, vol. 29(3), pages 296-306, June.
    18. Hal R. Varian, 2014. "Big Data: New Tricks for Econometrics," Journal of Economic Perspectives, American Economic Association, vol. 28(2), pages 3-28, Spring.
    19. Ou, Ja, 1990. "The Information-Content Of Nonearnings Accounting Numbers As Earnings Predictors," Journal of Accounting Research, Wiley Blackwell, vol. 28(1), pages 144-163.
    20. Shumway, Tyler, 1997. "The Delisting Bias in CRSP Data," Journal of Finance, American Finance Association, vol. 52(1), pages 327-340, March.
    21. Kevin Li & Partha Mohanram, 2019. "Fundamental Analysis: Combining the Search for Quality with the Search for Value†," Contemporary Accounting Research, John Wiley & Sons, vol. 36(3), pages 1263-1298, September.
    22. Kexing Ding & Baruch Lev & Xuan Peng & Ting Sun & Miklos A. Vasarhelyi, 2020. "Machine learning improves accounting estimates: evidence from insurance payments," Review of Accounting Studies, Springer, vol. 25(3), pages 1098-1134, September.
    23. Feng Li, 2010. "The Information Content of Forward‐Looking Statements in Corporate Filings—A Naïve Bayesian Machine Learning Approach," Journal of Accounting Research, Wiley Blackwell, vol. 48(5), pages 1049-1102, December.
    24. Monahan, Steven J., 2018. "Financial Statement Analysis and Earnings Forecasting," Foundations and Trends(R) in Accounting, now publishers, vol. 12(2), pages 105-215, July.
    25. Brown, Lawrence D., 1996. "Influential accounting articles, individuals, Ph.D. granting institutions and faculties: A citational analysis," Accounting, Organizations and Society, Elsevier, vol. 21(7-8), pages 723-754.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Cao, Sean Shun & Jiang, Wei & Lei, Lijun (Gillian) & Zhou, Qing (Clara), 2024. "Applied AI for finance and accounting: Alternative data and opportunities," Pacific-Basin Finance Journal, Elsevier, vol. 84(C).
    2. Zhou, Ying & Li, Haoran & Xiao, Zhi & Qiu, Jing, 2023. "A user-centered explainable artificial intelligence approach for financial fraud detection," Finance Research Letters, Elsevier, vol. 58(PA).
    3. Hanauer, Matthias X. & Kalsbach, Tobias, 2023. "Machine learning and the cross-section of emerging market stock returns," Emerging Markets Review, Elsevier, vol. 55(C).
    4. Yuan Liao & Xinjie Ma & Andreas Neuhierl & Zhentao Shi, 2023. "Economic Forecasts Using Many Noises," Papers 2312.05593, arXiv.org, revised Dec 2023.
    5. Zhao, Qi & Xu, Weijun & Ji, Yucheng, 2023. "Predicting financial distress of Chinese listed companies using machine learning: To what extent does textual disclosure matter?," International Review of Financial Analysis, Elsevier, vol. 89(C).
    6. Alex Kim & Maximilian Muhn & Valeri Nikolaev, 2024. "Financial Statement Analysis with Large Language Models," Papers 2407.17866, arXiv.org.
    7. Ken Li, 2024. "Liquidity ratios and corporate failures," Accounting and Finance, Accounting and Finance Association of Australia and New Zealand, vol. 64(1), pages 1111-1134, March.
    8. Dichev, Ilia & Huang, Xinyi & Lee, Donald K.K & Zhao, Jianxin, 2023. "You have a point - but a point is not enough: The case for distributional forecasts of earnings," SocArXiv 4b2y8, Center for Open Science.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Tran, Vu Le, 2023. "Sentiment and covariance characteristics," International Review of Financial Analysis, Elsevier, vol. 86(C).
    2. Blankespoor, Elizabeth & deHaan, Ed & Marinovic, Iván, 2020. "Disclosure processing costs, investors’ information choice, and equity market outcomes: A review," Journal of Accounting and Economics, Elsevier, vol. 70(2).
    3. De Nard, Gianluca & Zhao, Zhao, 2022. "A large-dimensional test for cross-sectional anomalies:Efficient sorting revisited," International Review of Economics & Finance, Elsevier, vol. 80(C), pages 654-676.
    4. Bui, Dien Giau & Kong, De-Rong & Lin, Chih-Yung & Lin, Tse-Chun, 2023. "Momentum in machine learning: Evidence from the Taiwan stock market," Pacific-Basin Finance Journal, Elsevier, vol. 82(C).
    5. Miao Liu, 2022. "Assessing Human Information Processing in Lending Decisions: A Machine Learning Approach," Journal of Accounting Research, Wiley Blackwell, vol. 60(2), pages 607-651, May.
    6. Hoang, Daniel & Wiegratz, Kevin, 2022. "Machine learning methods in finance: Recent applications and prospects," Working Paper Series in Economics 158, Karlsruhe Institute of Technology (KIT), Department of Economics and Management.
    7. Geertsema, Paul & Lu, Helen, 2020. "The correlation structure of anomaly strategies," Journal of Banking & Finance, Elsevier, vol. 119(C).
    8. Jiaju Miao & Pawel Polak, 2023. "Online Ensemble of Models for Optimal Predictive Performance with Applications to Sector Rotation Strategy," Papers 2304.09947, arXiv.org.
    9. Rubesam, Alexandre, 2022. "Machine learning portfolios with equal risk contributions: Evidence from the Brazilian market," Emerging Markets Review, Elsevier, vol. 51(PB).
    10. Doron Avramov & Guy Kaplanski & Avanidhar Subrahmanyam, 2022. "Postfundamentals Price Drift in Capital Markets: A Regression Regularization Perspective," Management Science, INFORMS, vol. 68(10), pages 7658-7681, October.
    11. Jun, So Young & Kim, Dong Sung & Jung, Suk Yoon & Jun, Sang Gyung & Kim, Jong Woo, 2022. "Stock investment strategy combining earnings power index and machine learning," International Journal of Accounting Information Systems, Elsevier, vol. 47(C).
    12. Hanauer, Matthias X. & Kononova, Marina & Rapp, Marc Steffen, 2022. "Boosting agnostic fundamental analysis: Using machine learning to identify mispricing in European stock markets," Finance Research Letters, Elsevier, vol. 48(C).
    13. Yao, Haixiang & Xia, Shenghao & Liu, Hao, 2022. "Six-factor asset pricing and portfolio investment via deep learning: Evidence from Chinese stock market," Pacific-Basin Finance Journal, Elsevier, vol. 76(C).
    14. Weichuan Deng & Pawel Polak & Abolfazl Safikhani & Ronakdilip Shah, 2023. "A Unified Framework for Fast Large-Scale Portfolio Optimization," Papers 2303.12751, arXiv.org, revised Nov 2023.
    15. Gianluca De Nard & Simon Hediger & Markus Leippold, 2022. "Subsampled factor models for asset pricing: The rise of Vasa," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 41(6), pages 1217-1247, September.
    16. Hediger, Simon & Michel, Loris & Näf, Jeffrey, 2022. "On the use of random forest for two-sample testing," Computational Statistics & Data Analysis, Elsevier, vol. 170(C).
    17. Bartram, Söhnke M. & Grinblatt, Mark, 2018. "Agnostic fundamental analysis works," Journal of Financial Economics, Elsevier, vol. 128(1), pages 125-147.
    18. Yan, Jingda & Yu, Jialin, 2023. "Cross-stock momentum and factor momentum," Journal of Financial Economics, Elsevier, vol. 150(2).
    19. Colak, Gonul & Fu, Mengchuan & Hasan, Iftekhar, 2022. "On modeling IPO failure risk," Economic Modelling, Elsevier, vol. 109(C).
    20. Paul Geertsema & Helen Lu, 2023. "Relative Valuation with Machine Learning," Journal of Accounting Research, Wiley Blackwell, vol. 61(1), pages 329-376, March.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:joares:v:60:y:2022:i:2:p:467-515. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: http://www.blackwellpublishing.com/journal.asp?ref=0021-8456 .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.