IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2410.18988.html
   My bibliography  Save this paper

Generating long-horizon stock "buy" signals with a neural language model

Author

Listed:
  • Joel R. Bock

Abstract

This paper describes experiments on fine-tuning a small language model to generate forecasts of long-horizon stock price movements. Inputs to the model are narrative text from 10-K reports of large market capitalization companies in the S&P 500 index; the output is a forward-looking buy or sell decision. Price direction is predicted at discrete horizons up to 12 months after the report filing date. The results reported here demonstrate good out-of-sample statistical performance (F1-macro= 0.62) at medium to long investment horizons. In particular, the buy signals generated from 10-K text are found most precise at 6 and 9 months in the future. As measured by the F1 score, the buy signal provides between 4.8 and 9 percent improvement against a random stock selection model. In contrast, sell signals generated by the models do not perform well. This may be attributed to the highly imbalanced out-of-sample data, or perhaps due to management drafting annual reports with a bias toward positive language. Cross-sectional analysis of performance by economic sector suggests that idiosyncratic reporting styles within industries are correlated with varying degrees and time scales of price movement predictability.

Suggested Citation

  • Joel R. Bock, 2024. "Generating long-horizon stock "buy" signals with a neural language model," Papers 2410.18988, arXiv.org.
  • Handle: RePEc:arx:papers:2410.18988
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2410.18988
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Oliver Boguth & Murray Carlson & Adlai Fisher & Mikhail Simutin, 2016. "Horizon Effects in Average Returns: The Role of Slow Information Diffusion," The Review of Financial Studies, Society for Financial Studies, vol. 29(8), pages 2241-2281.
    2. Hanshuang Tong & Jun Li & Ning Wu & Ming Gong & Dongmei Zhang & Qi Zhang, 2024. "Ploutos: Towards interpretable stock movement prediction with financial large language model," Papers 2403.00782, arXiv.org.
    3. Yinheng Li & Shaofei Wang & Han Ding & Hang Chen, 2023. "Large Language Models in Finance: A Survey," Papers 2311.10723, arXiv.org, revised Jul 2024.
    4. Udit Gupta, 2023. "GPT-InvestAR: Enhancing Stock Investment Strategies through Annual Report Analysis with Large Language Models," Papers 2309.03079, arXiv.org.
    5. Mehran Azimi & Anup Agrawal, 2021. "Is Positive Sentiment in Corporate Annual Reports Informative? Evidence from Deep Learning [Cash holdings and credit risk]," The Review of Asset Pricing Studies, Society for Financial Studies, vol. 11(4), pages 762-805.
    6. Georgios Fatouros & Konstantinos Metaxas & John Soldatos & Dimosthenis Kyriazis, 2024. "Can Large Language Models Beat Wall Street? Unveiling the Potential of AI in Stock Selection," Papers 2401.03737, arXiv.org, revised Apr 2024.
    7. Yuqi Nie & Yaxuan Kong & Xiaowen Dong & John M. Mulvey & H. Vincent Poor & Qingsong Wen & Stefan Zohren, 2024. "A Survey of Large Language Models for Financial Applications: Progress, Prospects and Challenges," Papers 2406.11903, arXiv.org.
    8. Volkan Muslu & Suresh Radhakrishnan & K. R. Subramanyam & Dongkuk Lim, 2015. "Forward-Looking MD&A Disclosures and the Information Environment," Management Science, INFORMS, vol. 61(5), pages 931-948, May.
    9. Stefan Pasch & Daniel Ehnes, 2022. "StonkBERT: Can Language Models Predict Medium-Run Stock Price Movements?," Papers 2202.02268, arXiv.org.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Han Ding & Yinheng Li & Junhao Wang & Hang Chen, 2024. "Large Language Model Agent in Financial Trading: A Survey," Papers 2408.06361, arXiv.org.
    2. Yuzhe Yang & Yifei Zhang & Yan Hu & Yilin Guo & Ruoli Gan & Yueru He & Mingcong Lei & Xiao Zhang & Haining Wang & Qianqian Xie & Jimin Huang & Honghai Yu & Benyou Wang, 2024. "UCFE: A User-Centric Financial Expertise Benchmark for Large Language Models," Papers 2410.14059, arXiv.org, revised Oct 2024.
    3. Semenov, Andrei, 2021. "Measuring the stock's factor beta and identifying risk factors under market inefficiency," The Quarterly Review of Economics and Finance, Elsevier, vol. 80(C), pages 635-649.
    4. Chenglu Jin & Thomas Conlon & John Cotter, 2023. "Co-Skewness across Return Horizons," Journal of Financial Econometrics, Oxford University Press, vol. 21(5), pages 1483-1518.
    5. Sharifkhani, Ali & Simutin, Mikhail, 2021. "Feedback loops in industry trade networks and the term structure of momentum profits," Journal of Financial Economics, Elsevier, vol. 141(3), pages 1171-1187.
    6. Li, Yanqiong & Wang, Xiongyuan & He, Jie & Chan, Kam C., 2023. "Supply chain risk disclosure and seasoned equity offering discount," Pacific-Basin Finance Journal, Elsevier, vol. 82(C).
    7. Deborah Miori & Constantin Petrov, 2023. "Narratives from GPT-derived Networks of News, and a link to Financial Markets Dislocations," Papers 2311.14419, arXiv.org.
    8. Xuewen Han & Neng Wang & Shangkun Che & Hongyang Yang & Kunpeng Zhang & Sean Xin Xu, 2024. "Enhancing Investment Analysis: Optimizing AI-Agent Collaboration in Financial Research," Papers 2411.04788, arXiv.org.
    9. Özgür Arslan‐Ayaydin & James Thewissen & Wouter Torsin, 2021. "Disclosure tone management and labor unions," Journal of Business Finance & Accounting, Wiley Blackwell, vol. 48(1-2), pages 102-147, January.
    10. Kempf, Elisabeth & Spalt, Oliver G., 2020. "Attracting the Sharks: Corporate Innovation and Securities Class Action Lawsuits," CEPR Discussion Papers 14358, C.E.P.R. Discussion Papers.
    11. García, Diego & Hu, Xiaowen & Rohrer, Maximilian, 2023. "The colour of finance words," Journal of Financial Economics, Elsevier, vol. 147(3), pages 525-549.
    12. Wang, Chunlan & Xin, Jianxuan & Sun, Fangfang & Shi, Yan & Du, Yuxuan, 2024. "The effects of manager sentiment in financial disclosure: Perspectives of operational efficiency and market reaction," Finance Research Letters, Elsevier, vol. 64(C).
    13. Alejandro Bernales & Marcela Valenzuela & Ilknur Zer, 2023. "Effects of Information Overload on Financial Markets: How Much Is Too Much?," International Finance Discussion Papers 1372, Board of Governors of the Federal Reserve System (U.S.).
    14. Jiang, Kangqi & Du, Xinyi & Chen, Zhongfei, 2022. "Firms' digitalization and stock price crash risk," International Review of Financial Analysis, Elsevier, vol. 82(C).
    15. Liu, Qigui & Chi, Wenqiang & Wang, Junyi, 2024. "How informative is question-and-answer similarity to financial analysts? Evidence from Chinese earnings communication conferences," Economic Modelling, Elsevier, vol. 135(C).
    16. Shimon Kogan & Vitaly Meursault, 2021. "Corporate Disclosure: Facts or Opinions?," Working Papers 21-40, Federal Reserve Bank of Philadelphia.
    17. Bozanic, Zahn & Roulstone, Darren T. & Van Buskirk, Andrew, 2018. "Management earnings forecasts and other forward-looking statements," Journal of Accounting and Economics, Elsevier, vol. 65(1), pages 1-20.
    18. Sun, Xiaowei & Zheng, Tianyu & Wang, Zehao & Li, Peigong, 2024. "Risk factors disclosure and corporate philanthropy," International Review of Financial Analysis, Elsevier, vol. 93(C).
    19. Qianqian Xie & Dong Li & Mengxi Xiao & Zihao Jiang & Ruoyu Xiang & Xiao Zhang & Zhengyu Chen & Yueru He & Weiguang Han & Yuzhe Yang & Shunian Chen & Yifei Zhang & Lihang Shen & Daniel Kim & Zhiwei Liu, 2024. "Open-FinLLMs: Open Multimodal Large Language Models for Financial Applications," Papers 2408.11878, arXiv.org.
    20. Tianyu Zhou & Pinqiao Wang & Yilin Wu & Hongyang Yang, 2024. "FinRobot: AI Agent for Equity Research and Valuation with Large Language Models," Papers 2411.08804, arXiv.org.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2410.18988. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.