Generating long-horizon stock "buy" signals with a neural language model

Author

Joel R. Bock

Abstract

This paper describes experiments on fine-tuning a small language model to forecast long-horizon stock price movements. The model's inputs are narrative text from the 10-K reports of large-capitalization companies in the S&P 500 index; its output is a forward-looking buy or sell decision. Price direction is predicted at discrete horizons up to 12 months after the report filing date. The results demonstrate good out-of-sample statistical performance (macro-averaged F1 = 0.62) at medium to long investment horizons. In particular, the buy signals generated from 10-K text are most precise at horizons of 6 and 9 months. As measured by the F1 score, the buy signal improves on a random stock selection model by between 4.8 and 9 percent. In contrast, the sell signals generated by the models do not perform well. This may be attributable to the highly imbalanced out-of-sample data, or perhaps to management drafting annual reports with a bias toward positive language. Cross-sectional analysis of performance by economic sector suggests that idiosyncratic reporting styles within industries are correlated with varying degrees and time scales of price movement predictability.
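The abstract reports a macro-averaged F1 score and a comparison against random stock selection. The following Python sketch illustrates how such a score could be computed for a two-class buy/sell problem and compared with a random baseline. It is not the author's code: the function names, the toy labels, the uniform random baseline, and the relative-improvement formula are all assumptions made for illustration, and the abstract does not state how its baseline or its percentage improvement is defined.

```python
import random


def f1_for_class(y_true, y_pred, label):
    """F1 score for one class, treating `label` as the positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == label and p == label)
    fp = sum(1 for t, p in pairs if t != label and p == label)
    fn = sum(1 for t, p in pairs if t == label and p != label)
    if tp == 0:
        return 0.0  # no true positives: define F1 as zero
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def macro_f1(y_true, y_pred, labels=("buy", "sell")):
    """Unweighted mean of the per-class F1 scores."""
    return sum(f1_for_class(y_true, y_pred, c) for c in labels) / len(labels)


def random_baseline(y_true, labels=("buy", "sell"), seed=0):
    """Hypothetical baseline: pick a label uniformly at random per sample."""
    rng = random.Random(seed)
    return [rng.choice(labels) for _ in y_true]


def relative_improvement(model_score, baseline_score):
    """One possible reading of 'percent improvement' (an assumption)."""
    return (model_score - baseline_score) / baseline_score


# Toy, imbalanced labels (invented for illustration; not data from the paper).
y_true = ["buy"] * 8 + ["sell"] * 2
y_pred = ["buy"] * 7 + ["sell"] + ["buy", "sell"]

model_buy_f1 = f1_for_class(y_true, y_pred, "buy")
model_macro_f1 = macro_f1(y_true, y_pred)
baseline_pred = random_baseline(y_true)
baseline_buy_f1 = f1_for_class(y_true, baseline_pred, "buy")

print(f"buy F1 (model):    {model_buy_f1:.4f}")
print(f"macro F1 (model):  {model_macro_f1:.4f}")
print(f"buy F1 (baseline): {baseline_buy_f1:.4f}")
```

Because macro averaging weights both classes equally, a weak minority class (here, "sell") pulls the macro score well below the buy-class F1, which matches the abstract's observation that imbalanced out-of-sample data may hurt sell-signal performance.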

Suggested Citation

Joel R. Bock, 2024. "Generating long-horizon stock 'buy' signals with a neural language model," Papers 2410.18988, arXiv.org.

Handle: RePEc:arx:papers:2410.18988

References listed on IDEAS

Oliver Boguth & Murray Carlson & Adlai Fisher & Mikhail Simutin, 2016. "Horizon Effects in Average Returns: The Role of Slow Information Diffusion," The Review of Financial Studies, Society for Financial Studies, vol. 29(8), pages 2241-2281.
Hanshuang Tong & Jun Li & Ning Wu & Ming Gong & Dongmei Zhang & Qi Zhang, 2024. "Ploutos: Towards interpretable stock movement prediction with financial large language model," Papers 2403.00782, arXiv.org.
Yinheng Li & Shaofei Wang & Han Ding & Hang Chen, 2023. "Large Language Models in Finance: A Survey," Papers 2311.10723, arXiv.org, revised Jul 2024.
Udit Gupta, 2023. "GPT-InvestAR: Enhancing Stock Investment Strategies through Annual Report Analysis with Large Language Models," Papers 2309.03079, arXiv.org.
Mehran Azimi & Anup Agrawal, 2021. "Is Positive Sentiment in Corporate Annual Reports Informative? Evidence from Deep Learning [Cash holdings and credit risk]," The Review of Asset Pricing Studies, Society for Financial Studies, vol. 11(4), pages 762-805.
Georgios Fatouros & Konstantinos Metaxas & John Soldatos & Dimosthenis Kyriazis, 2024. "Can Large Language Models Beat Wall Street? Unveiling the Potential of AI in Stock Selection," Papers 2401.03737, arXiv.org, revised Apr 2024.
Yuqi Nie & Yaxuan Kong & Xiaowen Dong & John M. Mulvey & H. Vincent Poor & Qingsong Wen & Stefan Zohren, 2024. "A Survey of Large Language Models for Financial Applications: Progress, Prospects and Challenges," Papers 2406.11903, arXiv.org.
Volkan Muslu & Suresh Radhakrishnan & K. R. Subramanyam & Dongkuk Lim, 2015. "Forward-Looking MD&A Disclosures and the Information Environment," Management Science, INFORMS, vol. 61(5), pages 931-948, May.
Stefan Pasch & Daniel Ehnes, 2022. "StonkBERT: Can Language Models Predict Medium-Run Stock Price Movements?," Papers 2202.02268, arXiv.org.

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Han Ding & Yinheng Li & Junhao Wang & Hang Chen, 2024. "Large Language Model Agent in Financial Trading: A Survey," Papers 2408.06361, arXiv.org.
Yuzhe Yang & Yifei Zhang & Yan Hu & Yilin Guo & Ruoli Gan & Yueru He & Mingcong Lei & Xiao Zhang & Haining Wang & Qianqian Xie & Jimin Huang & Honghai Yu & Benyou Wang, 2024. "UCFE: A User-Centric Financial Expertise Benchmark for Large Language Models," Papers 2410.14059, arXiv.org, revised Feb 2025.
Semenov, Andrei, 2021. "Measuring the stock's factor beta and identifying risk factors under market inefficiency," The Quarterly Review of Economics and Finance, Elsevier, vol. 80(C), pages 635-649.
Chenglu Jin & Thomas Conlon & John Cotter, 2023. "Co-Skewness across Return Horizons," Journal of Financial Econometrics, Oxford University Press, vol. 21(5), pages 1483-1518.
- Thomas Conlon & John Cotter & Chenglu Jin, 2019. "Co-skewness across Return Horizons," Working Papers 201910, Geary Institute, University College Dublin.
- Chenglu Jin & Thomas Conlon & John Cotter, 2022. "Co-skewness across Return Horizons," Working Papers 202210, Geary Institute, University College Dublin.
Dongshin Kim & Dongkuk Lim & Jonathan A. Wiley, 2023. "Narrative Investment-Risk Disclosure & REIT Investment," The Journal of Real Estate Finance and Economics, Springer, vol. 66(2), pages 542-567, February.
Sharifkhani, Ali & Simutin, Mikhail, 2021. "Feedback loops in industry trade networks and the term structure of momentum profits," Journal of Financial Economics, Elsevier, vol. 141(3), pages 1171-1187.
Allen H. Huang & Jianghua Shen & Amy Y. Zang, 2022. "The unintended benefit of the risk factor mandate of 2005," Review of Accounting Studies, Springer, vol. 27(4), pages 1319-1355, December.
Li, Yanqiong & Wang, Xiongyuan & He, Jie & Chan, Kam C., 2023. "Supply chain risk disclosure and seasoned equity offering discount," Pacific-Basin Finance Journal, Elsevier, vol. 82(C).
Deborah Miori & Constantin Petrov, 2023. "Narratives from GPT-derived Networks of News, and a link to Financial Markets Dislocations," Papers 2311.14419, arXiv.org.
Xuewen Han & Neng Wang & Shangkun Che & Hongyang Yang & Kunpeng Zhang & Sean Xin Xu, 2024. "Enhancing Investment Analysis: Optimizing AI-Agent Collaboration in Financial Research," Papers 2411.04788, arXiv.org.
Özgür Arslan‐Ayaydin & James Thewissen & Wouter Torsin, 2021. "Disclosure tone management and labor unions," Journal of Business Finance & Accounting, Wiley Blackwell, vol. 48(1-2), pages 102-147, January.
Kempf, Elisabeth & Spalt, Oliver G., 2020. "Attracting the Sharks: Corporate Innovation and Securities Class Action Lawsuits," CEPR Discussion Papers 14358, C.E.P.R. Discussion Papers.
Gustaf Bellstam & Sanjai Bhagat & J. Anthony Cookson, 2021. "A Text-Based Analysis of Corporate Innovation," Management Science, INFORMS, vol. 67(7), pages 4004-4031, July.
García, Diego & Hu, Xiaowen & Rohrer, Maximilian, 2023. "The colour of finance words," Journal of Financial Economics, Elsevier, vol. 147(3), pages 525-549.
Wang, Chunlan & Xin, Jianxuan & Sun, Fangfang & Shi, Yan & Du, Yuxuan, 2024. "The effects of manager sentiment in financial disclosure: Perspectives of operational efficiency and market reaction," Finance Research Letters, Elsevier, vol. 64(C).
Alejandro Bernales & Marcela Valenzuela & Ilknur Zer, 2023. "Effects of Information Overload on Financial Markets: How Much Is Too Much?," International Finance Discussion Papers 1372, Board of Governors of the Federal Reserve System (U.S.).
Jiang, Kangqi & Du, Xinyi & Chen, Zhongfei, 2022. "Firms' digitalization and stock price crash risk," International Review of Financial Analysis, Elsevier, vol. 82(C).
Liu, Qigui & Chi, Wenqiang & Wang, Junyi, 2024. "How informative is question-and-answer similarity to financial analysts? Evidence from Chinese earnings communication conferences," Economic Modelling, Elsevier, vol. 135(C).
Shimon Kogan & Vitaly Meursault, 2021. "Corporate Disclosure: Facts or Opinions?," Working Papers 21-40, Federal Reserve Bank of Philadelphia.
Arnav Grover, 2025. "FinRLlama: A Solution to LLM-Engineered Signals Challenge at FinRL Contest 2024," Papers 2502.01992, arXiv.org.

More about this item

NEP fields

This paper has been announced in the following NEP Reports:

NEP-BIG-2024-12-02 (Big Data)
