IDEAS home Printed from https://ideas.repec.org/a/eee/jfinec/v147y2023i3p525-549.html
   My bibliography  Save this article

The colour of finance words

Author

Listed:
  • García, Diego
  • Hu, Xiaowen
  • Rohrer, Maximilian

Abstract

Our paper relies on stock price reactions to colour words, in order to provide new dictionaries of positive and negative words in a finance context. We extend the machine learning algorithm of Taddy (2013), adding a cross-validation layer to avoid over-fitting. In head-to-head comparisons, our dictionaries outperform the standard bag-of-words approach (Loughran and McDonald, 2011) when predicting stock price movements out-of-sample. By comparing their composition, word-by-word, our method refines and expands the sentiment dictionaries in the literature. The breadth of our dictionaries and their ability to disambiguate words using bigrams both help to colour finance discourse better.

Suggested Citation

  • García, Diego & Hu, Xiaowen & Rohrer, Maximilian, 2023. "The colour of finance words," Journal of Financial Economics, Elsevier, vol. 147(3), pages 525-549.
  • Handle: RePEc:eee:jfinec:v:147:y:2023:i:3:p:525-549
    DOI: 10.1016/j.jfineco.2022.11.006
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0304405X22002422
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.jfineco.2022.11.006?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. David F. Larcker & Anastasia A. Zakolyukina, 2012. "Detecting Deceptive Discussions in Conference Calls," Journal of Accounting Research, Wiley Blackwell, vol. 50(2), pages 495-540, May.
    2. Hanley, Kathleen Weiss & Hoberg, Gerard, 2012. "Litigation risk, strategic disclosure and the underpricing of initial public offerings," Journal of Financial Economics, Elsevier, vol. 103(2), pages 235-254.
    3. Eugene F. Fama & Kenneth R. French, 2001. "Disappearing Dividends: Changing Firm Characteristics Or Lower Propensity To Pay?," Journal of Applied Corporate Finance, Morgan Stanley, vol. 14(1), pages 67-79, March.
    4. Sanjiv R. Das & Mike Y. Chen, 2007. "Yahoo! for Amazon: Sentiment Extraction from Small Talk on the Web," Management Science, INFORMS, vol. 53(9), pages 1375-1388, September.
    5. Stephen Hansen & Michael McMahon & Andrea Prat, 2018. "Transparency and Deliberation Within the FOMC: A Computational Linguistics Approach," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 133(2), pages 801-870.
    6. Glasserman, Paul & Mamaysky, Harry, 2019. "Does Unusual News Forecast Market Stress?," Journal of Financial and Quantitative Analysis, Cambridge University Press, vol. 54(5), pages 1937-1974, October.
    7. Scott R. Baker & Nicholas Bloom & Steven J. Davis, 2016. "Measuring Economic Policy Uncertainty," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 131(4), pages 1593-1636.
    8. Kathleen Weiss Hanley & Gerard Hoberg, 2019. "Dynamic Interpretation of Emerging Risks in the Financial Sector," The Review of Financial Studies, Society for Financial Studies, vol. 32(12), pages 4543-4603.
    9. Tarek A Hassan & Stephan Hollander & Laurence van Lent & Ahmed Tahoun, 2019. "Firm-Level Political Risk: Measurement and Effects," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 134(4), pages 2135-2202.
    10. Jegadeesh, Narasimhan & Wu, Di, 2013. "Word power: A new approach for content analysis," Journal of Financial Economics, Elsevier, vol. 110(3), pages 712-729.
    11. Gerard Hoberg & Gordon Phillips, 2016. "Text-Based Network Industries and Endogenous Product Differentiation," Journal of Political Economy, University of Chicago Press, vol. 124(5), pages 1423-1465.
    12. Tim Loughran & Bill McDonald, 2020. "Textual Analysis in Finance," Annual Review of Financial Economics, Annual Reviews, vol. 12(1), pages 357-375, December.
    13. Jason V. Chen & Venky Nagar & Jordan Schoenfeld, 2018. "Manager-analyst conversations in earnings conference calls," Review of Accounting Studies, Springer, vol. 23(4), pages 1315-1354, December.
    14. Fama, Eugene F & MacBeth, James D, 1973. "Risk, Return, and Equilibrium: Empirical Tests," Journal of Political Economy, University of Chicago Press, vol. 81(3), pages 607-636, May-June.
    15. Matt Taddy, 2013. "Multinomial Inverse Regression for Text Analysis," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 108(503), pages 755-770, September.
    16. David H. Solomon, 2012. "Selective Publicity and Stock Prices," Journal of Finance, American Finance Association, vol. 67(2), pages 599-638, April.
    17. Pierre Jinghong Liang & Vitaly Meursault & Bryan B. Routledge & Madeline Marco Scanlon, 2021. "PEAD.txt: Post-Earnings-Announcement Drift Using Text," Working Papers 21-07, Federal Reserve Bank of Philadelphia.
    18. Margaret Roberts & Brandon Stewart & Tingley, Dustin & Edoardo Airoldi, 2013. "The structural topic model and applied social science," Working Paper 132666, Harvard University OpenScholar.
    19. Frankel, R & Johnson, M & Skinner, DJ, 1999. "An empirical examination of conference calls as a voluntary disclosure medium," Journal of Accounting Research, Wiley Blackwell, vol. 37(1), pages 133-150.
    20. Volkan Muslu & Suresh Radhakrishnan & K. R. Subramanyam & Dongkuk Lim, 2015. "Forward-Looking MD&A Disclosures and the Information Environment," Management Science, INFORMS, vol. 61(5), pages 931-948, May.
    21. Manela, Asaf & Moreira, Alan, 2017. "News implied volatility and disaster concerns," Journal of Financial Economics, Elsevier, vol. 123(1), pages 137-162.
    22. Tim Loughran & Bill Mcdonald, 2016. "Textual Analysis in Accounting and Finance: A Survey," Journal of Accounting Research, Wiley Blackwell, vol. 54(4), pages 1187-1230, September.
    23. Hoberg, Gerard & Lewis, Craig, 2017. "Do fraudulent firms produce abnormal disclosure?," Journal of Corporate Finance, Elsevier, vol. 43(C), pages 58-85.
    24. Zheng Tracy Ke & Bryan T. Kelly & Dacheng Xiu, 2019. "Predicting Returns With Text Data," NBER Working Papers 26186, National Bureau of Economic Research, Inc.
    25. Paul C. Tetlock, 2007. "Giving Content to Investor Sentiment: The Role of Media in the Stock Market," Journal of Finance, American Finance Association, vol. 62(3), pages 1139-1168, June.
    26. repec:bla:jfinan:v:59:y:2004:i:3:p:1259-1294 is not listed on IDEAS
    27. García, Diego & Norli, Øyvind, 2012. "Geographic dispersion and stock returns," Journal of Financial Economics, Elsevier, vol. 106(3), pages 547-565.
    28. J. Anthony Cookson & Marina Niessner, 2020. "Why Don't We Agree? Evidence from a Social Network of Investors," Journal of Finance, American Finance Association, vol. 75(1), pages 173-228, February.
    29. Tim Loughran & Bill Mcdonald, 2011. "When Is a Liability Not a Liability? Textual Analysis, Dictionaries, and 10‐Ks," Journal of Finance, American Finance Association, vol. 66(1), pages 35-65, February.
    30. Anastassia Fedyk, 2021. "Disagreement after News: Gradual Information Diffusion or Differences of Opinion?," The Review of Asset Pricing Studies, Society for Financial Studies, vol. 11(3), pages 465-501.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Pan, Zhiyuan & Zhong, Hao & Wang, Yudong & Huang, Juan, 2024. "Forecasting oil futures returns with news," Energy Economics, Elsevier, vol. 134(C).
    2. Sudarshan Kumar & Sobhesh Kumar Agarwalla & Jayanth R. Varma & Vineet Virmani, 2023. "Harvesting the volatility smile in a large emerging market: A Dynamic Nelson–Siegel approach," Journal of Futures Markets, John Wiley & Sons, Ltd., vol. 43(11), pages 1615-1644, November.
    3. Agam Shah & Sudheer Chava, 2023. "Zero is Not Hero Yet: Benchmarking Zero-Shot Performance of LLMs for Financial Tasks," Papers 2305.16633, arXiv.org.
    4. Du, Rui & Mino, Ajkel & Wang, Jianghao & Zheng, Siqi, 2024. "Transboundary vegetation fire smoke and expressed sentiment: Evidence from Twitter," Journal of Environmental Economics and Management, Elsevier, vol. 124(C).
    5. Liu, Qigui & Chi, Wenqiang & Wang, Junyi, 2024. "How informative is question-and-answer similarity to financial analysts? Evidence from Chinese earnings communication conferences," Economic Modelling, Elsevier, vol. 135(C).
    6. Shuaiyu Chen & T. Clifton Green & Huseyin Gulen & Dexin Zhou, 2024. "What Does ChatGPT Make of Historical Stock Returns? Extrapolation and Miscalibration in LLM Stock Return Forecasts," Papers 2409.11540, arXiv.org.
    7. Kevin Benson & Ing-Haw Cheng & John Hull & Charles Martineau & Yoshio Nozawa & Vasily Strela & Yuntao Wu & Jun Yuan, 2024. "Understanding the Excess Bond Premium," Papers 2412.04063, arXiv.org.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Simon Fritzsch & Philipp Scharner & Gregor Weiß, 2021. "Estimating the relation between digitalization and the market value of insurers," Journal of Risk & Insurance, The American Risk and Insurance Association, vol. 88(3), pages 529-567, September.
    2. Faccini, Renato & Matin, Rastin & Skiadopoulos, George, 2023. "Dissecting climate risks: Are they reflected in stock prices?," Journal of Banking & Finance, Elsevier, vol. 155(C).
    3. Matthew Gentzkow & Bryan T. Kelly & Matt Taddy, 2017. "Text as Data," NBER Working Papers 23276, National Bureau of Economic Research, Inc.
    4. Andres Algaba & David Ardia & Keven Bluteau & Samuel Borms & Kris Boudt, 2020. "Econometrics Meets Sentiment: An Overview Of Methodology And Applications," Journal of Economic Surveys, Wiley Blackwell, vol. 34(3), pages 512-547, July.
    5. Kirtac, Kemal & Germano, Guido, 2024. "Sentiment trading with large language models," Finance Research Letters, Elsevier, vol. 62(PB).
    6. Alejandro Lopez-Lira & Yuehua Tang, 2023. "Can ChatGPT Forecast Stock Price Movements? Return Predictability and Large Language Models," Papers 2304.07619, arXiv.org, revised Sep 2024.
    7. Paul E. Soto, 2021. "Breaking the Word Bank: Measurement and Effects of Bank Level Uncertainty," Journal of Financial Services Research, Springer;Western Finance Association, vol. 59(1), pages 1-45, April.
    8. Chris Florackis & Christodoulos Louca & Roni Michaely & Michael Weber, 2023. "Cybersecurity Risk," The Review of Financial Studies, Society for Financial Studies, vol. 36(1), pages 351-407.
    9. Marie Bessec & Julien Fouquau, 2024. "A Green Wave in Media: A Change of Tack in Stock Markets," Oxford Bulletin of Economics and Statistics, Department of Economics, University of Oxford, vol. 86(5), pages 1026-1057, October.
    10. Hansen, Stephen & Davis, Steven & Seminario-Amez, Cristhian, 2020. "Firm-level Risk Exposures and Stock Returns in the Wake of COVID-19," CEPR Discussion Papers 15314, C.E.P.R. Discussion Papers.
    11. Christina Bannier & Thomas Pauls & Andreas Walter, 2019. "Content analysis of business communication: introducing a German dictionary," Journal of Business Economics, Springer, vol. 89(1), pages 79-123, February.
    12. Li, Ken, 2022. "Textual fundamentals in earnings press releases," Advances in accounting, Elsevier, vol. 57(C).
    13. Jeon, Yoontae & McCurdy, Thomas H. & Zhao, Xiaofei, 2022. "News as sources of jumps in stock returns: Evidence from 21 million news articles for 9000 companies," Journal of Financial Economics, Elsevier, vol. 145(2), pages 1-17.
    14. Tim Loughran & Bill Mcdonald, 2016. "Textual Analysis in Accounting and Finance: A Survey," Journal of Accounting Research, Wiley Blackwell, vol. 54(4), pages 1187-1230, September.
    15. Pan, Zhiyuan & Zhong, Hao & Wang, Yudong & Huang, Juan, 2024. "Forecasting oil futures returns with news," Energy Economics, Elsevier, vol. 134(C).
    16. Yi Yang & Kunpeng Zhang & Yangyang Fan, 2022. "Analyzing Firm Reports for Volatility Prediction: A Knowledge-Driven Text-Embedding Approach," INFORMS Journal on Computing, INFORMS, vol. 34(1), pages 522-540, January.
    17. Tengfei Zhang, 2020. "Manager Uncertainty and Cross-Sectional Stock Returns," 2020 Papers pzh934, Job Market Papers.
    18. Daniele Ballinari & Simon Behrendt, 2021. "How to gauge investor behavior? A comparison of online investor sentiment measures," Digital Finance, Springer, vol. 3(2), pages 169-204, June.
    19. Aysan, Ahmet Faruk & Caporin, Massimiliano & Cepni, Oguzhan, 2024. "Not all words are equal: Sentiment and jumps in the cryptocurrency market," Journal of International Financial Markets, Institutions and Money, Elsevier, vol. 91(C).
    20. Zheng, Hannan & Schwenkler, Gustavo, 2020. "The network of firms implied by the news," ESRB Working Paper Series 108, European Systemic Risk Board.

    More about this item

    Keywords

    Measuring sentiment; Machine learning; Earnings calls; 10-Ks; WSJ;
    All these keywords.

    JEL classification:

    • D82 - Microeconomics - - Information, Knowledge, and Uncertainty - - - Asymmetric and Private Information; Mechanism Design
    • G14 - Financial Economics - - General Financial Markets - - - Information and Market Efficiency; Event Studies; Insider Trading

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:jfinec:v:147:y:2023:i:3:p:525-549. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/inca/505576 .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.