IDEAS home Printed from https://ideas.repec.org/a/wly/isacfm/v23y2016i3p215-239.html
   My bibliography  Save this article

Do Sentiments Matter in Fraud Detection? Estimating Semantic Orientation of Annual Reports

Author

Listed:
  • Sunita Goel
  • Ozlem Uzuner

Abstract

We present a novel approach for analysing the qualitative content of annual reports. Using natural language processing techniques we determine if sentiment expressed in the text matters in fraud detection. We focus on the Management Discussion and Analysis (MD&A) section of annual reports because of the nonfactual content present in this section, unlike other components of the annual reports. We measure the sentiment expressed in the text on the dimensions of polarity, subjectivity, and intensity and investigate in depth whether truthful and fraudulent MD&As differ in terms of sentiment polarity, sentiment subjectivity and sentiment intensity. Our results show that fraudulent MD&As on average contain three times more positive sentiment and four times more negative sentiment compared with truthful MD&As. This suggests that use of both positive and negative sentiment is more pronounced in fraudulent MD&As. We further find that, compared with truthful MD&As, fraudulent MD&As contain a greater proportion of subjective content than objective content. This suggests that the use of subjectivity clues such as presence of too many adjectives and adverbs could be an indicator of fraud. Clear cases of fraud show a higher intensity of sentiment exhibited by more use of adverbs in the “adverb modifying adjective” pattern. Based on the results of this study, frequent use of intensifiers, particularly in this pattern, could be another indicator of fraud. Moreover, the dimensions of subjectivity and intensity help in accurately classifying borderline examples of MD&As (that are equal in sentiment polarity) into fraudulent and truthful categories. When taken together, these findings suggest that fraudulent MD&As in contrast to truthful MD&As contain higher sentiment content. Copyright © 2016 John Wiley & Sons, Ltd.

Suggested Citation

  • Sunita Goel & Ozlem Uzuner, 2016. "Do Sentiments Matter in Fraud Detection? Estimating Semantic Orientation of Annual Reports," Intelligent Systems in Accounting, Finance and Management, John Wiley & Sons, Ltd., vol. 23(3), pages 215-239, July.
  • Handle: RePEc:wly:isacfm:v:23:y:2016:i:3:p:215-239
    DOI: 10.1002/isaf.1392
    as

    Download full text from publisher

    File URL: https://doi.org/10.1002/isaf.1392
    Download Restriction: no

    File URL: https://libkey.io/10.1002/isaf.1392?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Sunita Goel & Jagdish Gangolly, 2012. "Beyond The Numbers: Mining The Annual Reports For Hidden Cues Indicative Of Financial Statement Fraud," Intelligent Systems in Accounting, Finance and Management, John Wiley & Sons, Ltd., vol. 19(2), pages 75-89, April.
    2. Sanjiv R. Das & Mike Y. Chen, 2007. "Yahoo! for Amazon: Sentiment Extraction from Small Talk on the Web," Management Science, INFORMS, vol. 53(9), pages 1375-1388, September.
    3. Mike Thelwall & Kevan Buckley & Georgios Paltoglou, 2012. "Sentiment strength detection for the social web," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 63(1), pages 163-173, January.
    4. Diego García, 2013. "Sentiment during Recessions," Journal of Finance, American Finance Association, vol. 68(3), pages 1267-1300, June.
    5. P. S. Bradley & O. L. Mangasarian & W. N. Street, 1998. "Feature Selection via Mathematical Programming," INFORMS Journal on Computing, INFORMS, vol. 10(2), pages 209-217, May.
    6. John R. Carlson & Joey F. George & Judee K. Burgoon & Mark Adkins & Cindy H. White, 2004. "Deception in Computer-Mediated Communication," Group Decision and Negotiation, Springer, vol. 13(1), pages 5-28, January.
    7. Lina Zhou & Judee K. Burgoon & Jay F. Nunamaker & Doug Twitchell, 2004. "Automating Linguistics-Based Cues for Detecting Deception in Text-Based Asynchronous Computer-Mediated Communications," Group Decision and Negotiation, Springer, vol. 13(1), pages 81-106, January.
    8. Jeffrey T. Hancock & Michael T. Woodworth & Saurabh Goorha, 2010. "See No Evil: The Effect of Communication Medium and Motivation on Deception Detection," Group Decision and Negotiation, Springer, vol. 19(4), pages 327-343, July.
    9. Paul C. Tetlock, 2007. "Giving Content to Investor Sentiment: The Role of Media in the Stock Market," Journal of Finance, American Finance Association, vol. 62(3), pages 1139-1168, June.
    10. repec:bla:jfinan:v:59:y:2004:i:3:p:1259-1294 is not listed on IDEAS
    11. Patricia M. Dechow & Weili Ge & Chad R. Larson & Richard G. Sloan, 2011. "Predicting Material Accounting Misstatements," Contemporary Accounting Research, John Wiley & Sons, vol. 28(1), pages 17-82, March.
    12. Mark Cecchini & Haldun Aytug & Gary J. Koehler & Praveen Pathak, 2010. "Detecting Management Fraud in Public Companies," Management Science, INFORMS, vol. 56(7), pages 1146-1160, July.
    13. Tim Loughran & Bill Mcdonald, 2011. "When Is a Liability Not a Liability? Textual Analysis, Dictionaries, and 10‐Ks," Journal of Finance, American Finance Association, vol. 66(1), pages 35-65, February.
    14. Mike Thelwall & Kevan Buckley & Georgios Paltoglou, 2012. "Sentiment strength detection for the social web," Journal of the American Society for Information Science and Technology, Association for Information Science & Technology, vol. 63(1), pages 163-173, January.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Ahmed, Shamima & Alshater, Muneer M. & Ammari, Anis El & Hammami, Helmi, 2022. "Artificial intelligence and machine learning in finance: A bibliometric review," Research in International Business and Finance, Elsevier, vol. 61(C).
    2. Li, Jing & Li, Nan & Xia, Tongshui & Guo, Jinjin, 2023. "Textual analysis and detection of financial fraud: Evidence from Chinese manufacturing firms," Economic Modelling, Elsevier, vol. 126(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Qing Liu & Hosung Son, 2024. "Methods for aggregating investor sentiment from social media," Palgrave Communications, Palgrave Macmillan, vol. 11(1), pages 1-22, December.
    2. Thomas Renault, 2020. "Sentiment analysis and machine learning in finance: a comparison of methods and models on one million messages," Digital Finance, Springer, vol. 2(1), pages 1-13, September.
    3. Ahmad, Khurshid & Han, JingGuang & Hutson, Elaine & Kearney, Colm & Liu, Sha, 2016. "Media-expressed negative tone and firm-level stock returns," Journal of Corporate Finance, Elsevier, vol. 37(C), pages 152-172.
    4. Steven Heston & Nitish R. Sinha, 2016. "News versus Sentiment : Predicting Stock Returns from News Stories," Finance and Economics Discussion Series 2016-048, Board of Governors of the Federal Reserve System (U.S.).
    5. David F. Larcker & Anastasia A. Zakolyukina, 2012. "Detecting Deceptive Discussions in Conference Calls," Journal of Accounting Research, Wiley Blackwell, vol. 50(2), pages 495-540, May.
    6. Ingrid E. Fisher & Margaret R. Garnsey & Mark E. Hughes, 2016. "Natural Language Processing in Accounting, Auditing and Finance: A Synthesis of the Literature with a Roadmap for Future Research," Intelligent Systems in Accounting, Finance and Management, John Wiley & Sons, Ltd., vol. 23(3), pages 157-214, July.
    7. Saadon, Yossi & Schreiber, Ben Z., 2023. "Newspapers tone and the overnight-intraday stock return anomaly," Journal of Financial Markets, Elsevier, vol. 65(C).
    8. Renault, Thomas, 2017. "Intraday online investor sentiment and return patterns in the U.S. stock market," Journal of Banking & Finance, Elsevier, vol. 84(C), pages 25-40.
    9. Elias Zavitsanos & Dimitris Mavroeidis & Konstantinos Bougiatiotis & Eirini Spyropoulou & Lefteris Loukas & Georgios Paliouras, 2023. "Financial misstatement detection: a realistic evaluation," Papers 2305.17457, arXiv.org.
    10. Zhang, Xiaotao & Li, Guoran & Li, Yishuo & Zou, Gaofeng & Wu, Ji George, 2023. "Which is more important in stock market forecasting: Attention or sentiment?," International Review of Financial Analysis, Elsevier, vol. 89(C).
    11. Chau, Michael & Lin, Chih-Yung & Lin, Tse-Chun, 2020. "Wisdom of crowds before the 2007–2009 global financial crisis," Journal of Financial Stability, Elsevier, vol. 48(C).
    12. Miwa, Kotaro, 2022. "The informational role of analysts’ textual statements," Research in International Business and Finance, Elsevier, vol. 59(C).
    13. Mosi Rosenboim & Yossi Saadon & Ben Z. Schreiber, 2018. "“Much Ado about Nothing”? The Effect of Print Media Tone on Stock Indices," Bank of Israel Working Papers 2018.10, Bank of Israel.
    14. Sharpe, Steven A. & Sinha, Nitish R. & Hollrah, Christopher A., 2023. "The power of narrative sentiment in economic forecasts," International Journal of Forecasting, Elsevier, vol. 39(3), pages 1097-1121.
    15. Tim Loughran & Bill Mcdonald, 2016. "Textual Analysis in Accounting and Finance: A Survey," Journal of Accounting Research, Wiley Blackwell, vol. 54(4), pages 1187-1230, September.
    16. Chen, Cathy Yi-Hsuan & Després, Roméo & Guo, Li & Renault, Thomas, 2019. "What makes cryptocurrencies special? Investor sentiment and return predictability during the bubble," IRTG 1792 Discussion Papers 2019-016, Humboldt University of Berlin, International Research Training Group 1792 "High Dimensional Nonstationary Time Series".
    17. Calomiris, Charles W. & Mamaysky, Harry, 2019. "How news and its context drive risk and returns around the world," Journal of Financial Economics, Elsevier, vol. 133(2), pages 299-336.
    18. Elshandidy, Tamer & Kamel, Hany, 2024. "Tone of narrative disclosures and earnings management: UK evidence," Advances in accounting, Elsevier, vol. 64(C).
    19. Brandt, Michael W. & Gao, Lin, 2019. "Macro fundamentals or geopolitical events? A textual analysis of news events for crude oil," Journal of Empirical Finance, Elsevier, vol. 51(C), pages 64-94.
    20. Charles W. Calomiris & Harry Mamaysky, 2018. "How News and Its Context Drive Risk and Returns Around the World," NBER Working Papers 24430, National Bureau of Economic Research, Inc.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:wly:isacfm:v:23:y:2016:i:3:p:215-239. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: http://www.interscience.wiley.com/jpages/1099-1174/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.