IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0209323.html
   My bibliography  Save this article

Statistical inferences for polarity identification in natural language

Author

Listed:
  • Nicolas Pröllochs
  • Stefan Feuerriegel
  • Dirk Neumann

Abstract

Information forms the basis for all human behavior, including the ubiquitous decision-making that people constantly perform in their every day lives. It is thus the mission of researchers to understand how humans process information to reach decisions. In order to facilitate this task, this work proposes LASSO regularization as a statistical tool to extract decisive words from textual content in order to study the reception of granular expressions in natural language. This differs from the usual use of the LASSO as a predictive model and, instead, yields highly interpretable statistical inferences between the occurrences of words and an outcome variable. Accordingly, the method suggests direct implications for the social sciences: it serves as a statistical procedure for generating domain-specific dictionaries as opposed to frequently employed heuristics. In addition, researchers can now identify text segments and word choices that are statistically decisive to authors or readers and, based on this knowledge, test hypotheses from behavioral research.

Suggested Citation

  • Nicolas Pröllochs & Stefan Feuerriegel & Dirk Neumann, 2018. "Statistical inferences for polarity identification in natural language," PLOS ONE, Public Library of Science, vol. 13(12), pages 1-21, December.
  • Handle: RePEc:plo:pone00:0209323
    DOI: 10.1371/journal.pone.0209323
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0209323
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0209323&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0209323?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Christie M. Fuller & David P. Biros & Judee Burgoon & Jay Nunamaker, 2013. "An Examination and Validation of Linguistic Constructs for Studying High-Stakes Deception," Group Decision and Negotiation, Springer, vol. 22(1), pages 117-134, January.
    2. Paul C. Tetlock & Maytal Saar‐Tsechansky & Sofus Macskassy, 2008. "More Than Words: Quantifying Language to Measure Firms' Fundamentals," Journal of Finance, American Finance Association, vol. 63(3), pages 1437-1467, June.
    3. Carter, ME & Soo, BS, 1999. "The relevance of Form 8-K reports," Journal of Accounting Research, Wiley Blackwell, vol. 37(1), pages 119-132.
    4. Hal R. Varian, 2014. "Big Data: New Tricks for Econometrics," Journal of Economic Perspectives, American Economic Association, vol. 28(2), pages 3-28, Spring.
    5. Matt Taddy, 2013. "Multinomial Inverse Regression for Text Analysis," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 108(503), pages 755-770, September.
    6. Khim-Yong Goh & Cheng-Suang Heng & Zhijie Lin, 2013. "Social Media Brand Community and Consumer Behavior: Quantifying the Relative Impact of User- and Marketer-Generated Content," Information Systems Research, INFORMS, vol. 24(1), pages 88-107, March.
    7. Tim Loughran & Bill Mcdonald, 2016. "Textual Analysis in Accounting and Finance: A Survey," Journal of Accounting Research, Wiley Blackwell, vol. 54(4), pages 1187-1230, September.
    8. Paul C. Tetlock, 2007. "Giving Content to Investor Sentiment: The Role of Media in the Stock Market," Journal of Finance, American Finance Association, vol. 62(3), pages 1139-1168, June.
    9. Tim Loughran & Bill Mcdonald, 2011. "When Is a Liability Not a Liability? Textual Analysis, Dictionaries, and 10‐Ks," Journal of Finance, American Finance Association, vol. 66(1), pages 35-65, February.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Kirill Solovev & Nicolas Prollochs, 2021. "Integrating Floor Plans into Hedonic Models for Rent Price Appraisal," Papers 2102.08162, arXiv.org.
    2. Karsten Müller, 2022. "German forecasters’ narratives: How informative are German business cycle forecast reports?," Empirical Economics, Springer, vol. 62(5), pages 2373-2415, May.
    3. Ming-Fu Hsu & Chingho Chang & Jhih‐Hong Zeng, 2022. "Automated text mining process for corporate risk analysis and management," Risk Management, Palgrave Macmillan, vol. 24(4), pages 386-419, December.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Andres Algaba & David Ardia & Keven Bluteau & Samuel Borms & Kris Boudt, 2020. "Econometrics Meets Sentiment: An Overview Of Methodology And Applications," Journal of Economic Surveys, Wiley Blackwell, vol. 34(3), pages 512-547, July.
    2. Stefan Feuerriegel & Nicolas Prollochs, 2018. "Investor Reaction to Financial Disclosures Across Topics: An Application of Latent Dirichlet Allocation," Papers 1805.03308, arXiv.org.
    3. Karsten Müller, 2022. "German forecasters’ narratives: How informative are German business cycle forecast reports?," Empirical Economics, Springer, vol. 62(5), pages 2373-2415, May.
    4. Leilane de Freitas Rocha Cambara & Roberto Meurer, 2023. "News sentiment and foreign portfolio investment in Brazil," International Journal of Finance & Economics, John Wiley & Sons, Ltd., vol. 28(3), pages 3332-3348, July.
    5. Bannier, Christina E. & Pauls, Thomas & Walter, Andreas, 2017. "CEO-speeches and stock returns," VfS Annual Conference 2017 (Vienna): Alternative Structures for Money and Banking 168192, Verein für Socialpolitik / German Economic Association.
    6. García, Diego & Hu, Xiaowen & Rohrer, Maximilian, 2023. "The colour of finance words," Journal of Financial Economics, Elsevier, vol. 147(3), pages 525-549.
    7. Jie Hao & Viet T. Pham, 2022. "COVID‐19 Disclosures and Market Uncertainty: Evidence from 10‐Q Filings," Australian Accounting Review, CPA Australia, vol. 32(2), pages 238-266, June.
    8. An, Suwei, 2023. "Essays on incentive contracts, M&As, and firm risk," Other publications TiSEM dd97d2f5-1c9d-47c5-ba62-f, Tilburg University, School of Economics and Management.
    9. Chong, Terence Tai Leung & Wu, Zhang & Liu, Yuchen, 2019. "Market Reaction to iPhone Rumors," MPRA Paper 92014, University Library of Munich, Germany.
    10. Renato Camodeca & Alex Almici & Umberto Sagliaschi, 2018. "Sustainability Disclosure in Integrated Reporting: Does It Matter to Investors? A Cheap Talk Approach," Sustainability, MDPI, vol. 10(12), pages 1-34, November.
    11. repec:bny:wpaper:0054 is not listed on IDEAS
    12. Koulikidou, Kleopatra & Chantziaras, Antonios & Dedoulis, Emmanouil & Leventis, Stergios, 2023. "Regulatory enforcement, foreignness, and language negativity: Evidence from SEC comment letters," Journal of International Accounting, Auditing and Taxation, Elsevier, vol. 51(C).
    13. Durnev, Art & Mangen, Claudine, 2020. "The spillover effects of MD&A disclosures for real investment: The role of industry competition," Journal of Accounting and Economics, Elsevier, vol. 70(1).
    14. Loughran, Tim & McDonald, Bill & Pragidis, Ioannis, 2019. "Assimilation of oil news into prices," International Review of Financial Analysis, Elsevier, vol. 63(C), pages 105-118.
    15. Renault, Thomas, 2017. "Intraday online investor sentiment and return patterns in the U.S. stock market," Journal of Banking & Finance, Elsevier, vol. 84(C), pages 25-40.
    16. Anand, Abhinav & Basu, Sankarshan & Pathak, Jalaj & Thampy, Ashok, 2021. "The impact of sentiment on emerging stock markets," International Review of Economics & Finance, Elsevier, vol. 75(C), pages 161-177.
    17. Christina Bannier & Thomas Pauls & Andreas Walter, 2019. "Content analysis of business communication: introducing a German dictionary," Journal of Business Economics, Springer, vol. 89(1), pages 79-123, February.
    18. Onur Bayar & Emre Kesici, 2024. "The impact of social media on venture capital financing: evidence from Twitter interactions," Review of Quantitative Finance and Accounting, Springer, vol. 62(1), pages 195-224, January.
    19. Barth, Andreas & Radev, Deyan, 2022. "Integration culture of global banks and the transmission of lending shocks," Journal of Banking & Finance, Elsevier, vol. 134(C).
    20. Nadine Gatzert & Dinah Heidinger, 2020. "An Empirical Analysis of Market Reactions to the First Solvency and Financial Condition Reports in the European Insurance Industry," Journal of Risk & Insurance, The American Risk and Insurance Association, vol. 87(2), pages 407-436, June.
    21. Chau, Michael & Lin, Chih-Yung & Lin, Tse-Chun, 2020. "Wisdom of crowds before the 2007–2009 global financial crisis," Journal of Financial Stability, Elsevier, vol. 48(C).

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0209323. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.