IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0201703.html
   My bibliography  Save this article

Latent human traits in the language of social media: An open-vocabulary approach

Author

Listed:
  • Vivek Kulkarni
  • Margaret L Kern
  • David Stillwell
  • Michal Kosinski
  • Sandra Matz
  • Lyle Ungar
  • Steven Skiena
  • H Andrew Schwartz

Abstract

Over the past century, personality theory and research has successfully identified core sets of characteristics that consistently describe and explain fundamental differences in the way people think, feel and behave. Such characteristics were derived through theory, dictionary analyses, and survey research using explicit self-reports. The availability of social media data spanning millions of users now makes it possible to automatically derive characteristics from behavioral data—language use—at large scale. Taking advantage of linguistic information available through Facebook, we study the process of inferring a new set of potential human traits based on unprompted language use. We subject these new traits to a comprehensive set of evaluations and compare them with a popular five factor model of personality. We find that our language-based trait construct is often more generalizable in that it often predicts non-questionnaire-based outcomes better than questionnaire-based traits (e.g. entities someone likes, income and intelligence quotient), while the factors remain nearly as stable as traditional factors. Our approach suggests a value in new constructs of personality derived from everyday human language use.

Suggested Citation

  • Vivek Kulkarni & Margaret L Kern & David Stillwell & Michal Kosinski & Sandra Matz & Lyle Ungar & Steven Skiena & H Andrew Schwartz, 2018. "Latent human traits in the language of social media: An open-vocabulary approach," PLOS ONE, Public Library of Science, vol. 13(11), pages 1-18, November.
  • Handle: RePEc:plo:pone00:0201703
    DOI: 10.1371/journal.pone.0201703
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0201703
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0201703&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0201703?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. H Andrew Schwartz & Johannes C Eichstaedt & Margaret L Kern & Lukasz Dziurzynski & Stephanie M Ramones & Megha Agrawal & Achal Shah & Michal Kosinski & David Stillwell & Martin E P Seligman & Lyle H U, 2013. "Personality, Gender, and Age in the Language of Social Media: The Open-Vocabulary Approach," PLOS ONE, Public Library of Science, vol. 8(9), pages 1-16, September.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jörn H. Block & Walter Diegel & Christian Fisch, 2024. "How venture capital funding changes an entrepreneur’s digital identity: more self-confidence and professionalism but less authenticity!," Review of Managerial Science, Springer, vol. 18(8), pages 2287-2319, August.
    2. Karel Hrazdil & Jiri Novak & Rafael Rogo & Christine Wiedman & Ray Zhang, 2020. "Measuring executive personality using machine‐learning algorithms: A new approach and audit fee‐based validation tests," Journal of Business Finance & Accounting, Wiley Blackwell, vol. 47(3-4), pages 519-544, March.
    3. Luo, Shuli & He, Sylvia Y., 2021. "Understanding gender difference in perceptions toward transit services across space and time: A social media mining approach," Transport Policy, Elsevier, vol. 111(C), pages 63-73.
    4. Gow, Ian D. & Kaplan, Steven N. & Larcker, David F. & Zakolyukina, Anastasia A., 2016. "CEO Personality and Firm Policies," Research Papers 3444, Stanford University, Graduate School of Business.
    5. Mikkel Wallentin, 2018. "Sex differences in post-stroke aphasia rates are caused by age. A meta-analysis and database query," PLOS ONE, Public Library of Science, vol. 13(12), pages 1-18, December.
    6. Hannes Rosenbusch & Maya Aghaei & Anthony M. Evans & Marcel Zeelenberg, 2021. "Psychological trait inferences from women’s clothing: human and machine prediction," Journal of Computational Social Science, Springer, vol. 4(2), pages 479-501, November.
    7. Rachel Winter & Anna Lavis, 2021. "The Impact of COVID-19 on Young People’s Mental Health in the UK: Key Insights from Social Media Using Online Ethnography," IJERPH, MDPI, vol. 19(1), pages 1-13, December.
    8. Lushi Chen & Tao Gong & Michal Kosinski & David Stillwell & Robert L Davidson, 2017. "Building a profile of subjective well-being for social media users," PLOS ONE, Public Library of Science, vol. 12(11), pages 1-15, November.
    9. Chunhua Ju & Qiuyang Gu & Yi Fang & Fuguang Bao, 2020. "Research on User Influence Model Integrating Personality Traits under Strong Connection," Sustainability, MDPI, vol. 12(6), pages 1-15, March.
    10. Manuel Kaiser & Andreas Kuckertz, 2025. "Emotions and entrepreneurial finance: Analysis of venture capitalists’ and business angels’ digital footprints on Twitter," International Entrepreneurship and Management Journal, Springer, vol. 21(1), pages 1-29, December.
    11. Karel Hrazdil & Fereshteh Mahmoudian & Jamal A. Nazari, 2021. "Executive personality and sustainability: Do extraverted chief executive officers improve corporate social responsibility?," Corporate Social Responsibility and Environmental Management, John Wiley & Sons, vol. 28(6), pages 1564-1578, November.
    12. Daniel Hoppe & Helen Keller & Felix Horstmann, 2022. "Got Employer Image? How Applicants Choose Their Employer," Corporate Reputation Review, Palgrave Macmillan, vol. 25(2), pages 139-159, May.
    13. Niklas Ziemann, 2022. "You will receive your money next week! Experimental evidence on the role of Future-Time Reference for intertemporal decision-making," CEPA Discussion Papers 56, Center for Economic Policy Analysis.
    14. Sandra C Matz & Jochen I Menges & David J Stillwell & H Andrew Schwartz, 2019. "Predicting individual-level income from Facebook profiles," PLOS ONE, Public Library of Science, vol. 14(3), pages 1-13, March.
    15. Yunhwan Kim & Sunmi Lee, 2021. "Personality of Public Health Organizations’ Instagram Accounts and According Differences in Photos at Content and Pixel Levels," IJERPH, MDPI, vol. 18(8), pages 1-15, April.
    16. Aimei Yang & Alvin Zhou & Jieun Shin & Ke Huang-Isherwood & Wenlin Liu & Chuqing Dong & Eugene Lee & Jingyi Sun, 2024. "Sharing is caring? How moral foundation frames drive the sharing of corrective messages and misinformation about COVID-19 vaccines," Journal of Computational Social Science, Springer, vol. 7(3), pages 2701-2733, December.
    17. Dariusz Zdonek & Karol Król, 2021. "The Impact of Sex and Personality Traits on Social Media Use during the COVID-19 Pandemic in Poland," Sustainability, MDPI, vol. 13(9), pages 1-27, April.
    18. Yue Han & Theodoros Lappas & Gaurav Sabnis, 2020. "The Importance of Interactions Between Content Characteristics and Creator Characteristics for Studying Virality in Social Media," Information Systems Research, INFORMS, vol. 31(2), pages 576-588, June.
    19. H. Andrew Schwartz & Lyle H. Ungar, 2015. "Data-Driven Content Analysis of Social Media," The ANNALS of the American Academy of Political and Social Science, , vol. 659(1), pages 78-94, May.
    20. Olga Bogolyubova & Polina Panicheva & Yanina Ledovaya & Roman Tikhonov & Bulat Yaminov, 2020. "The Language of Positive Mental Health: Findings From a Sample of Russian Facebook Users," SAGE Open, , vol. 10(2), pages 21582440209, May.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0201703. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.