IDEAS home Printed from https://ideas.repec.org/a/gam/jijerp/v17y2020i4p1125-d318933.html
   My bibliography  Save this article

Cultural Differences in Tweeting about Drinking Across the US

Author

Listed:
  • Salvatore Giorgi

    (Computer and Information Science Department, University of Pennsylvania, Philadelphia, PA 19104, USA
    National Institutes of Health, National Institute on Drug Abuse, Bethesda, MD 20892, USA)

  • David B. Yaden

    (Department of Psychology, University of Pennsylvania, Philadelphia, PA 19104, USA)

  • Johannes C. Eichstaedt

    (Department of Psychology & Institute for Human-Centered Artificial Intelligence, Stanford University, Stanford, CA 94305, USA)

  • Robert D. Ashford

    (Substance Use Disorders Institute, University of the Sciences, Philadelphia, PA 19104, USA)

  • Anneke E.K. Buffone

    (Department of Psychology, University of Pennsylvania, Philadelphia, PA 19104, USA)

  • H. Andrew Schwartz

    (Department of Computer Science, Stony Brook University, Stony Brook, NY 11794, USA)

  • Lyle H. Ungar

    (Computer and Information Science Department, University of Pennsylvania, Philadelphia, PA 19104, USA)

  • Brenda Curtis

    (National Institutes of Health, National Institute on Drug Abuse, Bethesda, MD 20892, USA)

Abstract

Excessive alcohol use in the US contributes to over 88,000 deaths per year and costs over $250 billion annually. While previous studies have shown that excessive alcohol use can be detected from general patterns of social media engagement, we characterized how drinking-specific language varies across regions and cultures in the US. From a database of 38 billion public tweets, we selected those mentioning “drunk”, found the words and phrases distinctive of drinking posts, and then clustered these into topics and sets of semantically related words. We identified geolocated “drunk” tweets and correlated their language with the prevalence of self-reported excessive alcohol consumption (Behavioral Risk Factor Surveillance System; BRFSS). We then identified linguistic markers associated with excessive drinking in different regions and cultural communities as identified by the American Community Project. “Drunk” tweet frequency (of the 3.3 million geolocated “drunk” tweets) correlated with excessive alcohol consumption at both the county and state levels ( r = 0.26 and 0.45, respectively, p < 0.01). Topic analyses revealed that excessive alcohol consumption was most correlated with references to drinking with friends ( r = 0.20), family ( r = 0.15), and driving under the influence ( r = 0.14). Using the American Community Project classification, we found a number of cultural markers of drinking: religious communities had a high frequency of anti-drunk driving tweets, Hispanic centers discussed family members drinking, and college towns discussed sexual behavior. This study shows that Twitter can be used to explore the specific sociocultural contexts in which excessive alcohol use occurs within particular regions and communities. These findings can inform more targeted public health messaging and help to better understand cultural determinants of substance abuse.

Suggested Citation

  • Salvatore Giorgi & David B. Yaden & Johannes C. Eichstaedt & Robert D. Ashford & Anneke E.K. Buffone & H. Andrew Schwartz & Lyle H. Ungar & Brenda Curtis, 2020. "Cultural Differences in Tweeting about Drinking Across the US," IJERPH, MDPI, vol. 17(4), pages 1-14, February.
  • Handle: RePEc:gam:jijerp:v:17:y:2020:i:4:p:1125-:d:318933
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/1660-4601/17/4/1125/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/1660-4601/17/4/1125/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Brenda Curtis & Salvatore Giorgi & Anneke E K Buffone & Lyle H Ungar & Robert D Ashford & Jessie Hemmons & Dan Summers & Casey Hamilton & H Andrew Schwartz, 2018. "Can Twitter be used to predict county excessive alcohol consumption rates?," PLOS ONE, Public Library of Science, vol. 13(4), pages 1-16, April.
    2. H Andrew Schwartz & Johannes C Eichstaedt & Margaret L Kern & Lukasz Dziurzynski & Stephanie M Ramones & Megha Agrawal & Achal Shah & Michal Kosinski & David Stillwell & Martin E P Seligman & Lyle H U, 2013. "Personality, Gender, and Age in the Language of Social Media: The Open-Vocabulary Approach," PLOS ONE, Public Library of Science, vol. 8(9), pages 1-16, September.
    3. Monroe, Burt L. & Colaresi, Michael P. & Quinn, Kevin M., 2008. "Fightin' Words: Lexical Feature Selection and Evaluation for Identifying the Content of Political Conflict," Political Analysis, Cambridge University Press, vol. 16(4), pages 372-403.
    4. Jeremy Ginsberg & Matthew H. Mohebbi & Rajan S. Patel & Lynnette Brammer & Mark S. Smolinski & Larry Brilliant, 2009. "Detecting influenza epidemics using search engine query data," Nature, Nature, vol. 457(7232), pages 1012-1014, February.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Lee, Ji Young & Lee, Hyun Ji & Masters, Allyson S. & Fletcher, Katelyn K. & Suh, Daniel D. & Golinkoff, Roberta M. & Hirsh-Pasek, Kathy, 2023. "Bringing playful learning to South Korea: An alternative pedagogical approach to promote children's learning and success," International Journal of Educational Development, Elsevier, vol. 97(C).
    2. Michael Stellefson & Samantha R. Paige & Beth H. Chaney & J. Don Chaney, 2020. "Social Media and Health Promotion," IJERPH, MDPI, vol. 17(9), pages 1-5, May.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. H. Andrew Schwartz & Lyle H. Ungar, 2015. "Data-Driven Content Analysis of Social Media," The ANNALS of the American Academy of Political and Social Science, , vol. 659(1), pages 78-94, May.
    2. Brenda Curtis & Salvatore Giorgi & Anneke E K Buffone & Lyle H Ungar & Robert D Ashford & Jessie Hemmons & Dan Summers & Casey Hamilton & H Andrew Schwartz, 2018. "Can Twitter be used to predict county excessive alcohol consumption rates?," PLOS ONE, Public Library of Science, vol. 13(4), pages 1-16, April.
    3. Xiaoli Wang & Shuangsheng Wu & C Raina MacIntyre & Hongbin Zhang & Weixian Shi & Xiaomin Peng & Wei Duan & Peng Yang & Yi Zhang & Quanyi Wang, 2015. "Using an Adjusted Serfling Regression Model to Improve the Early Warning at the Arrival of Peak Timing of Influenza in Beijing," PLOS ONE, Public Library of Science, vol. 10(3), pages 1-14, March.
    4. Markowitz, Sara & Nesson, Erik & Robinson, Joshua J., 2019. "The effects of employment on influenza rates," Economics & Human Biology, Elsevier, vol. 34(C), pages 286-295.
    5. Bentzen, Jeanet Sinding, 2021. "In crisis, we pray: Religiosity and the COVID-19 pandemic," Journal of Economic Behavior & Organization, Elsevier, vol. 192(C), pages 541-583.
    6. Jesse T. Richman & Ryan J. Roberts, 2023. "Assessing Spurious Correlations in Big Search Data," Forecasting, MDPI, vol. 5(1), pages 1-12, February.
    7. Grechyna, Daryna, 2025. "Raising awareness of climate change: Nature, activists, politicians?," Ecological Economics, Elsevier, vol. 227(C).
    8. Yangkun Huang & Xiaoping Xu & Sini Su, 2021. "Diverging from News Media: An Exploratory Study on the Changing Dynamics between Media and Public Attention on Cancer in China from 2011–2020," IJERPH, MDPI, vol. 18(16), pages 1-13, August.
    9. Vosen, Simeon & Schmidt, Torsten, 2012. "A monthly consumption indicator for Germany based on Internet search query data," EconStor Open Access Articles and Book Chapters, ZBW - Leibniz Information Centre for Economics, vol. 19(7), pages 683-687.
    10. Edward L. Glaeser & Scott Duke Kominers & Michael Luca & Nikhil Naik, 2018. "Big Data And Big Cities: The Promises And Limitations Of Improved Measures Of Urban Life," Economic Inquiry, Western Economic Association International, vol. 56(1), pages 114-137, January.
    11. Sean Coogan & Zhixian Sui & David Raubenheimer, 2018. "Gluttony and guilt: monthly trends in internet search query data are comparable with national-level energy intake and dieting behavior," Palgrave Communications, Palgrave Macmillan, vol. 4(1), pages 1-9, December.
    12. Tobias Preis & Federico Botta & Helen Susannah Moat, 2020. "Sensing global tourism numbers with millions of publicly shared online photographs," Environment and Planning A, , vol. 52(3), pages 471-477, May.
    13. D'Amuri, Francesco & Marcucci, Juri, 2009. "‘Google it!’ Forecasting the US unemployment rate with a Google job search index," ISER Working Paper Series 2009-32, Institute for Social and Economic Research.
    14. Liwen Ling & Dabin Zhang & Shanying Chen & Amin W. Mugera, 2020. "Can online search data improve the forecast accuracy of pork price in China?," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 39(4), pages 671-686, July.
    15. Klaus Ackermann & Simon D Angus & Paul A Raschky, 2020. "Estimating Sleep and Work Hours from Alternative Data by Segmented Functional Classification Analysis, SFCA," SoDa Laboratories Working Paper Series 2020-04, Monash University, SoDa Laboratories.
    16. Daniele Barchiesi & Helen Susannah Moat & Christian Alis & Steven Bishop & Tobias Preis, 2015. "Quantifying International Travel Flows Using Flickr," PLOS ONE, Public Library of Science, vol. 10(7), pages 1-8, July.
    17. Breithaupt, Patrick & Kesler, Reinhold & Niebel, Thomas & Rammer, Christian, 2020. "Intangible capital indicators based on web scraping of social media," ZEW Discussion Papers 20-046, ZEW - Leibniz Centre for European Economic Research.
    18. Jensen, Carsten & Naumann, Elias, 2016. "Increasing pressures and support for public healthcare in Europe," Health Policy, Elsevier, vol. 120(6), pages 698-705.
    19. Fantazzini, Dean, 2020. "Short-term forecasting of the COVID-19 pandemic using Google Trends data: Evidence from 158 countries," Applied Econometrics, Russian Presidential Academy of National Economy and Public Administration (RANEPA), vol. 59, pages 33-54.
    20. Götz, Thomas B. & Knetsch, Thomas A., 2019. "Google data in bridge equation models for German GDP," International Journal of Forecasting, Elsevier, vol. 35(1), pages 45-66.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jijerp:v:17:y:2020:i:4:p:1125-:d:318933. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.