IDEAS home Printed from https://ideas.repec.org/a/gam/jijerp/v16y2019i10p1766-d232352.html
   My bibliography  Save this article

Pride, Love, and Twitter Rants: Combining Machine Learning and Qualitative Techniques to Understand What Our Tweets Reveal about Race in the US

Author

Listed:
  • Thu T. Nguyen

    (Department of Epidemiology & Biostatistics, University of California San Francisco, San Francisco, CA 94158, USA)

  • Shaniece Criss

    (Department of Health Science, Furman University, Greenville, SC 29613, USA)

  • Amani M. Allen

    (Divisions of Community Health Sciences and Epidemiology, University of California, Berkeley, CA 94704, USA)

  • M. Maria Glymour

    (Department of Epidemiology & Biostatistics, University of California San Francisco, San Francisco, CA 94158, USA
    Department of Social and Behavioral Sciences, Harvard T.H. Chan School of Public Health, Boston, MA 02215, USA)

  • Lynn Phan

    (Program of Public Health Science, University of Maryland School of Public Health, College Park, MD 20742, USA)

  • Ryan Trevino

    (Department of Health Sciences, College of Science and Health, DePaul University, Chicago, IL 60614, USA)

  • Shrikha Dasari

    (Department of Epidemiology & Biostatistics, University of California San Francisco, San Francisco, CA 94158, USA)

  • Quynh C. Nguyen

    (Department of Epidemiology & Biostatistics, University of Maryland School of Public Health, College Park, MD 20742, USA)

Abstract

Objective : Describe variation in sentiment of tweets using race-related terms and identify themes characterizing the social climate related to race. Methods : We applied a Stochastic Gradient Descent Classifier to conduct sentiment analysis of 1,249,653 US tweets using race-related terms from 2015–2016. To evaluate accuracy, manual labels were compared against computer labels for a random subset of 6600 tweets. We conducted qualitative content analysis on a random sample of 2100 tweets. Results : Agreement between computer labels and manual labels was 74%. Tweets referencing Middle Eastern groups (12.5%) or Blacks (13.8%) had the lowest positive sentiment compared to tweets referencing Asians (17.7%) and Hispanics (17.5%). Qualitative content analysis revealed most tweets were represented by the categories: negative sentiment (45%), positive sentiment such as pride in culture (25%), and navigating relationships (15%). While all tweets use one or more race-related terms, negative sentiment tweets which were not derogatory or whose central topic was not about race were common. Conclusion : This study harnesses relatively untapped social media data to develop a novel area-level measure of social context (sentiment scores) and highlights some of the challenges in doing this work. New approaches to measuring the social environment may enhance research on social context and health.

Suggested Citation

  • Thu T. Nguyen & Shaniece Criss & Amani M. Allen & M. Maria Glymour & Lynn Phan & Ryan Trevino & Shrikha Dasari & Quynh C. Nguyen, 2019. "Pride, Love, and Twitter Rants: Combining Machine Learning and Qualitative Techniques to Understand What Our Tweets Reveal about Race in the US," IJERPH, MDPI, vol. 16(10), pages 1-19, May.
  • Handle: RePEc:gam:jijerp:v:16:y:2019:i:10:p:1766-:d:232352
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/1660-4601/16/10/1766/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/1660-4601/16/10/1766/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Krieger, N. & Sidney, S., 1996. "Racial discrimination and blood pressure: The CARDIA study of young black and white adults," American Journal of Public Health, American Public Health Association, vol. 86(10), pages 1370-1378.
    2. Krieger, Nancy & Smith, Kevin & Naishadham, Deepa & Hartman, Cathy & Barbeau, Elizabeth M., 2005. "Experiences of discrimination: Validity and reliability of a self-report measure for population health research on racism and health," Social Science & Medicine, Elsevier, vol. 61(7), pages 1576-1596, October.
    3. Devah Pager, 2007. "The Use of Field Experiments for Studies of Employment Discrimination: Contributions, Critiques, and Directions for the Future," The ANNALS of the American Academy of Political and Social Science, , vol. 609(1), pages 104-133, January.
    4. Diane Lauderdale, 2006. "Birth outcomes for Arabic-named women in California before and after September 11," Demography, Springer;Population Association of America (PAA), vol. 43(1), pages 185-201, February.
    5. Lee, Y. & Muennig, P. & Kawachi, I. & Hatzenbuehler, M.L., 2015. "Effects of racial prejudice on the health of communities: A multilevel survival analysis," American Journal of Public Health, American Public Health Association, vol. 105(11), pages 2349-2355.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Thu T. Nguyen & Shaniece Criss & Pallavi Dwivedi & Dina Huang & Jessica Keralis & Erica Hsu & Lynn Phan & Leah H. Nguyen & Isha Yardi & M. Maria Glymour & Amani M. Allen & David H. Chae & Gilbert C. G, 2020. "Exploring U.S. Shifts in Anti-Asian Sentiment with the Emergence of COVID-19," IJERPH, MDPI, vol. 17(19), pages 1-13, September.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Johnston, David W. & Lordan, Grace, 2012. "Discrimination makes me sick! An examination of the discrimination–health relationship," Journal of Health Economics, Elsevier, vol. 31(1), pages 99-111.
    2. Nancy Krieger & Pamela D Waterman & Anna Kosheleva & Jarvis T Chen & Dana R Carney & Kevin W Smith & Gary G Bennett & David R Williams & Elmer Freeman & Beverley Russell & Gisele Thornhill & Kristin M, 2011. "Exposing Racial Discrimination: Implicit & Explicit Measures–The My Body, My Story Study of 1005 US-Born Black & White Community Health Center Members," PLOS ONE, Public Library of Science, vol. 6(11), pages 1-24, November.
    3. Gee, Gilbert C. & Spencer, Michael & Chen, Juan & Yip, Tiffany & Takeuchi, David T., 2007. "The association between self-reported racial discrimination and 12-month DSM-IV mental disorders among Asian Americans nationwide," Social Science & Medicine, Elsevier, vol. 64(10), pages 1984-1996, May.
    4. Gee, Gilbert & Walsemann, Katrina, 2009. "Does health predict the reporting of racial discrimination or do reports of discrimination predict health? Findings from the National Longitudinal Study of Youth," Social Science & Medicine, Elsevier, vol. 68(9), pages 1676-1684, May.
    5. Seung-Sup Kim & Yeonseung Chung & S V Subramanian & David R Williams, 2012. "Measuring Discrimination in South Korea: Underestimating the Prevalence of Discriminatory Experiences among Female and Less Educated Workers?," PLOS ONE, Public Library of Science, vol. 7(3), pages 1-8, March.
    6. Gilbert, Paul A. & Zemore, Sarah E., 2016. "Discrimination and drinking: A systematic review of the evidence," Social Science & Medicine, Elsevier, vol. 161(C), pages 178-194.
    7. Pam Phojanakong & Emily Brown Weida & Gabriella Grimaldi & Félice Lê-Scherban & Mariana Chilton, 2019. "Experiences of Racial and Ethnic Discrimination Are Associated with Food Insecurity and Poor Health," IJERPH, MDPI, vol. 16(22), pages 1-13, November.
    8. Cunningham, Timothy J. & Seeman, Teresa E. & Kawachi, Ichiro & Gortmaker, Steven L. & Jacobs, David R. & Kiefe, Catarina I. & Berkman, Lisa F., 2012. "Racial/ethnic and gender differences in the association between self-reported experiences of racial/ethnic discrimination and inflammation in the CARDIA cohort of 4 US communities," Social Science & Medicine, Elsevier, vol. 75(5), pages 922-931.
    9. Lukachko, Alicia & Hatzenbuehler, Mark L. & Keyes, Katherine M., 2014. "Structural racism and myocardial infarction in the United States," Social Science & Medicine, Elsevier, vol. 103(C), pages 42-50.
    10. Colen, Cynthia G. & Ramey, David M. & Cooksey, Elizabeth C. & Williams, David R., 2018. "Racial disparities in health among nonpoor African Americans and Hispanics: The role of acute and chronic discrimination," Social Science & Medicine, Elsevier, vol. 199(C), pages 167-180.
    11. Kelaher, M. & Paul, Sheila & Lambert, Helen & Ahmad, Waqar & Paradies, Yin & Davey Smith, George, 2008. "Discrimination and health in an English study," Social Science & Medicine, Elsevier, vol. 66(7), pages 1627-1636, April.
    12. Chae, David H. & Clouston, Sean & Martz, Connor D. & Hatzenbuehler, Mark L. & Cooper, Hannah L.F. & Turpin, Rodman & Stephens-Davidowitz, Seth & Kramer, Michael R., 2018. "Area racism and birth outcomes among Blacks in the United States," Social Science & Medicine, Elsevier, vol. 199(C), pages 49-55.
    13. Hatzenbuehler, Mark L. & Rutherford, Caroline & McKetta, Sarah & Prins, Seth J. & Keyes, Katherine M., 2020. "Structural stigma and all-cause mortality among sexual minorities: Differences by sexual behavior?," Social Science & Medicine, Elsevier, vol. 244(C).
    14. Harris, Ricci & Tobias, Martin & Jeffreys, Mona & Waldegrave, Kiri & Karlsen, Saffron & Nazroo, James, 2006. "Racism and health: The relationship between experience of racial discrimination and health in New Zealand," Social Science & Medicine, Elsevier, vol. 63(6), pages 1428-1441, September.
    15. Krieger, Nancy & Chen, Jarvis T. & Waterman, Pamela D. & Hartman, Cathy & Stoddard, Anne M. & Quinn, Margaret M. & Sorensen, Glorian & Barbeau, Elizabeth M., 2008. "The inverse hazard law: Blood pressure, sexual harassment, racial discrimination, workplace abuse and occupational exposures in US low-income black, white and Latino workers," Social Science & Medicine, Elsevier, vol. 67(12), pages 1970-1981, December.
    16. Diane Coffey & Ashwini Deshpande & Jeffrey Hammer & Dean Spears, 2019. "Local Social Inequality, Economic Inequality, and Disparities in Child Height in India," Demography, Springer;Population Association of America (PAA), vol. 56(4), pages 1427-1452, August.
    17. Fabian T C Schmidt & Clemens M Lechner & Daniel Danner, 2020. "New wine in an old bottle? A facet-level perspective on the added value of Grit over BFI–2 Conscientiousness," PLOS ONE, Public Library of Science, vol. 15(2), pages 1-25, February.
    18. Alfonso Urzúa & Alejandra Caqueo-Urízar & Diego Henríquez & David R. Williams, 2021. "Discrimination and Health: The Mediating Effect of Acculturative Stress," IJERPH, MDPI, vol. 18(10), pages 1-11, May.
    19. Gaeul Kim & Jinmok Kim & Su-Kyoung Lee & Juho Sim & Yangwook Kim & Byung-Yoon Yun & Jin-Ha Yoon, 2020. "Multidimensional gender discrimination in workplace and depressive symptoms," PLOS ONE, Public Library of Science, vol. 15(7), pages 1-13, July.
    20. Petra Persson & Maya Rossin-Slater, 2018. "Family Ruptures, Stress, and the Mental Health of the Next Generation," American Economic Review, American Economic Association, vol. 108(4-5), pages 1214-1252, April.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jijerp:v:16:y:2019:i:10:p:1766-:d:232352. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.