IDEAS home Printed from https://ideas.repec.org/p/osf/osfxxx/jvpbw_v1.html
   My bibliography  Save this paper

Ethnography and Machine Learning: Synergies and New Directions

Author

Listed:
  • Abramson, Corey
  • Li, Zhuofan

Abstract

Ethnography (social scientific methods that illuminate how people understand, navigate and shape the real world contexts in which they live their lives) and machine learning (computational techniques that use big data and statistical learning models to perform quantifiable tasks) are each core to contemporary social science. Yet these tools have remained largely separate in practice. This chapter draws on a growing body of scholarship that argues that ethnography and machine learning can be usefully combined, particularly for large comparative studies. Specifically, this paper (a) explains the value (and challenges) of using machine learning alongside qualitative field research for certain types of projects, (b) discusses recent methodological trends to this effect, (c) provides examples that illustrate workflow drawn from several large projects, and (d) concludes with a roadmap for enabling productive coevolution of field methods and machine learning. Keywords ethnography, computational social science, qualitative methods, machine learning, natural language processing, large language models, computational ethnography, digital ethnography, big data, research methods, mixed-methods

Suggested Citation

  • Abramson, Corey & Li, Zhuofan, 2024. "Ethnography and Machine Learning: Synergies and New Directions," OSF Preprints jvpbw_v1, Center for Open Science.
  • Handle: RePEc:osf:osfxxx:jvpbw_v1
    DOI: 10.31219/osf.io/jvpbw_v1
    as

    Download full text from publisher

    File URL: https://osf.io/download/675616d36b40ab0a3f661619/
    Download Restriction: no

    File URL: https://libkey.io/10.31219/osf.io/jvpbw_v1?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Richard Van Noorden, 2022. "How language-generation AIs could transform science," Nature, Nature, vol. 605(7908), pages 21-21, May.
    2. Laura K. Nelson & Derek Burk & Marcel Knudsen & Leslie McCall, 2021. "The Future of Coding: A Comparison of Hand-Coding and Three Types of Computer-Assisted Text Analysis Methods," Sociological Methods & Research, , vol. 50(1), pages 202-237, February.
    3. Nikolitsa Grigoropoulou & Mario L. Small, 2022. "The data revolution in social science needs qualitative research," Nature Human Behaviour, Nature, vol. 6(7), pages 904-906, July.
    4. Freese, Jeremy & Peterson, David, 2017. "Replication in Social Science," SocArXiv 5bck9, Center for Open Science.
    5. Bart Bonikowski & Laura K. Nelson, 2022. "From Ends to Means: The Promise of Computational Text Analysis for Theoretically Driven Sociological Research," Sociological Methods & Research, , vol. 51(4), pages 1469-1483, November.
    6. Laura K. Nelson, 2020. "Computational Grounded Theory: A Methodological Framework," Sociological Methods & Research, , vol. 49(1), pages 3-42, February.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Alex Luscombe & Kevin Dick & Kevin Walby, 2022. "Algorithmic thinking in the public interest: navigating technical, legal, and ethical hurdles to web scraping in the social sciences," Quality & Quantity: International Journal of Methodology, Springer, vol. 56(3), pages 1023-1044, June.
    2. AJ Alvero & Jasmine Pal & Katelyn M. Moussavian, 2022. "Linguistic, cultural, and narrative capital: computational and human readings of transfer admissions essays," Journal of Computational Social Science, Springer, vol. 5(2), pages 1709-1734, November.
    3. Bernhardt, Lea & Dewenter, Ralf & Thomas, Tobias, 2023. "Measuring partisan media bias in US newscasts from 2001 to 2012," European Journal of Political Economy, Elsevier, vol. 78(C).
    4. Ho-Chun Herbert Chang & Brooke Harrington & Feng Fu & Daniel Rockmore, 2023. "Complex Systems of Secrecy: The Offshore Networks of Oligarchs," Papers 2303.03371, arXiv.org.
    5. Franz Neuberger & Martin Bujard & Tobias Rüttenauer, 2022. "Where does public childcare boost female labor force participation? Exploring geographical heterogeneity across Germany 2007–2017," Demographic Research, Max Planck Institute for Demographic Research, Rostock, Germany, vol. 46(24), pages 693-722.
    6. Fanelli, Daniele, 2020. "Metascientific reproducibility patterns revealed by informatic measure of knowledge," MetaArXiv 5vnhj, Center for Open Science.
    7. Stijn Daenekindt & Julian Schaap, 2022. "Using word embedding models to capture changing media discourses: a study on the role of legitimacy, gender and genre in 24,000 music reviews, 1999–2021," Journal of Computational Social Science, Springer, vol. 5(2), pages 1615-1636, November.
    8. Özgür Özvatan & Bastian Neuhauser & Gökçe Yurdakul, 2023. "The ‘Arab Clans’ Discourse: Narrating Racialization, Kinship, and Crime in the German Media," Social Sciences, MDPI, vol. 12(2), pages 1-18, February.
    9. Martin Kreidl & Zuzana Žilinčíková, 2023. "Adult children’s union type and contact with mothers: A replication," Demographic Research, Max Planck Institute for Demographic Research, Rostock, Germany, vol. 48(23), pages 641-680.
    10. Fanelli, Daniele, 2022. "The "Tau" of Science - How to Measure, Study, and Integrate Quantitative and Qualitative Knowledge," MetaArXiv 67sak_v1, Center for Open Science.
    11. Julian Ashwin & Aditya Chhabra & Vijayendra Rao, 2023. "Using Large Language Models for Qualitative Analysis can Introduce Serious Bias," Papers 2309.17147, arXiv.org, revised Oct 2023.
    12. Scholdra, Thomas P. & Wichmann, Julian R.K. & Reinartz, Werner J., 2023. "Reimagining personalization in the physical store," Journal of Retailing, Elsevier, vol. 99(4), pages 563-579.
    13. repec:hal:journl:hal-04907529 is not listed on IDEAS
    14. Fišar, Miloš & Greiner, Ben & Huber, Christoph & Katok, Elena & Ozkes, Ali & Management Science Reproducibility Collaboration, 2023. "Reproducibility in Management Science," Department for Strategy and Innovation Working Paper Series 03/2023, WU Vienna University of Economics and Business.
    15. van Loon, Austin, 2022. "Three Families of Automated Text Analysis," SocArXiv htnej, Center for Open Science.
    16. Thompson, Phillip S. & Klotz, Anthony C., 2022. "Led by curiosity and responding with voice: The influence of leader displays of curiosity and leader gender on follower reactions of psychological safety and voice," Organizational Behavior and Human Decision Processes, Elsevier, vol. 172(C).
    17. Daniel T. L. Shek & Diya Dou & Xiaoqin Zhu & Xiang Li & Lindan Tan, 2022. "Materialism, Egocentrism and Delinquent Behavior in Chinese Adolescents in Mainland China: A Short-Term Longitudinal Study," IJERPH, MDPI, vol. 19(8), pages 1-15, April.
    18. Bart Bonikowski & Yuchen Luo & Oscar Stuhler, 2022. "Politics as Usual? Measuring Populism, Nationalism, and Authoritarianism in U.S. Presidential Campaigns (1952–2020) with Neural Language Models," Sociological Methods & Research, , vol. 51(4), pages 1721-1787, November.
    19. Michel Herzig, 2020. "Mediating Factors of Family Structure and Early Home-leaving: A Replication and Extension of van den Berg, Kalmijn, and Leopold (2018)," European Journal of Population, Springer;European Association for Population Studies, vol. 36(4), pages 643-674, September.
    20. Chin, Jason & Zeiler, Kathryn, 2021. "Replicability in Empirical Legal Research," LawArXiv 2b5k4, Center for Open Science.
    21. Stephan Puehringer, 2023. "Wie viel Wettbewerb wollen wir (uns leisten)? Zur Verwettbewerblichung der Universitaeten in Oesterreich und darueber hinaus," ICAE Working Papers 149, Johannes Kepler University, Institute for Comprehensive Analysis of the Economy.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:osf:osfxxx:jvpbw_v1. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: OSF (email available below). General contact details of provider: https://osf.io/preprints/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.