Printed from https://ideas.repec.org/p/osf/osfxxx/jvpbw.html

Ethnography and Machine Learning: Synergies and New Directions

Author

Listed:
  • Abramson, Corey
  • Li, Zhuofan

Abstract

Ethnography (social scientific methods that illuminate how people understand, navigate and shape the real world contexts in which they live their lives) and machine learning (computational techniques that use big data and statistical learning models to perform quantifiable tasks) are each core to contemporary social science. Yet these tools have remained largely separate in practice. This chapter draws on a growing body of scholarship that argues that ethnography and machine learning can be usefully combined, particularly for large comparative studies. Specifically, this paper (a) explains the value (and challenges) of using machine learning alongside qualitative field research for certain types of projects, (b) discusses recent methodological trends to this effect, (c) provides examples that illustrate workflow drawn from several large projects, and (d) concludes with a roadmap for enabling productive coevolution of field methods and machine learning.

Keywords: ethnography, computational social science, qualitative methods, machine learning, natural language processing, large language models, computational ethnography, digital ethnography, big data, research methods, mixed-methods

Suggested Citation

  • Abramson, Corey & Li, Zhuofan, 2024. "Ethnography and Machine Learning: Synergies and New Directions," OSF Preprints jvpbw, Center for Open Science.
  • Handle: RePEc:osf:osfxxx:jvpbw
    DOI: 10.31219/osf.io/jvpbw

    Download full text from publisher

    File URL: https://osf.io/download/675616d36b40ab0a3f661619/
    Download Restriction: no

    File URL: https://libkey.io/10.31219/osf.io/jvpbw?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item

    References listed on IDEAS

    1. Freese, Jeremy & Peterson, David, 2017. "Replication in Social Science," SocArXiv 5bck9, Center for Open Science.
    2. Richard Van Noorden, 2022. "How language-generation AIs could transform science," Nature, Nature, vol. 605(7908), pages 21-21, May.
    3. Bart Bonikowski & Laura K. Nelson, 2022. "From Ends to Means: The Promise of Computational Text Analysis for Theoretically Driven Sociological Research," Sociological Methods & Research, vol. 51(4), pages 1469-1483, November.
    4. Laura K. Nelson & Derek Burk & Marcel Knudsen & Leslie McCall, 2021. "The Future of Coding: A Comparison of Hand-Coding and Three Types of Computer-Assisted Text Analysis Methods," Sociological Methods & Research, vol. 50(1), pages 202-237, February.
    5. Laura K. Nelson, 2020. "Computational Grounded Theory: A Methodological Framework," Sociological Methods & Research, vol. 49(1), pages 3-42, February.
    6. Nikolitsa Grigoropoulou & Mario L. Small, 2022. "The data revolution in social science needs qualitative research," Nature Human Behaviour, Nature, vol. 6(7), pages 904-906, July.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Abramson, Corey & Li, Zhuofan, 2024. "Ethnography and Machine Learning: Synergies and New Directions," OSF Preprints jvpbw_v1, Center for Open Science.
    2. Franz Neuberger & Martin Bujard & Tobias Rüttenauer, 2022. "Where does public childcare boost female labor force participation? Exploring geographical heterogeneity across Germany 2007–2017," Demographic Research, Max Planck Institute for Demographic Research, Rostock, Germany, vol. 46(24), pages 693-722.
    3. Alex Luscombe & Kevin Dick & Kevin Walby, 2022. "Algorithmic thinking in the public interest: navigating technical, legal, and ethical hurdles to web scraping in the social sciences," Quality & Quantity: International Journal of Methodology, Springer, vol. 56(3), pages 1023-1044, June.
    4. Fanelli, Daniele, 2020. "Metascientific reproducibility patterns revealed by informatic measure of knowledge," MetaArXiv 5vnhj, Center for Open Science.
    5. Martin Kreidl & Zuzana Žilinčíková, 2023. "Adult children’s union type and contact with mothers: A replication," Demographic Research, Max Planck Institute for Demographic Research, Rostock, Germany, vol. 48(23), pages 641-680.
    6. Fišar, Miloš & Greiner, Ben & Huber, Christoph & Katok, Elena & Ozkes, Ali & Management Science Reproducibility Collaboration, 2023. "Reproducibility in Management Science," Department for Strategy and Innovation Working Paper Series 03/2023, WU Vienna University of Economics and Business.
    7. Thompson, Phillip S. & Klotz, Anthony C., 2022. "Led by curiosity and responding with voice: The influence of leader displays of curiosity and leader gender on follower reactions of psychological safety and voice," Organizational Behavior and Human Decision Processes, Elsevier, vol. 172(C).
    8. Daniel T. L. Shek & Diya Dou & Xiaoqin Zhu & Xiang Li & Lindan Tan, 2022. "Materialism, Egocentrism and Delinquent Behavior in Chinese Adolescents in Mainland China: A Short-Term Longitudinal Study," IJERPH, MDPI, vol. 19(8), pages 1-15, April.
    9. Michel Herzig, 2020. "Mediating Factors of Family Structure and Early Home-leaving: A Replication and Extension of van den Berg, Kalmijn, and Leopold (2018)," European Journal of Population, Springer;European Association for Population Studies, vol. 36(4), pages 643-674, September.
    10. Stephan Puehringer, 2023. "Wie viel Wettbewerb wollen wir (uns leisten)? Zur Verwettbewerblichung der Universitaeten in Oesterreich und darueber hinaus," ICAE Working Papers 149, Johannes Kepler University, Institute for Comprehensive Analysis of the Economy.
    11. Jack I. Richter & Pankaj C. Patel, 2022. "Impact of the COVID-19 pandemic on the hours lost by self-employed racial minorities: evidence from Brazil," Small Business Economics, Springer, vol. 58(2), pages 769-805, February.
    12. Ankel-Peters, Jörg & Fiala, Nathan & Neubauer, Florian, 2023. "Do economists replicate?," Journal of Economic Behavior & Organization, Elsevier, vol. 212(C), pages 219-232.
    13. Dreber, Anna & Johannesson, Magnus, 2023. "A framework for evaluating reproducibility and replicability in economics," I4R Discussion Paper Series 38, The Institute for Replication (I4R).
    14. Mats Alvesson & Jörgen Sandberg, 2020. "The Problematizing Review: A Counterpoint to Elsbach and Van Knippenberg’s Argument for Integrative Reviews," Journal of Management Studies, Wiley Blackwell, vol. 57(6), pages 1290-1304, September.
    15. Luis Alfonso Dau & Grazia D. Santangelo & Arjen Witteloostuijn, 2022. "Replication studies in international business," Journal of International Business Studies, Palgrave Macmillan;Academy of International Business, vol. 53(2), pages 215-230, March.
    16. Anna Dreber & Magnus Johannesson, 2025. "A framework for evaluating reproducibility and replicability in economics," Economic Inquiry, Western Economic Association International, vol. 63(2), pages 338-356, April.
    17. Yuyan Jiang & Xueli Liu, 2023. "A construction and empirical research of the journal disruption index based on open citation data," Scientometrics, Springer;Akadémiai Kiadó, vol. 128(7), pages 3935-3958, July.
    18. Andrew G.H. Thompson & Oliver Escobar & Jennifer J. Roberts & Stephen Elstub & Niccole M. Pamphilis, 2021. "The Importance of Context and the Effect of Information and Deliberation on Opinion Change Regarding Environmental Issues in Citizens’ Juries," Sustainability, MDPI, vol. 13(17), pages 1-21, September.
    19. Willy Bolander & Nawar N. Chaker & Alec Pappas & Daniel R. Bradbury, 2021. "Operationalizing salesperson performance with secondary data: aligning practice, scholarship, and theory," Journal of the Academy of Marketing Science, Springer, vol. 49(3), pages 462-481, May.
    20. Antonia Krefeld-Schwalb & Benjamin Scheibehenne, 2023. "Tighter nets for smaller fishes? Mapping the development of statistical practices in consumer research between 2008 and 2020," Marketing Letters, Springer, vol. 34(3), pages 351-365, September.
