IDEAS home Printed from https://ideas.repec.org/a/gam/jijerp/v17y2020i3p864-d314348.html
   My bibliography  Save this article

The Story of Goldilocks and Three Twitter’s APIs: A Pilot Study on Twitter Data Sources and Disclosure

Author

Listed:
  • Yoonsang Kim

    (Social Data Collaboratory, Public Health, NORC at the University of Chicago, Chicago, IL 60603, USA)

  • Rachel Nordgren

    (Biostatistics, School of Public Health, University of Illinois at Chicago, Chicago, IL 60612, USA)

  • Sherry Emery

    (Social Data Collaboratory, Public Health, NORC at the University of Chicago, Chicago, IL 60603, USA)

Abstract

Public health and social science increasingly use Twitter for behavioral and marketing surveillance. However, few studies provide sufficient detail about Twitter data collection to allow either direct comparisons between studies or to support replication. The three primary application programming interfaces (API) of Twitter data sources are Streaming, Search, and Firehose. To date, no clear guidance exists about the advantages and limitations of each API, or about the comparability of the amount, content, and user accounts of retrieved tweets from each API. Such information is crucial to the validity, interpretation, and replicability of research findings. This study examines whether tweets collected using the same search filters over the same time period, but calling different APIs, would retrieve comparable datasets. We collected tweets about anti-smoking, e-cigarettes, and tobacco using the aforementioned APIs. The retrieved tweets largely overlapped between three APIs, but each also retrieved unique tweets, and the extent of overlap varied over time and by topic, resulting in different trends and potentially supporting diverging inferences. Researchers need to understand how different data sources can influence both the amount, content, and user accounts of data they retrieve from social media, in order to assess the implications of their choice of data source.

Suggested Citation

  • Yoonsang Kim & Rachel Nordgren & Sherry Emery, 2020. "The Story of Goldilocks and Three Twitter’s APIs: A Pilot Study on Twitter Data Sources and Disclosure," IJERPH, MDPI, vol. 17(3), pages 1-15, January.
  • Handle: RePEc:gam:jijerp:v:17:y:2020:i:3:p:864-:d:314348
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/1660-4601/17/3/864/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/1660-4601/17/3/864/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Robert M. Bond & Christopher J. Fariss & Jason J. Jones & Adam D. I. Kramer & Cameron Marlow & Jaime E. Settle & James H. Fowler, 2012. "A 61-million-person experiment in social influence and political mobilization," Nature, Nature, vol. 489(7415), pages 295-298, September.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Sanaz Ghorbanloo & Sajjad Shokouhyar, 2023. "Consumers' attitude footprint on sustainable development in developed and developing countries: a case study in the electronic industry," Operations Management Research, Springer, vol. 16(3), pages 1444-1475, September.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Johnson, Nathan & Turnbull, Benjamin & Reisslein, Martin, 2022. "Social media influence, trust, and conflict: An interview based study of leadership perceptions," Technology in Society, Elsevier, vol. 68(C).
    2. Alan Gerber & Mitchell Hoffman & John Morgan & Collin Raymond, 2020. "One in a Million: Field Experiments on Perceived Closeness of the Election and Voter Turnout," American Economic Journal: Applied Economics, American Economic Association, vol. 12(3), pages 287-325, July.
    3. Kenju Kamei & Louis Putterman & Jean-Robert Tyran, 2019. "Civic Engagement as a Second-Order Public Good: The Cooperative Underpinnings of the Accountable State," Discussion Papers 19-10, University of Copenhagen. Department of Economics.
    4. Ruyi Ge & Juan Feng & Bin Gu, 2016. "Borrower’s default and self-disclosure of social media information in P2P lending," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 2(1), pages 1-6, December.
    5. Jiang, Lincheng & Zhao, Xiang & Ge, Bin & Xiao, Weidong & Ruan, Yirun, 2019. "An efficient algorithm for mining a set of influential spreaders in complex networks," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 516(C), pages 58-65.
    6. Kristina Gavin Bigsby & Jeffrey W. Ohlmann & Kang Zhao, 2017. "Online and Off the Field: Predicting School Choice in College Football Recruiting from Social Media Data," Decision Analysis, INFORMS, vol. 14(4), pages 261-273, December.
    7. Yann Algan & Quoc-Anh Do & Nicolò Dalvit & Alexis Le Chapelain & Yves Zenou, 2015. "How Social Networks Shape Our Beliefs: A Natural Experiment among Future French Politicians," Working Papers hal-03459820, HAL.
    8. Mechtenberg, Lydia & Tyran, Jean-Robert, 2019. "Voter motivation and the quality of democratic choice," Games and Economic Behavior, Elsevier, vol. 116(C), pages 241-259.
    9. Daniele Barchiesi & Helen Susannah Moat & Christian Alis & Steven Bishop & Tobias Preis, 2015. "Quantifying International Travel Flows Using Flickr," PLOS ONE, Public Library of Science, vol. 10(7), pages 1-8, July.
    10. repec:spo:wpmain:info:hdl:2441/78vacv4udu92eq3fec89svm9uv is not listed on IDEAS
    11. Carolin V. Zorell, 2020. "Nudges, Norms, or Just Contagion? A Theory on Influences on the Practice of (Non-)Sustainable Behavior," Sustainability, MDPI, vol. 12(24), pages 1-21, December.
    12. Julian Freitag & Anna Kerkhof & Johannes Münster, 2021. "Selective sharing of news items and the political position of news outlets," ECONtribute Discussion Papers Series 056, University of Bonn and University of Cologne, Germany.
    13. Paul Resnick & Eytan Adar & Cliff Lampe, 2015. "What Social Media Data We Are Missing and How to Get It," The ANNALS of the American Academy of Political and Social Science, , vol. 659(1), pages 192-206, May.
    14. Donati, Dante, 2023. "Mobile Internet access and political outcomes: Evidence from South Africa," Journal of Development Economics, Elsevier, vol. 162(C).
    15. Yuho Chung & Yiwei Li & Jianmin Jia, 2021. "Exploring embeddedness, centrality, and social influence on backer behavior: the role of backer networks in crowdfunding," Journal of the Academy of Marketing Science, Springer, vol. 49(5), pages 925-946, September.
    16. David Ehrlich & Nora Szech, 2022. "How To Start a Grassroots Movement," Papers 2209.01345, arXiv.org.
    17. Liberini, Federica & Redoano, Michela & Russo, Antonio & Cuevas, Angel & Cuevas, Ruben, 2018. "Politics in the Facebook Era Evidence from the 2016 US Presidential Elections," CAGE Online Working Paper Series 389, Competitive Advantage in the Global Economy (CAGE).
    18. Mohammad Abouei Mehrizi & Federico Corò & Emilio Cruciani & Gianlorenzo D’Angelo, 2022. "Election control through social influence with voters’ uncertainty," Journal of Combinatorial Optimization, Springer, vol. 44(1), pages 635-669, August.
    19. Sun, Jiachen & Feng, Ling & Du, Mingwei & Ma, Xiao & Fan, Zhengping & Gloor, Peter & Hu, Yanqing, 2021. "Ultra-efficient information detection on large-scale online social networks," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 581(C).
    20. repec:hal:spmain:info:hdl:2441/78vacv4udu92eq3fec89svm9uv is not listed on IDEAS
    21. Liu, Jian-Guo & Li, Ren-De & Guo, Qiang & Zhang, Yi-Cheng, 2018. "Collective iteration behavior for online social networks," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 499(C), pages 490-497.
    22. Sabrina Stöckli & Doris Hofer, 2020. "Susceptibility to social influence predicts behavior on Facebook," PLOS ONE, Public Library of Science, vol. 15(3), pages 1-20, March.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jijerp:v:17:y:2020:i:3:p:864-:d:314348. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.