Algorithmic thinking in the public interest: navigating technical, legal, and ethical hurdles to web scraping in the social sciences
DOI: 10.1007/s11135-021-01164-0
References listed on IDEAS
- Grimmer, Justin, 2010. "A Bayesian Hierarchical Topic Model for Political Texts: Measuring Expressed Agendas in Senate Press Releases," Political Analysis, Cambridge University Press, vol. 18(1), pages 1-35, January.
- Alberto Cavallo, 2018. "Scraped Data and Sticky Prices," The Review of Economics and Statistics, MIT Press, vol. 100(1), pages 105-119, March.
- Alberto Cavallo, 2015. "Scraped Data and Sticky Prices," NBER Working Papers 21490, National Bureau of Economic Research, Inc.
- Laura K. Nelson & Derek Burk & Marcel Knudsen & Leslie McCall, 2021. "The Future of Coding: A Comparison of Hand-Coding and Three Types of Computer-Assisted Text Analysis Methods," Sociological Methods & Research, vol. 50(1), pages 202-237, February.
- Marc Keuschnigg & Niclas Lovsjö & Peter Hedström, 2018. "Analytical sociology and computational social science," Journal of Computational Social Science, Springer, vol. 1(1), pages 3-14, January.
- Ulbricht, Lena, 2020. "Scraping the demos. Digitalization, web scraping and the democratic project," EconStor Open Access Articles and Book Chapters, ZBW - Leibniz Information Centre for Economics, vol. 27(3), pages 426-442.
- Margaret E. Roberts & Brandon M. Stewart & Dustin Tingley & Christopher Lucas & Jetson Leder‐Luis & Shana Kushner Gadarian & Bethany Albertson & David G. Rand, 2014. "Structural Topic Models for Open‐Ended Survey Responses," American Journal of Political Science, John Wiley & Sons, vol. 58(4), pages 1064-1082, October.
- Boeing, Geoff, 2017. "New Insights into Rental Housing Markets across the United States: Web Scraping and Analyzing Craigslist Rental Listings," SocArXiv v54w4, Center for Open Science.
- Nina Cesare & Hedwig Lee & Tyler McCormick & Emma Spiro & Emilio Zagheni, 2018. "Promises and Pitfalls of Using Digital Traces for Demographic Research," Demography, Springer;Population Association of America (PAA), vol. 55(5), pages 1979-1999, October.
- Noortje Marres & Esther Weltevrede, 2013. "Scraping The Social?," Journal of Cultural Economy, Taylor & Francis Journals, vol. 6(3), pages 313-335, August.
- Lin Qiu & Sarah Hian May Chan & David Chan, 2018. "Big data in social and psychological science: theoretical and methodological issues," Journal of Computational Social Science, Springer, vol. 1(1), pages 59-66, January.
- Dustin S. Stoltz & Marshall A. Taylor, 2019. "Concept Mover’s Distance: measuring concept engagement via word embeddings in texts," Journal of Computational Social Science, Springer, vol. 2(2), pages 293-313, July.
- Feng Shi & Yongren Shi & Fedor A. Dokshin & James A. Evans & Michael W. Macy, 2017. "Millions of online book co-purchases reveal partisan differences in the consumption of science," Nature Human Behaviour, Nature, vol. 1(4), pages 1-9, April.
- Georg von Krogh & Eric von Hippel, 2006. "The Promise of Research on Open Source Software," Management Science, INFORMS, vol. 52(7), pages 975-983, July.
- Laura K. Nelson, 2020. "Computational Grounded Theory: A Methodological Framework," Sociological Methods & Research, vol. 49(1), pages 3-42, February.
- Gavin Abercrombie & Riza Batista-Navarro, 2020. "Sentiment and position-taking analysis of parliamentary debates: a systematic literature review," Journal of Computational Social Science, Springer, vol. 3(1), pages 245-270, April.
Citations
Cited by:
- Paola Beccherle & Luciana Lazzeretti & Stefania Oliva, 2024. "Exploring the Interplay of Museum and City Reputation: Insights from the Uffizi Case Study," Working Papers - Business wp2024_02.rdf, Universita' degli Studi di Firenze, Dipartimento di Scienze per l'Economia e l'Impresa.
- Tobias Blanke, 2024. "Reassembling digital archives—strategies for counter-archiving," Palgrave Communications, Palgrave Macmillan, vol. 11(1), pages 1-12, December.
- Potter, Andrew & Soroka, Anthony & Naim, Mohamed, 2022. "Regional resilience for rail freight transport," Journal of Transport Geography, Elsevier, vol. 104(C).
Most related items
These are the items that most often cite the same works as this one and are cited by the same works as this one.
- AJ Alvero & Jasmine Pal & Katelyn M. Moussavian, 2022. "Linguistic, cultural, and narrative capital: computational and human readings of transfer admissions essays," Journal of Computational Social Science, Springer, vol. 5(2), pages 1709-1734, November.
- Damani K. White-Lewis & KerryAnn O’Meara & Kiernan Mathews & Nicholas Havey, 2023. "Leaving the Institution or Leaving the Academy? Analyzing the Factors that Faculty Weigh in Actual Departure Decisions," Research in Higher Education, Springer;Association for Institutional Research, vol. 64(3), pages 473-494, May.
- Mónica D. Oliveira & Inês Mataloto & Panos Kanavos, 2019. "Multi-criteria decision analysis for health technology assessment: addressing methodological challenges to improve the state of the art," The European Journal of Health Economics, Springer;Deutsche Gesellschaft für Gesundheitsökonomie (DGGÖ), vol. 20(6), pages 891-918, August.
- Oliveira, Mónica D. & Mataloto, Inês & Kanavos, Panos, 2019. "Multi-criteria decision analysis for health technology assessment: addressing methodological challenges to improve the state of the art," LSE Research Online Documents on Economics 100763, London School of Economics and Political Science, LSE Library.
- Stijn Daenekindt & Julian Schaap, 2022. "Using word embedding models to capture changing media discourses: a study on the role of legitimacy, gender and genre in 24,000 music reviews, 1999–2021," Journal of Computational Social Science, Springer, vol. 5(2), pages 1615-1636, November.
- Christopher Wratil & Sara B Hobolt, 2019. "Public deliberations in the Council of the European Union: Introducing and validating DICEU," European Union Politics, vol. 20(3), pages 511-531, September.
- Nils Augustin & Andreas Eckhardt & Alexander Willem Jong, 2023. "Understanding decentralized autonomous organizations from the inside," Electronic Markets, Springer;IIM University of St. Gallen, vol. 33(1), pages 1-14, December.
- Michal Ovádek & Nicolas Lampach & Arthur Dyevre, 2020. "What’s the talk in Brussels? Leveraging daily news coverage to measure issue attention in the European Union," European Union Politics, vol. 21(2), pages 204-232, June.
- Jennifer Pan & Margaret E. Roberts, 2020. "Censorship’s Effect on Incidental Exposure to Information: Evidence From Wikipedia," SAGE Open, vol. 10(1), pages 21582440198, February.
- Sanders, James & Lisi, Giulio & Schonhardt-Bailey, Cheryl, 2018. "Themes and topics in parliamentary oversight hearings: a new direction in textual data analysis," LSE Research Online Documents on Economics 87624, London School of Economics and Political Science, LSE Library.
- Yuriy Gorodnichenko & Viacheslav Sheremirov & Oleksandr Talavera, 2018. "Price Setting in Online Markets: Does IT Click?," Journal of the European Economic Association, European Economic Association, vol. 16(6), pages 1764-1811.
- Yuriy Gorodnichenko & Viacheslav Sheremirov & Oleksandr Talavera, 2014. "Price Setting in Online Markets: Does IT Click?," NBER Working Papers 20819, National Bureau of Economic Research, Inc.
- Yuriy Gorodnichenko & Viacheslav Sheremirov & Oleksandr Talavera, 2015. "Price setting in online markets: does IT click?," Working Papers 15-1, Federal Reserve Bank of Boston.
- Bernhardt, Lea & Dewenter, Ralf & Thomas, Tobias, 2023. "Measuring partisan media bias in US newscasts from 2001 to 2012," European Journal of Political Economy, Elsevier, vol. 78(C).
- Bernhardt, Lea & Dewenter, Ralf & Thomas, Tobias, 2020. "Measuring partisan media bias in US Newscasts from 2001-2012," Working Paper 183/2020, Helmut Schmidt University, Hamburg, revised 15 Nov 2022.
- Sheedy, Kevin D., 2010. "Intrinsic inflation persistence," Journal of Monetary Economics, Elsevier, vol. 57(8), pages 1049-1061, November.
- Sheedy, Kevin D., 2007. "Intrinsic inflation persistence," LSE Research Online Documents on Economics 3739, London School of Economics and Political Science, LSE Library.
- Kevin D. Sheedy, 2007. "Intrinsic Inflation Persistence," CEP Discussion Papers dp0837, Centre for Economic Performance, LSE.
- Magnus Schückes & Tobias Gutmann, 2021. "Why do startups pursue initial coin offerings (ICOs)? The role of economic drivers and social identity on funding choice," Small Business Economics, Springer, vol. 57(2), pages 1027-1052, August.
- Arthur Schram & Boris Van Leeuwen & Theo Offerman, 2013. "Superstars Need Social Benefits: An Experiment on Network Formation," Working Papers 1306, Departament Empresa, Universitat Autònoma de Barcelona, revised Jul 2013.
- Boris van Leeuwen & Theo Offerman & Arthur Schram, 2013. "Superstars need Social Benefits: An Experiment on Network Formation," Tinbergen Institute Discussion Papers 13-112/I, Tinbergen Institute.
- Sandra Wankmüller, 2023. "A comparison of approaches for imbalanced classification problems in the context of retrieving relevant documents for an analysis," Journal of Computational Social Science, Springer, vol. 6(1), pages 91-163, April.
- Jurić Tado, 2022. "Forecasting Migration and Integration Trends Using Digital Demography – A Case Study of Emigration Flows from Croatia to Austria and Germany," Comparative Southeast European Studies, De Gruyter, vol. 70(1), pages 125-152, March.
- McCannon, Bryan & Zhou, Yang & Hall, Joshua, 2021. "Measuring a Contract’s Breadth: A Text Analysis," Working Papers 11013, George Mason University, Mercatus Center.
- Everett, Jeff & Shiraz Rahaman, Abu & Neu, Dean & Saxton, Gregory, 2024. "Letters to the editor, institutional experimentation, and the public accounting professional," Critical Perspectives on Accounting, Elsevier, vol. 99(C).
- Minchul Lee & Min Song, 2020. "Incorporating citation impact into analysis of research trends," Scientometrics, Springer;Akadémiai Kiadó, vol. 124(2), pages 1191-1224, August.
- Grajzl, Peter & Murrell, Peter, 2021. "A machine-learning history of English caselaw and legal ideas prior to the Industrial Revolution I: generating and interpreting the estimates," Journal of Institutional Economics, Cambridge University Press, vol. 17(1), pages 1-19, February.
- Peter Grajzl & Peter Murrell, 2020. "A Machine-Learning History of English Caselaw and Legal Ideas Prior to the Industrial Revolution I: Generating and Interpreting the Estimates," CESifo Working Paper Series 8774, CESifo.
More about this item
Keywords
Web scraping; Digital methods; Law; Ethics; Algorithmic thinking; Access to information; Social science research
Handle: RePEc:spr:qualqt:v:56:y:2022:i:3:d:10.1007_s11135-021-01164-0
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact Sonal Shukla or Springer Nature Abstracting and Indexing. General contact details of provider: http://www.springer.com