
Erhebung und Nutzung unstrukturierter Daten in den Sozial-, Verhaltens- und Wirtschaftswissenschaften [Collection and Use of Unstructured Data in the Social, Behavioral, and Economic Sciences]

Editor

Listed:
  • Rat für Sozial- und Wirtschaftsdaten RatSWD

Abstract

The increasing digitalization of everyday life in recent decades has produced a range of new data sources for the social, behavioral, and economic sciences. Prominent among these are unstructured data, which are characterized by not being available in a fixed data format and therefore cannot be readily processed for data analysis (e.g., Facebook texts, Instagram images, YouTube videos, Twitter messages). Using unstructured data involves specific challenges, which arise precisely because the data are typically not collected in a controlled scientific study but are often generated in people's natural living environment. Building on the results of an expert workshop, the report describes the specific challenges in collecting and using unstructured data and formulates recommendations. These follow the Total Error Framework and address data generation (definition of units of analysis, coverage and sampling error, nonresponse and missing data error), data preparation (specification error, validity, measurement error, and content error), and data analysis (record linkage and processing error, modeling error, analytic error). Finally, open questions and challenges in research with unstructured data are discussed.

Suggested Citation

  • Rat für Sozial- und Wirtschaftsdaten RatSWD (ed.), 2023. "Erhebung und Nutzung unstrukturierter Daten in den Sozial-, Verhaltens- und Wirtschaftswissenschaften," RatSWD Output Series, German Data Forum (RatSWD), volume 7, number 7-2de, August.
  • Handle: RePEc:rsw:rswout:7-2de
    DOI: https://doi.org/10.17620/02671.73

    Download full text from publisher

    File URL: https://www.konsortswd.de/wp-content/uploads/RatSWD_Output2.7_Unstrukturierte-Daten_2023.pdf
    Download Restriction: no



    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Ankel-Peters, Jörg & Vance, Colin & Bensch, Gunther, 2022. "Spotlight on researcher decisions – Infrastructure evaluation, instrumental variables, and first-stage specification screening," OSF Preprints sw6kd_v1, Center for Open Science.
    2. Eibich, Peter & Goldzahl, Léontine, 2021. "Does retirement affect secondary preventive care use? Evidence from breast cancer screening," Economics & Human Biology, Elsevier, vol. 43(C).
    3. Rose, Julian & Neubauer, Florian & Ankel-Peters, Jörg, 2024. "Long-Term Effects of the Targeting the Ultra-Poor Program - A Reproducibility and Replicability Assessment of Banerjee et al. (2021)," I4R Discussion Paper Series 142, The Institute for Replication (I4R).
    4. Rubin, Mark, 2023. "Type I error rates are not usually inflated," MetaArXiv 3kv2b, Center for Open Science.
    5. Dreber, Anna & Johannesson, Magnus, 2023. "A framework for evaluating reproducibility and replicability in economics," Ruhr Economic Papers 1055, RWI - Leibniz-Institut für Wirtschaftsforschung, Ruhr-University Bochum, TU Dortmund University, University of Duisburg-Essen.
    6. Dorison, Charles A & Lerner, Jennifer S & Heller, Blake H & Rothman, Alexander J & Kawachi, Ichiro I & Wang, Ke & Rees, Vaughan W & Gill, Brian P & Gibbs, Nancy & Ebersole, Charles R & Vally, Zahir & , 2022. "In COVID-19 health messaging, loss framing increases anxiety with little-to-no concomitant benefits : Experimental evidence from 84 countries," Other publications TiSEM 235f67b6-6be5-4061-8693-3, Tilburg University, School of Economics and Management.
    7. Gretton, Jeremy & Roemer, Tobias & Schlüter, Elmar, 2024. "Replication of Hamel & Wilcox-Archuleta (2022): "Black Workers in White Places: Daytime Racial Diversity and White Public Opinion"," I4R Discussion Paper Series 61, The Institute for Replication (I4R), revised 2024.
    8. Mitre-Becerril, David & MacDonald, John M., 2024. "Does urban development influence crime? Evidence from Philadelphia’s new zoning regulations," Journal of Urban Economics, Elsevier, vol. 142(C).
    9. Rubin, Mark, 2024. "Type I Error Rates are Not Usually Inflated," MetaArXiv 3kv2b_v1, Center for Open Science.
    10. Fieberg, Christian & Günther, Steffen & Poddig, Thorsten & Zaremba, Adam, 2024. "Non-standard errors in the cryptocurrency world," International Review of Financial Analysis, Elsevier, vol. 92(C).
    11. Helmers, Viola & van der Werf, Edwin, 2022. "Did the German Aviation Tax Affect Passenger Numbers? New Evidence Employing Difference-in-differences," VfS Annual Conference 2022 (Basel): Big Data in Economics 264118, Verein für Socialpolitik / German Economic Association.
    12. Tran, Nhan, 2024. "Parents' legal status and children's health insurance: Evidence from DACA," MPRA Paper 120173, University Library of Munich, Germany.
    13. Huber, Christoph & Kirchler, Michael, 2023. "Experiments in finance: A survey of historical trends," Journal of Behavioral and Experimental Finance, Elsevier, vol. 37(C).
    14. Slichter, David & Tran, Nhan, 2023. "Do better journals publish better estimates?," MPRA Paper 118433, University Library of Munich, Germany.
    15. Verhagen, Mark D., 2021. "A Pragmatist's Guide to Using Prediction in the Social Sciences," SocArXiv tjkcy_v1, Center for Open Science.
    16. Karsten Hansen & Kanishka Misra & Robert Evan Sanders, 2024. "Uninformed Choices in Perishables," Marketing Science, INFORMS, vol. 43(4), pages 751-777, July.
    17. Bachler, Sebastian & Erhart, Andrea & Holzknecht, Armando, 2023. "Replication Report on Altmann et al. (2022)," I4R Discussion Paper Series 43, The Institute for Replication (I4R).
    18. W. Ben McCartney & John Orellana‐Li & Calvin Zhang, 2024. "Political Polarization Affects Households' Financial Decisions: Evidence from Home Sales," Journal of Finance, American Finance Association, vol. 79(2), pages 795-841, April.
    19. Nikolova, Milena & Cnossen, Femke & Nikolaev, Boris, 2024. "Robots, meaning, and self-determination," Research Policy, Elsevier, vol. 53(5).
    20. Cantone, Giulio Giacomo & Tomaselli, Venera, 2024. "On the Coherence of Composite Indexes: Multiversal Model and Specification Analysis for an Index of Well-Being," MetaArXiv d5y26, Center for Open Science.
