IDEAS home Printed from https://ideas.repec.org/a/nat/nature/v600y2021i7890d10.1038_s41586-021-04198-4.html
   My bibliography  Save this article

Unrepresentative big surveys significantly overestimated US vaccine uptake

Author

Listed:
  • Valerie C. Bradley

    (University of Oxford)

  • Shiro Kuriwaki

    (Stanford University)

  • Michael Isakov

    (Harvard University)

  • Dino Sejdinovic

    (University of Oxford)

  • Xiao-Li Meng

    (Harvard University)

  • Seth Flaxman

    (University of Oxford)

Abstract

Surveys are a crucial tool for understanding public opinion and behaviour, and their accuracy depends on maintaining statistical representativeness of their target populations by minimizing biases from all sources. Increasing data size shrinks confidence intervals but magnifies the effect of survey bias: an instance of the Big Data Paradox1. Here we demonstrate this paradox in estimates of first-dose COVID-19 vaccine uptake in US adults from 9 January to 19 May 2021 from two large surveys: Delphi–Facebook2,3 (about 250,000 responses per week) and Census Household Pulse4 (about 75,000 every two weeks). In May 2021, Delphi–Facebook overestimated uptake by 17 percentage points (14–20 percentage points with 5% benchmark imprecision) and Census Household Pulse by 14 (11–17 percentage points with 5% benchmark imprecision), compared to a retroactively updated benchmark the Centers for Disease Control and Prevention published on 26 May 2021. Moreover, their large sample sizes led to miniscule margins of error on the incorrect estimates. By contrast, an Axios–Ipsos online panel5 with about 1,000 responses per week following survey research best practices6 provided reliable estimates and uncertainty quantification. We decompose observed error using a recent analytic framework1 to explain the inaccuracy in the three surveys. We then analyse the implications for vaccine hesitancy and willingness. We show how a survey of 250,000 respondents can produce an estimate of the population mean that is no more accurate than an estimate from a simple random sample of size 10. Our central message is that data quality matters more than data quantity, and that compensating the former with the latter is a mathematically provable losing proposition.

Suggested Citation

  • Valerie C. Bradley & Shiro Kuriwaki & Michael Isakov & Dino Sejdinovic & Xiao-Li Meng & Seth Flaxman, 2021. "Unrepresentative big surveys significantly overestimated US vaccine uptake," Nature, Nature, vol. 600(7890), pages 695-700, December.
  • Handle: RePEc:nat:nature:v:600:y:2021:i:7890:d:10.1038_s41586-021-04198-4
    DOI: 10.1038/s41586-021-04198-4
    as

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41586-021-04198-4
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1038/s41586-021-04198-4?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Gazmararian, Alexander F., 2024. "Fossil fuel communities support climate policy coupled with just transition assistance," Energy Policy, Elsevier, vol. 184(C).
    2. Bussemakers, Carlijn & van Dijk, Mart & Dima, Alexandra L. & de Bruin, Marijn, 2023. "How well do surveys on adherence to pandemic policies assess actual behaviour: Measurement properties of the Dutch COVID-19 adherence to prevention advice survey (CAPAS)," Social Science & Medicine, Elsevier, vol. 339(C).
    3. He, Xin & Mao, Xiaojun & Wang, Zhonglei, 2024. "Nonparametric augmented probability weighting with sparsity," Computational Statistics & Data Analysis, Elsevier, vol. 191(C).
    4. Tesary Lin & Avner Strulov-Shlain, 2023. "Choice Architecture, Privacy Valuations, and Selection Bias in Consumer Data," Papers 2308.13496, arXiv.org.
    5. Camilla Salvatore, 2023. "Inference with non-probability samples and survey data integration: a science mapping study," METRON, Springer;Sapienza Università di Roma, vol. 81(1), pages 83-107, April.
    6. Stoler, Justin & Klofstad, Casey A. & Enders, Adam M. & Uscinski, Joseph E., 2022. "Sociopolitical and psychological correlates of COVID-19 vaccine hesitancy in the United States during summer 2021," Social Science & Medicine, Elsevier, vol. 306(C).
    7. Cameron Deal & Shea Greenberg & Gilbert Gonzales, 2024. "Sexual identity, poverty, and utilization of government services," Journal of Population Economics, Springer;European Society for Population Economics, vol. 37(2), pages 1-31, June.
    8. Jonathan Eggleston & Carl Lieberman, 2024. "Nonresponse and Coverage Bias in the Household Pulse Survey: Evidence from Administrative Data," Working Papers 24-60, Center for Economic Studies, U.S. Census Bureau.
    9. Avinash Collis & Kiran Garimella & Alex Moehring & M. Amin Rahimian & Stella Babalola & Nina H. Gobat & Dominick Shattuck & Jeni Stolow & Sinan Aral & Dean Eckles, 2022. "Global survey on COVID-19 beliefs, behaviours and norms," Nature Human Behaviour, Nature, vol. 6(9), pages 1310-1317, September.
    10. Nelson, Victoria & Bashyal, Bidhan & Tan, Pang-Ning & Argyris, Young Anna, 2024. "Vaccine rhetoric on social media and COVID-19 vaccine uptake rates: A triangulation using self-reported vaccine acceptance," Social Science & Medicine, Elsevier, vol. 348(C).

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:nature:v:600:y:2021:i:7890:d:10.1038_s41586-021-04198-4. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.