IDEAS home Printed from https://ideas.repec.org/a/nas/journl/v120y2023pe2220558120.html
   My bibliography  Save this article

An in-depth examination of requirements for disclosure risk assessment

Author

Listed:
  • Ron S. Jarmin

    (a U.S. Census Bureau, Office of the Deputy Director , Washington , DC 20233)

  • John M. Abowd

    (b Department of Economics , Cornell University , Ithaca , NY 14853)

  • Robert Ashmead

    (a U.S. Census Bureau, Office of the Deputy Director , Washington , DC 20233)

  • Ryan Cumings-Menon

    (a U.S. Census Bureau, Office of the Deputy Director , Washington , DC 20233)

  • Nathan Goldschlag

    (a U.S. Census Bureau, Office of the Deputy Director , Washington , DC 20233)

  • Michael B. Hawes

    (a U.S. Census Bureau, Office of the Deputy Director , Washington , DC 20233)

  • Sallie Ann Keller

    (c Biocomplexity Institute , University of Virginia , Charlottesville , VA 22904)

  • Daniel Kifer

    (d Department of Computer Science and Engineering , Penn State University , University Park , PA 16802)

  • Philip Leclerc

    (a U.S. Census Bureau, Office of the Deputy Director , Washington , DC 20233)

  • Jerome P. Reiter

    (e Department of Statistical Science , Duke University , Durham , NC 27708)

  • Rolando A. Rodríguez

    (a U.S. Census Bureau, Office of the Deputy Director , Washington , DC 20233)

  • Ian Schmutte

    (f Department of Economics , University of Georgia , Athens , GA 30602)

  • Victoria A. Velkoff

    (a U.S. Census Bureau, Office of the Deputy Director , Washington , DC 20233)

  • Pavel Zhuravlev

    (a U.S. Census Bureau, Office of the Deputy Director , Washington , DC 20233)

Abstract

The use of formal privacy to protect the confidentiality of responses in the 2020 Decennial Census of Population and Housing has triggered renewed interest and debate over how to measure the disclosure risks and societal benefits of the published data products. We argue that any proposal for quantifying disclosure risk should be based on prespecified, objective criteria. We illustrate this approach to evaluate the absolute disclosure risk framework, the counterfactual framework underlying differential privacy, and prior-to-posterior comparisons. We conclude that satisfying all the desiderata is impossible, but counterfactual comparisons satisfy the most while absolute disclosure risk satisfies the fewest. Furthermore, we explain that many of the criticisms levied against differential privacy would be levied against any technology that is not equivalent to direct, unrestricted access to confidential data. More research is needed, but in the near term, the counterfactual approach appears best-suited for privacy versus utility analysis.

Suggested Citation

  • Ron S. Jarmin & John M. Abowd & Robert Ashmead & Ryan Cumings-Menon & Nathan Goldschlag & Michael B. Hawes & Sallie Ann Keller & Daniel Kifer & Philip Leclerc & Jerome P. Reiter & Rolando A. Rodrígue, 2023. "An in-depth examination of requirements for disclosure risk assessment," Proceedings of the National Academy of Sciences, Proceedings of the National Academy of Sciences, vol. 120(43), pages 2220558120-, October.
  • Handle: RePEc:nas:journl:v:120:y:2023:p:e2220558120
    DOI: 10.1073/pnas.2220558120
    as

    Download full text from publisher

    File URL: https://doi.org/10.1073/pnas.2220558120
    Download Restriction: no

    File URL: https://libkey.io/10.1073/pnas.2220558120?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    Other versions of this item:

    References listed on IDEAS

    as
    1. Raj Chetty & Matthew O. Jackson & Theresa Kuchler & Johannes Stroebel & Nathaniel Hendren & Robert B. Fluegge & Sara Gong & Federico Gonzalez & Armelle Grondin & Matthew Jacob & Drew Johnston & Martin, 2022. "Social capital I: measurement and associations with economic mobility," Nature, Nature, vol. 608(7921), pages 108-121, August.
    2. Luc Rocher & Julien M. Hendrickx & Yves-Alexandre de Montjoye, 2019. "Estimating the success of re-identifications in incomplete datasets using generative models," Nature Communications, Nature, vol. 10(1), pages 1-9, December.
    3. Raj Chetty & John N. Friedman, 2019. "A Practical Method to Reduce Privacy Loss When Disclosing Statistics Based on Small Samples," AEA Papers and Proceedings, American Economic Association, vol. 109, pages 414-420, May.
    4. Muralidhar Krishnamurty & Domingo-Ferrer Josep, 2023. "Database Reconstruction Is Not So Easy and Is Different from Reidentification," Journal of Official Statistics, Sciendo, vol. 39(3), pages 381-398, September.
    5. Satkartar K. Kinney & Jerome P. Reiter & Arnold P. Reznek & Javier Miranda & Ron S. Jarmin & John M. Abowd, 2011. "Towards Unrestricted Public Use Business Microdata: The Synthetic Longitudinal Business Database," International Statistical Review, International Statistical Institute, vol. 79(3), pages 362-384, December.
    6. Reiter, Jerome P., 2005. "Estimating Risks of Identification Disclosure in Microdata," Journal of the American Statistical Association, American Statistical Association, vol. 100, pages 1103-1112, December.
    7. John M. Abowd & Ian M. Schmutte, 2019. "An Economic Analysis of Privacy Protection and Statistical Accuracy as Social Choices," American Economic Review, American Economic Association, vol. 109(1), pages 171-202, January.
    8. Drechsler, Jörg & Reiter, Jerome P., 2010. "Sampling With Synthesis: A New Approach for Releasing Public Use Census Microdata," Journal of the American Statistical Association, American Statistical Association, vol. 105(492), pages 1347-1357.
    9. Steven Ruggles & David Riper, 2022. "The Role of Chance in the Census Bureau Database Reconstruction Experiment," Population Research and Policy Review, Springer;Southern Demographic Association (SDA), vol. 41(3), pages 781-788, June.
    10. Wasserman, Larry & Zhou, Shuheng, 2010. "A Statistical Framework for Differential Privacy," Journal of the American Statistical Association, American Statistical Association, vol. 105(489), pages 375-389.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. John M. Abowd & Ian M. Schmutte & William Sexton & Lars Vilhuber, 2019. "Suboptimal Provision of Privacy and Statistical Accuracy When They are Public Goods," Papers 1906.09353, arXiv.org.
    2. Braathen, Christian & Thorsen, Inge & Ubøe, Jan, 2022. "Adjusting for Cell Suppression in Commuting Trip Data," Discussion Papers 2022/13, Norwegian School of Economics, Department of Business and Management Science.
    3. Raj Chetty & John N. Friedman, 2019. "A Practical Method to Reduce Privacy Loss When Disclosing Statistics Based on Small Samples," AEA Papers and Proceedings, American Economic Association, vol. 109, pages 414-420, May.
    4. John M. Abowd & Robert Ashmead & Ryan Cumings-Menon & Simson Garfinkel & Micah Heineck & Christine Heiss & Robert Johns & Daniel Kifer & Philip Leclerc & Ashwin Machanavajjhala & Brett Moran & William, 2022. "The 2020 Census Disclosure Avoidance System TopDown Algorithm," Papers 2204.08986, arXiv.org.
    5. Rehse, Dominik & Tremöhlen, Felix, 2020. "Fostering participation in digital public health interventions: The case of digital contact tracing," ZEW Discussion Papers 20-076, ZEW - Leibniz Centre for European Economic Research.
    6. Michler, Jeffrey D. & Josephson, Anna & Kilic, Talip & Murray, Siobhan, 2022. "Privacy protection, measurement error, and the integration of remote sensing and socioeconomic survey data," Journal of Development Economics, Elsevier, vol. 158(C).
    7. Ori Heffetz & Katrina Ligett, 2014. "Privacy and Data-Based Research," Journal of Economic Perspectives, American Economic Association, vol. 28(2), pages 75-98, Spring.
    8. Vilhuber, Lars, 2023. "Reproducibility and transparency versus privacy and confidentiality: Reflections from a data editor," Journal of Econometrics, Elsevier, vol. 235(2), pages 2285-2294.
    9. Craig Wesley Carpenter & Anders Van Sandt & Scott Loveridge, 2022. "Measurement error in US regional economic data," Journal of Regional Science, Wiley Blackwell, vol. 62(1), pages 57-80, January.
    10. Wieringa, Jaap & Kannan, P.K. & Ma, Xiao & Reutterer, Thomas & Risselada, Hans & Skiera, Bernd, 2021. "Data analytics in a privacy-concerned world," Journal of Business Research, Elsevier, vol. 122(C), pages 915-925.
    11. Jörg Drechsler, 2015. "Multiple Imputation of Multilevel Missing Data—Rigor Versus Simplicity," Journal of Educational and Behavioral Statistics, , vol. 40(1), pages 69-95, February.
    12. Javier Miranda & Lars Vilhuber, 2016. "Using Partially Synthetic Microdata to Protect Sensitive Cells in Business Statistics," Working Papers 16-10, Center for Economic Studies, U.S. Census Bureau.
    13. Ian M. Schmutte & Nathan Yoder, 2022. "Information Design for Differential Privacy," Papers 2202.05452, arXiv.org, revised Jul 2024.
    14. Heng Xu & Nan Zhang, 2022. "Implications of Data Anonymization on the Statistical Evidence of Disparity," Management Science, INFORMS, vol. 68(4), pages 2600-2618, April.
    15. Hang J. Kim & Jerome P. Reiter & Alan F. Karr, 2018. "Simultaneous edit-imputation and disclosure limitation for business establishment data," Journal of Applied Statistics, Taylor & Francis Journals, vol. 45(1), pages 63-82, January.
    16. Satkartar K. Kinney & Jerome P. Reiter & Javier Miranda, 2014. "Improving The Synthetic Longitudinal Business Database," Working Papers 14-12, Center for Economic Studies, U.S. Census Bureau.
    17. Mittag, Nikolas, 2016. "Correcting for Misreporting of Government Benefits," IZA Discussion Papers 10266, Institute of Labor Economics (IZA).
    18. Horrigan, John B. & Whitacre, Brian E. & Galperin, Hernan, 2024. "Understanding uptake in demand-side broadband subsidy programs: The affordable connectivity program case," Telecommunications Policy, Elsevier, vol. 48(8).
    19. John R. J. Thompson & Longlong Feng & R. Mark Reesor & Chuck Grace, 2021. "Know Your Clients’ Behaviours: A Cluster Analysis of Financial Transactions," JRFM, MDPI, vol. 14(2), pages 1-29, January.
    20. Robert Braid, 2024. "Alternative forms of remuneration at the Holy Spirit Hospital of Marseille in the Fourteenth century," Post-Print hal-04573252, HAL.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nas:journl:v:120:y:2023:p:e2220558120. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Eric Cain (email available below). General contact details of provider: http://www.pnas.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.