IDEAS home Printed from https://ideas.repec.org/a/nas/journl/v119y2022pe2104906119.html
   My bibliography  Save this article

Balancing data privacy and usability in the federal statistical system

Author

Listed:
  • V. Joseph Hotz

    (a Department of Economics, Duke University, Durham, NC 27708;)

  • Christopher R. Bollinger

    (b Department of Economics, University of Kentucky, Lexington, KY 40503;)

  • Tatiana Komarova

    (c The London School of Economics and Political Science, London WC2A 3PH, United Kingdom;)

  • Charles F. Manski

    (d Department of Economics, Northwestern University, Evanston, IL 60208;)

  • Robert A. Moffitt

    (e Department of Economics, Johns Hopkins University, Baltimore, MD 21211;)

  • Denis Nekipelov

    (f Department of Economics, University of Virginia, Charlottesville, VA 22904;)

  • Aaron Sojourner

    (g W. E. Upjohn Institute for Employment Policy, Kalamazoo, MI 49007;)

  • Bruce D. Spencer

    (h Department of Statistics and Data Science, Northwestern University, Evanston, IL 60208)

Abstract

The federal statistical system is experiencing competing pressures for change. On the one hand, for confidentiality reasons, much socially valuable data currently held by federal agencies is either not made available to researchers at all or only made available under onerous conditions. On the other hand, agencies which release public databases face new challenges in protecting the privacy of the subjects in those databases, which leads them to consider releasing fewer data or masking the data in ways that will reduce their accuracy. In this essay, we argue that the discussion has not given proper consideration to the reduced social benefits of data availability and their usability relative to the value of increased levels of privacy protection. A more balanced benefit–cost framework should be used to assess these trade-offs. We express concerns both with synthetic data methods for disclosure limitation, which will reduce the types of research that can be reliably conducted in unknown ways, and with differential privacy criteria that use what we argue is an inappropriate measure of disclosure risk. We recommend that the measure of disclosure risk used to assess all disclosure protection methods focus on what we believe is the risk that individuals should care about, that more study of the impact of differential privacy criteria and synthetic data methods on data usability for research be conducted before either is put into widespread use, and that more research be conducted on alternative methods of disclosure risk reduction that better balance benefits and costs.

Suggested Citation

  • V. Joseph Hotz & Christopher R. Bollinger & Tatiana Komarova & Charles F. Manski & Robert A. Moffitt & Denis Nekipelov & Aaron Sojourner & Bruce D. Spencer, 2022. "Balancing data privacy and usability in the federal statistical system," Proceedings of the National Academy of Sciences, Proceedings of the National Academy of Sciences, vol. 119(31), pages 2104906119-, August.
  • Handle: RePEc:nas:journl:v:119:y:2022:p:e2104906119
    as

    Download full text from publisher

    File URL: http://www.pnas.org/content/119/31/e2104906119.full
    Download Restriction: no
    ---><---

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nas:journl:v:119:y:2022:p:e2104906119. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Eric Cain (email available below). General contact details of provider: http://www.pnas.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.