IDEAS home Printed from https://ideas.repec.org/a/eee/ecosta/v31y2024icp81-99.html
   My bibliography  Save this article

Differentially Private Goodness-of-Fit Tests for Continuous Variables

Author

Listed:
  • Kwak, Seung Woo
  • Ahn, Jeongyoun
  • Lee, Jaewoo
  • Park, Cheolwoo

Abstract

Data privacy is a growing concern in modern data analyses as more and more types of information about individuals are collected and shared. Statistical analysis in consideration of privacy is thus becoming an exciting area of research. Differential privacy can provide a means by which one can measure the stochastic risk of violating the privacy of individuals that can result from conducting an analysis, such as a simple query from a database and a hypothesis test. The main interest of the work is a goodness-of-fit test that compares the sampled data to a known distribution. Many differentially private goodness-of-fit tests have been proposed for discrete random variables, but little work has been done for continuous variables. The objective is to review some existing tests that guarantee differential privacy for discrete random variables, and to propose an extension to continuous cases via a discretization process. The proposed test procedures are demonstrated through simulated examples and applied to the Household Financial Welfare Survey of South Korea in 2018.

Suggested Citation

  • Kwak, Seung Woo & Ahn, Jeongyoun & Lee, Jaewoo & Park, Cheolwoo, 2024. "Differentially Private Goodness-of-Fit Tests for Continuous Variables," Econometrics and Statistics, Elsevier, vol. 31(C), pages 81-99.
  • Handle: RePEc:eee:ecosta:v:31:y:2024:i:c:p:81-99
    DOI: 10.1016/j.ecosta.2021.09.007
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S2452306221001143
    Download Restriction: Full text for ScienceDirect subscribers only. Contains open access articles

    File URL: https://libkey.io/10.1016/j.ecosta.2021.09.007?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Wasserman, Larry & Zhou, Shuheng, 2010. "A Statistical Framework for Differential Privacy," Journal of the American Statistical Association, American Statistical Association, vol. 105(489), pages 375-389.
    2. Campano, Fred & Salvatore, Dominick, 2006. "Income Distribution," OUP Catalogue, Oxford University Press, number 9780195300918.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Walker, Douglas O., 2007. "Patterns of income distribution among world regions," Journal of Policy Modeling, Elsevier, vol. 29(4), pages 643-655.
    2. John M. Abowd & Ian M. Schmutte & William Sexton & Lars Vilhuber, 2019. "Suboptimal Provision of Privacy and Statistical Accuracy When They are Public Goods," Papers 1906.09353, arXiv.org.
    3. Roberto Dell’Anno & Jorge Martinez-Vazquez, 2013. "A Behavioral Local Public Finance Perspective on the Renter’s Illusion Hypothesis," International Center for Public Policy Working Paper Series, at AYSPS, GSU paper1303, International Center for Public Policy, Andrew Young School of Policy Studies, Georgia State University.
    4. Claire McKay Bowen & Fang Liu & Bingyue Su, 2021. "Differentially private data release via statistical election to partition sequentially," METRON, Springer;Sapienza Università di Roma, vol. 79(1), pages 1-31, April.
    5. Ron S. Jarmin & John M. Abowd & Robert Ashmead & Ryan Cumings-Menon & Nathan Goldschlag & Michael B. Hawes & Sallie Ann Keller & Daniel Kifer & Philip Leclerc & Jerome P. Reiter & Rolando A. Rodrígue, 2023. "An in-depth examination of requirements for disclosure risk assessment," Proceedings of the National Academy of Sciences, Proceedings of the National Academy of Sciences, vol. 120(43), pages 2220558120-, October.
    6. Salvatore, Dominick & Campano, Fred, 2022. "Regional differences in inequality and income distribution in the United States," Journal of Policy Modeling, Elsevier, vol. 44(4), pages 780-789.
    7. Cárdenas-Retamal, Roberto & Dresdner-Cid, Jorge & Ceballos-Concha, Adams, 2021. "Impact assessment of salmon farming on income distribution in remote coastal areas: The Chilean case," Food Policy, Elsevier, vol. 101(C).
    8. Raj Chetty & John N. Friedman, 2019. "A Practical Method to Reduce Privacy Loss When Disclosing Statistics Based on Small Samples," AEA Papers and Proceedings, American Economic Association, vol. 109, pages 414-420, May.
    9. John M. Abowd & Robert Ashmead & Ryan Cumings-Menon & Simson Garfinkel & Micah Heineck & Christine Heiss & Robert Johns & Daniel Kifer & Philip Leclerc & Ashwin Machanavajjhala & Brett Moran & William, 2022. "The 2020 Census Disclosure Avoidance System TopDown Algorithm," Papers 2204.08986, arXiv.org.
    10. James W. Dean & G. Robert Ross, 2006. "Paradoxes and Puzzles in Our Globalized World Public Support of Trade Policy, International Outsourcing Trade Liberalization, Globalization," Carleton Economic Papers 06-07, Carleton University, Department of Economics.
    11. Chiang, Yen-Sheng, 2015. "Inequality measures perform differently in global and local assessments: An exploratory computational experiment," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 437(C), pages 1-11.
    12. Katherine B. Coffman & Lucas C. Coffman & Keith M. Marzilli Ericson, 2017. "The Size of the LGBT Population and the Magnitude of Antigay Sentiment Are Substantially Underestimated," Management Science, INFORMS, vol. 63(10), pages 3168-3186, October.
    13. Ori Heffetz & Katrina Ligett, 2014. "Privacy and Data-Based Research," Journal of Economic Perspectives, American Economic Association, vol. 28(2), pages 75-98, Spring.
    14. Dawid, H. & Harting, P. & Neugart, M., 2018. "Cohesion policy and inequality dynamics: Insights from a heterogeneous agents macroeconomic model," Journal of Economic Behavior & Organization, Elsevier, vol. 150(C), pages 220-255.
    15. Mariana Marchionni & Walter Sosa-Escudero & Javier Alejo, 2008. "Efectos Distributivos de Esquemas Alternativos de Tarifas Sociales: Una Exploración Cuantitativa," CEDLAS, Working Papers 0069, CEDLAS, Universidad Nacional de La Plata.
    16. Liu, Liwen & Zhang, Ming, 2018. "High-speed rail impacts on travel times, accessibility, and economic productivity: A benchmarking analysis in city-cluster regions of China," Journal of Transport Geography, Elsevier, vol. 73(C), pages 25-40.
    17. Toth Daniell, 2014. "Data Smearing: An Approach to Disclosure Limitation for Tabular Data," Journal of Official Statistics, Sciendo, vol. 30(4), pages 839-857, December.
    18. Jinshuo Dong & Aaron Roth & Weijie J. Su, 2022. "Gaussian differential privacy," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 84(1), pages 3-37, February.
    19. Soumya Mukherjee & Aratrika Mustafi & Aleksandra Slavkovi'c & Lars Vilhuber, 2023. "Assessing Utility of Differential Privacy for RCTs," Papers 2309.14581, arXiv.org.
    20. Baumol, William J., 2007. "On income distribution and growth," Journal of Policy Modeling, Elsevier, vol. 29(4), pages 545-548.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:ecosta:v:31:y:2024:i:c:p:81-99. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: https://www.journals.elsevier.com/econometrics-and-statistics .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.