IDEAS home Printed from https://ideas.repec.org/p/ehl/lserod/122099.html
   My bibliography  Save this paper

Edge differentially private estimation in the β-model via jittering and method of moments

Author

Listed:
  • Chang, Jinyuan
  • Hu, Qiao
  • Kolaczyk, Eric D.
  • Yao, Qiwei
  • Yi, Fengting

Abstract

A standing challenge in data privacy is the trade-off between the level of privacy and the efficiency of statistical inference. Here, we conduct an in-depth study of this trade-off for parameter estimation in the β-model (Ann. Appl. Probab. 21 (2011) 1400–1435) for edge differentially private network data released via jittering (J. R. Stat. Soc. Ser. C. Appl. Stat. 66 (2017) 481–500). Unlike most previous approaches based on maximum likelihood estimation for this network model, we proceed via the method of moments. This choice facilitates our exploration of a substantially broader range of privacy levels—corresponding to stricter privacy—than has been to date. Over this new range, we discover our proposed estimator for the parameters exhibits an interesting phase transition, with both its convergence rate and asymptotic variance following one of three different regimes of behavior depending on the level of privacy. Because identification of the operable regime is difficult, if not impossible in practice, we devise a novel adaptive bootstrap procedure to construct uniform inference across different phases. In fact, leveraging this bootstrap we are able to provide for simultaneous inference of all parameters in the β-model (i.e., equal to the number of nodes), which, to our best knowledge, is the first result of its kind. Numerical experiments confirm the competitive and reliable finite sample performance of the proposed inference methods, next to a comparable maximum likelihood method, as well as significant advantages in terms of computational speed and memory.

Suggested Citation

  • Chang, Jinyuan & Hu, Qiao & Kolaczyk, Eric D. & Yao, Qiwei & Yi, Fengting, 2024. "Edge differentially private estimation in the β-model via jittering and method of moments," LSE Research Online Documents on Economics 122099, London School of Economics and Political Science, LSE Library.
  • Handle: RePEc:ehl:lserod:122099
    as

    Download full text from publisher

    File URL: http://eprints.lse.ac.uk/122099/
    File Function: Open access version.
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Vishesh Karwa & Pavel N. Krivitsky & Aleksandra B. Slavković, 2017. "Sharing social network data: differentially private estimation of exponential family random-graph models," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 66(3), pages 481-500, April.
    2. Chang, Jinyuan & Qiu, Yumou & Yao, Qiwei & Zou, Tao, 2018. "Confidence regions for entries of a large precision matrix," Journal of Econometrics, Elsevier, vol. 206(1), pages 57-82.
    3. Victor Chernozhukov & Denis Chetverikov & Kengo Kato, 2012. "Gaussian approximations and multiplier bootstrap for maxima of sums of high-dimensional random vectors," Papers 1212.6906, arXiv.org, revised Jan 2018.
    4. Hennig, Christian, 2007. "Cluster-wise assessment of cluster stability," Computational Statistics & Data Analysis, Elsevier, vol. 52(1), pages 258-271, September.
    5. Jinyuan Chang & Eric D Kolaczyk & Qiwei Yao, 2020. "Discussion of ‘Network cross-validation by edge sampling’," Biometrika, Biometrika Trust, vol. 107(2), pages 277-280.
    6. Jinyuan Chang & Chao Zheng & Wen‐Xin Zhou & Wen Zhou, 2017. "Simulation‐based hypothesis testing of high dimensional means under covariance heterogeneity," Biometrics, The International Biometric Society, vol. 73(4), pages 1300-1310, December.
    7. Chang, Jinyuan & Qiu, Yumou & Yao, Qiwei & Zou, Tao, 2018. "Confidence regions for entries of a large precision matrix," LSE Research Online Documents on Economics 87513, London School of Economics and Political Science, LSE Library.
    8. Wasserman, Larry & Zhou, Shuheng, 2010. "A Statistical Framework for Differential Privacy," Journal of the American Statistical Association, American Statistical Association, vol. 105(489), pages 375-389.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Chang, Jinyuan & Jiang, Qing & Shao, Xiaofeng, 2023. "Testing the martingale difference hypothesis in high dimension," Journal of Econometrics, Elsevier, vol. 235(2), pages 972-1000.
    2. Chang, Jinyuan & Qiu, Yumou & Yao, Qiwei & Zou, Tao, 2018. "Confidence regions for entries of a large precision matrix," LSE Research Online Documents on Economics 87513, London School of Economics and Political Science, LSE Library.
    3. Yumou Qiu & Jing Tao & Xiao‐Hua Zhou, 2021. "Inference of heterogeneous treatment effects using observational data with high‐dimensional covariates," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 83(5), pages 1016-1043, November.
    4. Hafner, Christian M. & Wang, Linqi, 2023. "A dynamic conditional score model for the log correlation matrix," Journal of Econometrics, Elsevier, vol. 237(2).
    5. Baek, Changryong & Gates, Katheleen M. & Leinwand, Benjamin & Pipiras, Vladas, 2021. "Two sample tests for high-dimensional autocovariances," Computational Statistics & Data Analysis, Elsevier, vol. 153(C).
    6. Mehmet Caner & Qingliang Fan & Yingying Li, 2024. "Navigating Complexity: Constrained Portfolio Analysis in High Dimensions with Tracking Error and Weight Constraints," Papers 2402.17523, arXiv.org.
    7. Caner, Mehmet & Medeiros, Marcelo & Vasconcelos, Gabriel F.R., 2023. "Sharpe Ratio analysis in high dimensions: Residual-based nodewise regression in factor models," Journal of Econometrics, Elsevier, vol. 235(2), pages 393-417.
    8. Lixiu Wu & Jiang Hu, 2024. "Multi-sample hypothesis testing of high-dimensional mean vectors under covariance heterogeneity," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 76(4), pages 579-615, August.
    9. Hafner, Christian M. & Linton, Oliver B. & Tang, Haihan, 2020. "Estimation of a multiplicative correlation structure in the large dimensional case," Journal of Econometrics, Elsevier, vol. 217(2), pages 431-470.
    10. Zhengbang Li & Fuxiang Liu & Luanjie Zeng & Guoxin Zuo, 2021. "A stationary bootstrap test about two mean vectors comparison with somewhat dense differences and fewer sample size than dimension," Computational Statistics, Springer, vol. 36(2), pages 941-960, June.
    11. Ekaterina Seregina, 2020. "A Basket Half Full: Sparse Portfolios," Papers 2011.04278, arXiv.org, revised Apr 2021.
    12. Chang, Jinyuan & Qiu, Yumou & Yao, Qiwei & Zou, Tao, 2018. "Confidence regions for entries of a large precision matrix," Journal of Econometrics, Elsevier, vol. 206(1), pages 57-82.
    13. He, Yong & Zhang, Mingjuan & Zhang, Xinsheng & Zhou, Wang, 2020. "High-dimensional two-sample mean vectors test and support recovery with factor adjustment," Computational Statistics & Data Analysis, Elsevier, vol. 151(C).
    14. Victor Chernozhukov & Denis Chetverikov & Kengo Kato & Yuta Koike, 2022. "High-dimensional Data Bootstrap," Papers 2205.09691, arXiv.org.
    15. Dmitry Semenov & Alexander Koldanov & Petr Koldanov, 2024. "Analysis of weakly correlated nodes in market network," Computational Management Science, Springer, vol. 21(1), pages 1-18, June.
    16. Haoyan Hu & Yumou Qiu, 2023. "Inference for nonparanormal partial correlation via regularized rank‐based nodewise regression," Biometrics, The International Biometric Society, vol. 79(2), pages 1173-1186, June.
    17. John M. Abowd & Ian M. Schmutte & William Sexton & Lars Vilhuber, 2019. "Suboptimal Provision of Privacy and Statistical Accuracy When They are Public Goods," Papers 1906.09353, arXiv.org.
    18. Brice Ozenne & Esben Budtz-Jørgensen & Sebastian Elgaard Ebert, 2023. "Controlling the familywise error rate when performing multiple comparisons in a linear latent variable model," Computational Statistics, Springer, vol. 38(1), pages 1-23, March.
    19. Hansen, Christian & Liao, Yuan, 2019. "The Factor-Lasso And K-Step Bootstrap Approach For Inference In High-Dimensional Economic Applications," Econometric Theory, Cambridge University Press, vol. 35(3), pages 465-509, June.
    20. Saha, Enakshi & Sarkar, Soham & Ghosh, Anil K., 2017. "Some high-dimensional one-sample tests based on functions of interpoint distances," Journal of Multivariate Analysis, Elsevier, vol. 161(C), pages 83-95.

    More about this item

    Keywords

    Adaptive inference; bootstrap inference; data privacy; data release mechanism; edge differential privacy; phase transition; β-model;
    All these keywords.

    JEL classification:

    • C1 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:ehl:lserod:122099. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: LSERO Manager (email available below). General contact details of provider: https://edirc.repec.org/data/lsepsuk.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.