IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2001.06052.html
   My bibliography  Save this paper

Recovering Network Structure from Aggregated Relational Data using Penalized Regression

Author

Listed:
  • Hossein Alidaee
  • Eric Auerbach
  • Michael P. Leung

Abstract

Social network data can be expensive to collect. Breza et al. (2017) propose aggregated relational data (ARD) as a low-cost substitute that can be used to recover the structure of a latent social network when it is generated by a specific parametric random effects model. Our main observation is that many economic network formation models produce networks that are effectively low-rank. As a consequence, network recovery from ARD is generally possible without parametric assumptions using a nuclear-norm penalized regression. We demonstrate how to implement this method and provide finite-sample bounds on the mean squared error of the resulting estimator for the distribution of network links. Computation takes seconds for samples with hundreds of observations. Easy-to-use code in R and Python can be found at https://github.com/mpleung/ARD.

Suggested Citation

  • Hossein Alidaee & Eric Auerbach & Michael P. Leung, 2020. "Recovering Network Structure from Aggregated Relational Data using Penalized Regression," Papers 2001.06052, arXiv.org.
  • Handle: RePEc:arx:papers:2001.06052
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2001.06052
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. de Paula, Aureo & Rasul, Imran & Souza, Pedro, 2018. "Identifying Network Ties from Panel Data: Theory and an Application to Tax Competition," CEPR Discussion Papers 12792, C.E.P.R. Discussion Papers.
    2. Alexandre Belloni & Mingli Chen & Victor Chernozhukov, 2016. "Quantile Graphical Models: Prediction and Conditional Independence with Applications to Systemic Risk," Papers 1607.00286, arXiv.org, revised Oct 2019.
    3. Aureo de Paula & Imran Rasul & Pedro CL Souza, 2018. "Recovering social networks from panel data: Identification, simulations and an application," Documentos de Trabajo 16173, The Latin American and Caribbean Economic Association (LACEA).
    4. Hoff P.D. & Raftery A.E. & Handcock M.S., 2002. "Latent Space Approaches to Social Network Analysis," Journal of the American Statistical Association, American Statistical Association, vol. 97, pages 1090-1098, December.
    5. Bryan S. Graham, 2017. "An Econometric Model of Network Formation With Degree Heterogeneity," Econometrica, Econometric Society, vol. 85, pages 1033-1063, July.
    6. Belloni, Alexandre. & Chen, Mingli & Chernozhukov, Victor, 2016. "Quantile Graphical Models: Prediction and Conditional Independence with Applications to Financial Risk Management," The Warwick Economics Research Paper Series (TWERPS) 1125, University of Warwick, Department of Economics.
    7. Tyler H. McCormick & Tian Zheng, 2015. "Latent Surface Models for Networks Using Aggregated Relational Data," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 110(512), pages 1684-1695, December.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Hong, Shengjie & Su, Liangjun & Jiang, Tao, 2023. "Profile GMM estimation of panel data models with interactive fixed effects," Journal of Econometrics, Elsevier, vol. 235(2), pages 927-948.
    2. Marko Mlikota, 2022. "Cross-Sectional Dynamics Under Network Structure: Theory and Macroeconomic Applications," Papers 2211.13610, arXiv.org, revised Sep 2024.
    3. Steven Wilkins Reeves & Shane Lubold & Arun G. Chandrasekhar & Tyler H. McCormick, 2024. "Model-Based Inference and Experimental Design for Interference Using Partial Network Data," Papers 2406.11940, arXiv.org.
    4. Ma, Shujie & Su, Liangjun & Zhang, Yichong, 2020. "Detecting Latent Communities in Network Formation Models," Economics and Statistics Working Papers 12-2020, Singapore Management University, School of Economics.
    5. Alejandro Sanchez-Becerra, 2022. "The Network Propensity Score: Spillovers, Homophily, and Selection into Treatment," Papers 2209.14391, arXiv.org.
    6. Yiren Wang & Liangjun Su & Yichong Zhang, 2022. "Low-rank Panel Quantile Regression: Estimation and Inference," Papers 2210.11062, arXiv.org.
    7. Mohamed Mostagir & James Siderius, 2023. "Social Inequality and the Spread of Misinformation," Management Science, INFORMS, vol. 69(2), pages 968-995, February.
    8. Candelaria, Luis E. & Ura, Takuya, 2023. "Identification and inference of network formation games with misclassified links," Journal of Econometrics, Elsevier, vol. 235(2), pages 862-891.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Marco Battaglini & Eleonora Patacchini & Edoardo Rainone, 2019. "Endogenous Social Connections in Legislatures," NBER Working Papers 25988, National Bureau of Economic Research, Inc.
    2. Chih‐Sheng Hsieh & Lung‐Fei Lee & Vincent Boucher, 2020. "Specification and estimation of network formation and network interaction models with the exponential probability distribution," Quantitative Economics, Econometric Society, vol. 11(4), pages 1349-1390, November.
    3. Boucher, Vincent, 2020. "Equilibrium homophily in networks," European Economic Review, Elsevier, vol. 123(C).
    4. Patacchini, Eleonora & Hsieh, Chih-Sheng & Lin, Xu, 2019. "Social Interaction Methods," CEPR Discussion Papers 14141, C.E.P.R. Discussion Papers.
    5. Yann Bramoullé & Habiba Djebbari & Bernard Fortin, 2020. "Peer Effects in Networks: A Survey," Annual Review of Economics, Annual Reviews, vol. 12(1), pages 603-629, August.
    6. Luisa Corrado & Roberta Distante & Majlinda Joxhe, 2019. "Body mass index and social interactions from adolescence to adulthood," Spatial Economic Analysis, Taylor & Francis Journals, vol. 14(4), pages 425-445, October.
    7. Magnus, Jan R. & Sentana, Enrique, 2020. "Zero-diagonality as a linear structure," Economics Letters, Elsevier, vol. 196(C).
    8. Chen, Mingli & Fernández-Val, Iván & Weidner, Martin, 2021. "Nonlinear factor models for network and panel data," Journal of Econometrics, Elsevier, vol. 220(2), pages 296-324.
    9. Zhou, Wenyu, 2019. "A network social interaction model with heterogeneous links," Economics Letters, Elsevier, vol. 180(C), pages 50-53.
    10. Promit K. Chaudhuri & Matthew O. Jackson & Sudipta Sarangi & Hector Tzavellas, 2023. "Games Under Network Uncertainty," Papers 2305.03124, arXiv.org, revised Dec 2024.
    11. Candelaria, Luis E. & Ura, Takuya, 2020. "Identification and Inference of Network Formation Games with Misclassified Links," The Warwick Economics Research Paper Series (TWERPS) 1258, University of Warwick, Department of Economics.
    12. Bryan S. Graham, 2019. "Network Data," Papers 1912.06346, arXiv.org.
    13. Ayden Higgins & Federico Martellosio, 2019. "Shrinkage Estimation of Network Spillovers with Factor Structured Errors," Papers 1909.02823, arXiv.org, revised Nov 2021.
    14. Chih‐Sheng Hsieh & Xu Lin, 2021. "Social interactions and social preferences in social networks," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 36(2), pages 165-189, March.
    15. Linardi, Fernando & Diks, Cees & van der Leij, Marco & Lazier, Iuri, 2020. "Dynamic interbank network analysis using latent space models," Journal of Economic Dynamics and Control, Elsevier, vol. 112(C).
    16. Hsieh, Chih-Sheng & Hsu, Yu-Chin & Ko, Stanley I.M. & Kovářík, Jaromír & Logan, Trevon D., 2024. "Non-representative sampled networks: Estimation of network structural properties by weighting," Journal of Econometrics, Elsevier, vol. 240(1).
    17. Victor Chernozhukov & Chen Huang & Weining Wang, 2021. "Uniform Inference on High-dimensional Spatial Panel Networks," Papers 2105.07424, arXiv.org, revised Sep 2023.
    18. Michael Delgado & Meilin Ma & H. Holly Wang, 2021. "Exploring Spatial Price Relationships: The Case of African Swine Fever in China," NBER Chapters, in: Risks in Agricultural Supply Chains, National Bureau of Economic Research, Inc.
    19. Junhui Cai & Dan Yang & Wu Zhu & Haipeng Shen & Linda Zhao, 2021. "Network regression and supervised centrality estimation," Papers 2111.12921, arXiv.org.
    20. Chih-Sheng Hsieh & Stanley I. M. Ko & Jaromír Kovářík & Trevon Logan, 2018. "Non-Randomly Sampled Networks: Biases and Corrections," NBER Working Papers 25270, National Bureau of Economic Research, Inc.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2001.06052. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.