IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2010.04703.html
   My bibliography  Save this paper

Sparse network asymptotics for logistic regression

Author

Listed:
  • Bryan S. Graham

Abstract

Consider a bipartite network where $N$ consumers choose to buy or not to buy $M$ different products. This paper considers the properties of the logistic regression of the $N\times M$ array of i-buys-j purchase decisions, $\left[Y_{ij}\right]_{1\leq i\leq N,1\leq j\leq M}$, onto known functions of consumer and product attributes under asymptotic sequences where (i) both $N$ and $M$ grow large and (ii) the average number of products purchased per consumer is finite in the limit. This latter assumption implies that the network of purchases is sparse: only a (very) small fraction of all possible purchases are actually made (concordant with many real-world settings). Under sparse network asymptotics, the first and last terms in an extended Hoeffding-type variance decomposition of the score of the logit composite log-likelihood are of equal order. In contrast, under dense network asymptotics, the last term is asymptotically negligible. Asymptotic normality of the logistic regression coefficients is shown using a martingale central limit theorem (CLT) for triangular arrays. Unlike in the dense case, the normality result derived here also holds under degeneracy of the network graphon. Relatedly, when there happens to be no dyadic dependence in the dataset in hand, it specializes to recently derived results on the behavior of logistic regression with rare events and iid data. Sparse network asymptotics may lead to better inference in practice since they suggest variance estimators which (i) incorporate additional sources of sampling variation and (ii) are valid under varying degrees of dyadic dependence.

Suggested Citation

  • Bryan S. Graham, 2020. "Sparse network asymptotics for logistic regression," Papers 2010.04703, arXiv.org.
  • Handle: RePEc:arx:papers:2010.04703
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2010.04703
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Bengtsson, Ola & Hsu, David H., 2015. "Ethnic matching in the U.S. venture capital market," Journal of Business Venturing, Elsevier, vol. 30(2), pages 338-354.
    2. repec:dau:papers:123456789/10840 is not listed on IDEAS
    3. King, Gary & Zeng, Langche, 2001. "Logistic Regression in Rare Events Data," Political Analysis, Cambridge University Press, vol. 9(2), pages 137-163, January.
    4. Koen Jochmans, 2018. "Semiparametric Analysis of Network Formation," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 36(4), pages 705-713, October.
    5. Bryan S. Graham & Fengshi Niu & James L. Powell, 2019. "Kernel Density Estimation for Undirected Dyadic Data," Papers 1907.13630, arXiv.org.
    6. Luca Marotta & Salvatore Miccichè & Yoshi Fujiwara & Hiroshi Iyetomi & Hideaki Aoyama & Mauro Gallegati & Rosario N Mantegna, 2015. "Bank-Firm Credit Network in Japan: An Analysis of a Bipartite Network," PLOS ONE, Public Library of Science, vol. 10(5), pages 1-18, May.
    7. Arthur Lewbel & Lars Nesheim, 2019. "Sparse demand systems: corners and complements," CeMMAP working papers CWP45/19, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
    8. Aldous, David J., 1981. "Representations for partially exchangeable arrays of random variables," Journal of Multivariate Analysis, Elsevier, vol. 11(4), pages 581-598, December.
    9. Jeremy T. Fox, 2018. "Estimating matching games with transfers," Quantitative Economics, Econometric Society, vol. 9(1), pages 1-38, March.
    10. Fafchamps, Marcel & Gubert, Flore, 2007. "The formation of risk sharing networks," Journal of Development Economics, Elsevier, vol. 83(2), pages 326-350, July.
    11. Marcel Fafchamps & Flore Gubert, 2007. "Risk Sharing and Network Formation," American Economic Review, American Economic Association, vol. 97(2), pages 75-79, May.
    12. Bryan S. Graham, 2017. "An Econometric Model of Network Formation With Degree Heterogeneity," Econometrica, Econometric Society, vol. 85, pages 1033-1063, July.
    13. Douglas Staiger & James H. Stock, 1997. "Instrumental Variables Regression with Weak Instruments," Econometrica, Econometric Society, vol. 65(3), pages 557-586, May.
    14. Cattaneo, Matias D. & Crump, Richard K. & Jansson, Michael, 2014. "Small Bandwidth Asymptotics For Density-Weighted Average Derivatives," Econometric Theory, Cambridge University Press, vol. 30(1), pages 176-200, February.
    15. Vaart,A. W. van der, 2000. "Asymptotic Statistics," Cambridge Books, Cambridge University Press, number 9780521784504, November.
    16. repec:dau:papers:123456789/4392 is not listed on IDEAS
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Harold D Chiang & Yukitoshi Matsushita & Taisuke Otsu, 2021. "Multiway empirical likelihood," Papers 2108.04852, arXiv.org, revised Aug 2024.
    2. Harold D Chiang & Yukitoshi Matsushita & Taisuke Otsu, 2021. "Multiway empirical likelihood," STICERD - Econometrics Paper Series 617, Suntory and Toyota International Centres for Economics and Related Disciplines, LSE.
    3. Yong Cai, 2022. "Linear Regression with Centrality Measures," Papers 2210.10024, arXiv.org.
    4. St'ephane Bonhomme & Koen Jochmans & Martin Weidner, 2024. "A Neyman-Orthogonalization Approach to the Incidental Parameter Problem," Papers 2412.10304, arXiv.org.
    5. Bryan S. Graham & Fengshi Niu & James L. Powell, 2020. "Minimax Risk and Uniform Convergence Rates for Nonparametric Dyadic Regression," Papers 2012.08444, arXiv.org, revised Mar 2021.
    6. Konrad Menzel, 2021. "Bootstrap With Cluster‐Dependence in Two or More Dimensions," Econometrica, Econometric Society, vol. 89(5), pages 2143-2188, September.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Bryan S. Graham, 2019. "Network Data," CeMMAP working papers CWP71/19, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
    2. Bryan S. Graham, 2019. "Network Data," Papers 1912.06346, arXiv.org.
    3. Bryan S. Graham, 2019. "Dyadic Regression," Papers 1908.09029, arXiv.org.
    4. Graham, Bryan S. & Niu, Fengshi & Powell, James L., 2024. "Kernel density estimation for undirected dyadic data," Journal of Econometrics, Elsevier, vol. 240(2).
    5. Bryan S. Graham, 2017. "An econometric model of network formation with degree heterogeneity," CeMMAP working papers 08/17, Institute for Fiscal Studies.
    6. Tadao Hoshino & Daichi Shimamoto & Yasuyuki Todo, 2020. "Accounting for Heterogeneity in Network Formation Behaviour: An Application to Vietnamese SMEs," Oxford Bulletin of Economics and Statistics, Department of Economics, University of Oxford, vol. 82(5), pages 1042-1067, October.
    7. Francesco Bartolucci & Claudia Pigini & Francesco Valentini, 2024. "MCMC conditional maximum likelihood for the two-way fixed-effects logit," Econometric Reviews, Taylor & Francis Journals, vol. 43(6), pages 379-404, July.
    8. Andreas Dzemski, 2019. "An Empirical Model of Dyadic Link Formation in a Network with Unobserved Heterogeneity," The Review of Economics and Statistics, MIT Press, vol. 101(5), pages 763-776, December.
    9. Bryan S. Graham, 2024. "Sparse Network Asymptotics for Logistic Regression Under Possible Misspecification," Econometrica, Econometric Society, vol. 92(6), pages 1837-1868, November.
    10. Alejandro Sanchez-Becerra, 2022. "The Network Propensity Score: Spillovers, Homophily, and Selection into Treatment," Papers 2209.14391, arXiv.org.
    11. Koen Jochmans & Martin Weidner, 2019. "Fixed‐Effect Regressions on Network Data," Econometrica, Econometric Society, vol. 87(5), pages 1543-1560, September.
    12. Alex Centeno, 2022. "A Structural Model for Detecting Communities in Networks," Papers 2209.08380, arXiv.org, revised Oct 2022.
    13. Jun Sung Kim & Eleonora Patacchini & Pierre M. Picard & Yves Zenou, 2023. "Spatial interactions," Quantitative Economics, Econometric Society, vol. 14(4), pages 1295-1335, November.
    14. Harold D Chiang & Yukun Ma & Joel Rodrigue & Yuya Sasaki, 2021. "Dyadic double/debiased machine learning for analyzing determinants of free trade agreements," Papers 2110.04365, arXiv.org, revised Dec 2022.
    15. Brice Romuald Gueyap Kounga, 2023. "Nonparametric Regression with Dyadic Data," Papers 2310.12825, arXiv.org.
    16. Zuckerman, David, 2024. "Multidimensional homophily," Journal of Economic Behavior & Organization, Elsevier, vol. 218(C), pages 486-513.
    17. Nicola Campigotto & Chiara Rapallini & Aldo Rustichini, 2022. "School friendship networks, homophily and multiculturalism: evidence from European countries," Journal of Population Economics, Springer;European Society for Population Economics, vol. 35(4), pages 1687-1722, October.
    18. Arun G. Chandrasekhar & Horacio Larreguy & Juan Pablo Xandri, 2020. "Testing Models of Social Learning on Networks: Evidence From Two Experiments," Econometrica, Econometric Society, vol. 88(1), pages 1-32, January.
    19. Luis E. Candelaria, 2020. "A Semiparametric Network Formation Model with Unobserved Linear Heterogeneity," Papers 2007.05403, arXiv.org, revised Aug 2020.
    20. Shuyang Sheng, 2020. "A Structural Econometric Analysis of Network Formation Games Through Subnetworks," Econometrica, Econometric Society, vol. 88(5), pages 1829-1858, September.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2010.04703. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.