IDEAS home Printed from https://ideas.repec.org/a/eee/econom/v238y2024i2s0304407623003457.html
   My bibliography  Save this article

Distributed estimation and inference for spatial autoregression model with large scale networks

Author

Listed:
  • Ren, Yimeng
  • Li, Zhe
  • Zhu, Xuening
  • Gao, Yuan
  • Wang, Hansheng

Abstract

The rapid growth of online network platforms generates large-scale network data and it poses great challenges for statistical analysis using the spatial autoregression (SAR) model. In this work, we develop a novel distributed estimation and statistical inference framework for the SAR model on a distributed system. We first propose a distributed network least squares approximation (DNLSA) method. This enables us to obtain a one-step estimator by taking a weighted average of local estimators on each worker. Afterwards, a refined two-step estimation is designed to further reduce the estimation bias. For statistical inference, we utilize a random projection method to reduce the expensive communication cost. Theoretically, we show the consistency and asymptotic normality of both the one-step and two-step estimators. In addition, we provide theoretical guarantee of the distributed statistical inference procedure. The theoretical findings and computational advantages are validated by several numerical simulations implemented on the Spark system. Lastly, an experiment on the Yelp dataset further illustrates the usefulness of the proposed methodology.

Suggested Citation

  • Ren, Yimeng & Li, Zhe & Zhu, Xuening & Gao, Yuan & Wang, Hansheng, 2024. "Distributed estimation and inference for spatial autoregression model with large scale networks," Journal of Econometrics, Elsevier, vol. 238(2).
  • Handle: RePEc:eee:econom:v:238:y:2024:i:2:s0304407623003457
    DOI: 10.1016/j.jeconom.2023.105629
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0304407623003457
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.jeconom.2023.105629?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Härdle, Wolfgang Karl & Wang, Weining & Yu, Lining, 2016. "TENET: Tail-Event driven NETwork risk," Journal of Econometrics, Elsevier, vol. 192(2), pages 499-513.
    2. Edward L. Glaeser & Bruce Sacerdote & José A. Scheinkman, 1996. "Crime and Social Interactions," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 111(2), pages 507-548.
    3. Lee, Lung-fei & Yu, Jihai, 2010. "Estimation of spatial autoregressive panel data models with fixed effects," Journal of Econometrics, Elsevier, vol. 154(2), pages 165-185, February.
    4. Kelejian, Harry H & Prucha, Ingmar R, 1998. "A Generalized Spatial Two-Stage Least Squares Procedure for Estimating a Spatial Autoregressive Model with Autoregressive Disturbances," The Journal of Real Estate Finance and Economics, Springer, vol. 17(1), pages 99-121, July.
    5. Xiaodong Liu & Eleonora Patacchini & Edoardo Rainone, 2017. "Peer effects in bedtime decisions among adolescents: a social network model with sampled data," Econometrics Journal, Royal Economic Society, vol. 20(3), pages 103-125, October.
    6. Yang, Zhenlin & Yu, Jihai & Liu, Shew Fan, 2016. "Bias correction and refined inferences for fixed effects spatial panel data models," Regional Science and Urban Economics, Elsevier, vol. 61(C), pages 52-72.
    7. Kelejian, Harry H. & Prucha, Ingmar R., 2010. "Specification and estimation of spatial autoregressive models with autoregressive and heteroskedastic disturbances," Journal of Econometrics, Elsevier, vol. 157(1), pages 53-67, July.
    8. Jianqing Fan & Yongyi Guo & Kaizheng Wang, 2023. "Communication-Efficient Accurate Statistical Estimation," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 118(542), pages 1000-1010, April.
    9. Ethan Cohen‐Cole & Xiaodong Liu & Yves Zenou, 2018. "Multivariate choices and identification of social interactions," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 33(2), pages 165-178, March.
    10. Lung-Fei Lee & Jihai Yu, 2009. "Spatial Nonstationarity and Spurious Regression: the Case with a Row-normalized Spatial Weights Matrix," Spatial Economic Analysis, Taylor & Francis Journals, vol. 4(3), pages 301-327.
    11. Kelejian, Harry H. & Prucha, Ingmar R., 2004. "Estimation of simultaneous systems of spatially interrelated cross sectional equations," Journal of Econometrics, Elsevier, vol. 118(1-2), pages 27-50.
    12. Tao Zou & Wei Lan & Hansheng Wang & Chih-Ling Tsai, 2017. "Covariance Regression Analysis," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 112(517), pages 266-281, January.
    13. Michael I. Jordan & Jason D. Lee & Yun Yang, 2019. "Communication-Efficient Distributed Statistical Inference," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 114(526), pages 668-681, April.
    14. Badi H. Baltagi & Ying Deng, 2015. "EC3SLS Estimator for a Simultaneous System of Spatial Autoregressive Equations with Random Effects," Econometric Reviews, Taylor & Francis Journals, vol. 34(6-10), pages 659-694, December.
    15. Xiaodong Liu & Paulo Saraiva, 2019. "GMM estimation of spatial autoregressive models in a system of simultaneous equations with heteroskedasticity," Econometric Reviews, Taylor & Francis Journals, vol. 38(4), pages 359-385, April.
    16. Aaron Sojourner, 2013. "Identification of Peer Effects with Missing Peer Data: Evidence from Project STAR," Economic Journal, Royal Economic Society, vol. 123(569), pages 574-605, June.
    17. Jing Zhou & Yundong Tu & Yuxin Chen & Hansheng Wang, 2017. "Estimating Spatial Autocorrelation With Sampled Network Data," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 35(1), pages 130-138, January.
    18. Yujia Wu & Wei Lan & Tao Zou & Chih-Ling Tsai, 2022. "Inward and Outward Network Influence Analysis," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 40(4), pages 1617-1628, October.
    19. Tao, Ji & Yu, Jihai, 2012. "The spatial time lag in panel data models," Economics Letters, Elsevier, vol. 117(3), pages 544-547.
    20. Xuening Zhu & Zhanrui Cai & Yanyuan Ma, 2022. "Network Functional Varying Coefficient Model," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 117(540), pages 2074-2085, October.
    21. Baltagi, Badi H. & Bresson, Georges, 2011. "Maximum likelihood estimation and Lagrange multiplier tests for panel seemingly unrelated regressions with spatial lag and spatial errors: An application to hedonic housing prices in Paris," Journal of Urban Economics, Elsevier, vol. 69(1), pages 24-42, January.
    22. Tianxi Cai & Molei Liu & Yin Xia, 2022. "Individual Data Protected Integrative Regression Analysis of High-Dimensional Heterogeneous Data," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 117(540), pages 2105-2119, October.
    23. Lung-Fei Lee, 2004. "Asymptotic Distributions of Quasi-Maximum Likelihood Estimators for Spatial Autoregressive Models," Econometrica, Econometric Society, vol. 72(6), pages 1899-1925, November.
    24. Zhu, Xuening & Huang, Danyang & Pan, Rui & Wang, Hansheng, 2020. "Multivariate spatial autoregressive model for large scale social networks," Journal of Econometrics, Elsevier, vol. 215(2), pages 591-606.
    25. Lung-fei Lee, 2003. "Best Spatial Two-Stage Least Squares Estimators for a Spatial Autoregressive Model with Autoregressive Disturbances," Econometric Reviews, Taylor & Francis Journals, vol. 22(4), pages 307-335.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Badi H. Baltagi & Peter H. Egger & Michaela Kesina, 2022. "Bayesian estimation of multivariate panel probits with higher‐order network interdependence and an application to firms' global market participation in Guangdong," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 37(7), pages 1356-1378, November.
    2. Zhu, Xuening & Huang, Danyang & Pan, Rui & Wang, Hansheng, 2020. "Multivariate spatial autoregressive model for large scale social networks," Journal of Econometrics, Elsevier, vol. 215(2), pages 591-606.
    3. Lina Lu, 2017. "Simultaneous Spatial Panel Data Models with Common Shocks," Supervisory Research and Analysis Working Papers RPA 17-3, Federal Reserve Bank of Boston.
    4. Luc Anselin, 2010. "Thirty years of spatial econometrics," Papers in Regional Science, Wiley Blackwell, vol. 89(1), pages 3-25, March.
    5. Zhu, Xuening & Chang, Xiangyu & Li, Runze & Wang, Hansheng, 2019. "Portal nodes screening for large scale social networks," Journal of Econometrics, Elsevier, vol. 209(2), pages 145-157.
    6. Heather D. Gibson & Stephen G. Hall & Deborah Gefang & Pavlos Petroulas & George S. Tavlas, 2020. "Did the absence of a central bank backstop in the sovereign bond markets exacerbate spillovers during the euro-area crisis?," Working Papers 281, Bank of Greece.
    7. Yang, Kai & Lee, Lung-fei, 2017. "Identification and QML estimation of multivariate and simultaneous equations spatial autoregressive models," Journal of Econometrics, Elsevier, vol. 196(1), pages 196-214.
    8. Heather D Gibson & Stephen G Hall & Deborah GeFang & Pavlos Petroulas & George S Tavlas, 2021. "Cross-country spillovers of national financial markets and the effectiveness of ECB policies during the euro-area crisis," Oxford Economic Papers, Oxford University Press, vol. 73(4), pages 1454-1470.
    9. William C. Horrace & Hyunseok Jung & Shane Sanders, 2022. "Network Competition and Team Chemistry in the NBA," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 40(1), pages 35-49, January.
    10. Gibson, Heather D. & Hall, Stephen G. & Petroulas, Pavlos & Tavlas, George S., 2022. "An investigation into feedback and spatial relationships between banks’ share prices and sovereign bond spreads during the euro crisis," Journal of Financial Stability, Elsevier, vol. 63(C).
    11. Debarsy, Nicolas & Jin, Fei & Lee, Lung-fei, 2015. "Large sample properties of the matrix exponential spatial specification with an application to FDI," Journal of Econometrics, Elsevier, vol. 188(1), pages 1-21.
    12. Álvarez, Inmaculada C. & Barbero, Javier & Zofío, José L., 2017. "A Panel Data Toolbox for MATLAB," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 76(i06).
    13. Lee, Lung-fei & Yu, Jihai, 2010. "Some recent developments in spatial panel data models," Regional Science and Urban Economics, Elsevier, vol. 40(5), pages 255-271, September.
    14. Lin, Xu & Lee, Lung-fei, 2010. "GMM estimation of spatial autoregressive models with unknown heteroskedasticity," Journal of Econometrics, Elsevier, vol. 157(1), pages 34-52, July.
    15. AMBA OYON, Claude Marius & Mbratana, Taoufiki, 2017. "Simultaneous equation models with spatially autocorrelated error components," MPRA Paper 82395, University Library of Munich, Germany.
    16. Elhorst, J. Paul & Emili, Silvia, 2022. "A spatial econometric multivariate model of Okun's law," Regional Science and Urban Economics, Elsevier, vol. 93(C).
    17. Fei Jin & Lung‐fei Lee & Kai Yang, 2024. "Best linear and quadratic moments for spatial econometric models with an application to spatial interdependence patterns of employment growth in US counties," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 39(4), pages 640-658, June.
    18. Roger Bivand & Giovanni Millo & Gianfranco Piras, 2021. "A Review of Software for Spatial Econometrics in R," Mathematics, MDPI, vol. 9(11), pages 1-40, June.
    19. Chen, Elynn Y. & Fan, Jianqing & Zhu, Xuening, 2023. "Community network auto-regression for high-dimensional time series," Journal of Econometrics, Elsevier, vol. 235(2), pages 1239-1256.
    20. Shew Fan Liu & Zhenlin Yang, 2015. "Asymptotic Distribution and Finite Sample Bias Correction of QML Estimators for Spatial Error Dependence Model," Econometrics, MDPI, vol. 3(2), pages 1-36, May.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:econom:v:238:y:2024:i:2:s0304407623003457. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/jeconom .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.