IDEAS home Printed from https://ideas.repec.org/a/kap/jgeosy/v21y2019i2d10.1007_s10109-019-00293-3.html
   My bibliography  Save this article

Spatial autocorrelation for massive spatial data: verification of efficiency and statistical power asymptotics

Author

Listed:
  • Qing Luo

    (Wuhan University
    Wuhan University)

  • Daniel A. Griffith

    (The University of Texas at Dallas)

  • Huayi Wu

    (Wuhan University
    Wuhan University)

Abstract

Being a hot topic in recent years, many studies have been conducted with spatial data containing massive numbers of observations. Because initial developments for classical spatial autocorrelation statistics are based on rather small sample sizes, in the context of massive spatial datasets, this paper presents extensions to efficiency and statistical power comparisons between the Moran coefficient and the Geary ratio for different variable distribution assumptions and selected geographic neighborhood definitions. The question addressed asks whether or not earlier results for small n extend to large and massively large n, especially for non-normal variables; implications established are relevant to big spatial data. To achieve these comparisons, this paper summarizes proofs of limiting variances, also called asymptotic variances, to do the efficiency analysis, and derives the relationship function between the two statistics to compare their statistical power at the same scale. Visualization of this statistical power analysis employs an alternative technique that already appears in the literature, furnishing additional understanding and clarity about these spatial autocorrelation statistics. Results include: the Moran coefficient is more efficient than the Geary ratio for most surface partitionings, because this index has a relatively smaller asymptotic as well as exact variance, and the superior power of the Moran coefficient vis-à-vis the Geary ratio for positive spatial autocorrelation depends upon the type of geographic configuration, with this power approaching one as sample sizes become increasingly large. Because spatial analysts usually calculate these two statistics for interval/ration data, this paper also includes comments about the join count statistics used for nominal data.

Suggested Citation

  • Qing Luo & Daniel A. Griffith & Huayi Wu, 2019. "Spatial autocorrelation for massive spatial data: verification of efficiency and statistical power asymptotics," Journal of Geographical Systems, Springer, vol. 21(2), pages 237-269, June.
  • Handle: RePEc:kap:jgeosy:v:21:y:2019:i:2:d:10.1007_s10109-019-00293-3
    DOI: 10.1007/s10109-019-00293-3
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10109-019-00293-3
    File Function: Abstract
    Download Restriction: Access to full text is restricted to subscribers.

    File URL: https://libkey.io/10.1007/s10109-019-00293-3?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Yongwan Chun, 2008. "Modeling network autocorrelation within migration flows by eigenvector spatial filtering," Journal of Geographical Systems, Springer, vol. 10(4), pages 317-344, December.
    2. Barry Boots & Michael Tiefelsdorf, 2000. "Global and local spatial autocorrelation in bounded regular tessellations," Journal of Geographical Systems, Springer, vol. 2(4), pages 319-348, December.
    3. Tamara Mata & Carlos Llano, 2013. "Social networks and trade of services: modelling interregional flows with spatial and network autocorrelation effects," Journal of Geographical Systems, Springer, vol. 15(3), pages 319-367, July.
    4. David Haynes & Alex Jokela & Steven Manson, 2018. "IPUMS-Terra: integrated big heterogeneous spatiotemporal data analysis system," Journal of Geographical Systems, Springer, vol. 20(4), pages 343-361, October.
    5. Bivand, Roger & Müller, Werner G. & Reder, Markus, 2009. "Power calculations for global and local Moran's," Computational Statistics & Data Analysis, Elsevier, vol. 53(8), pages 2859-2872, June.
    6. Tao Cheng & James Haworth & Jiaqiu Wang, 2012. "Spatio-temporal autocorrelation of road network data," Journal of Geographical Systems, Springer, vol. 14(4), pages 389-413, October.
    7. Daniel A. Griffith, 2003. "Spatial Autocorrelation and Spatial Filtering," Advances in Spatial Science, Springer, number 978-3-540-24806-4, February.
    8. François Bavaud, 2013. "Testing spatial autocorrelation in weighted networks: the modes permutation test," Journal of Geographical Systems, Springer, vol. 15(3), pages 233-247, July.
    9. Barry Boots, 2003. "Developing local measures of spatial association for categorical data," Journal of Geographical Systems, Springer, vol. 5(2), pages 139-160, August.
    10. Bartels, Cornelis P. A. & Hordijk, Leen, 1977. "On the power of the generalized Moran contiguity coefficient in testing for spatial autocorrelation among regression disturbances," Regional Science and Urban Economics, Elsevier, vol. 7(1-2), pages 83-101, March.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Daniel A. Griffith & Yongwan Chun, 2022. "Some useful details about the Moran coefficient, the Geary ratio, and the join count indices of spatial autocorrelation," Journal of Spatial Econometrics, Springer, vol. 3(1), pages 1-30, December.
    2. Daniel A. Griffith & Yongwan Chun & Monghyeon Lee, 2020. "Deeper Spatial Statistical Insights into Small Geographic Area Data Uncertainty," IJERPH, MDPI, vol. 18(1), pages 1-16, December.
    3. Ilya V. Naumov & Vladislav M. Sedelnikov, 2021. "Scenario modelling and forecasting of spatial transformation in the Russian catering market," Upravlenets, Ural State University of Economics, vol. 12(4), pages 75-91, September.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Daniel A. Griffith & Manfred M. Fischer, 2016. "Constrained Variants of the Gravity Model and Spatial Dependence: Model Specification and Estimation Issues," Advances in Spatial Science, in: Roberto Patuelli & Giuseppe Arbia (ed.), Spatial Econometric Interaction Modelling, chapter 0, pages 37-66, Springer.
    2. Yanguang Chen, 2013. "New Approaches for Calculating Moran’s Index of Spatial Autocorrelation," PLOS ONE, Public Library of Science, vol. 8(7), pages 1-14, July.
    3. Dieter von Fintel & Eldridge Moses, 2017. "Migration and gender in South Africa: following bright lights and the fortunes of others?," Working Papers 09/2017, Stellenbosch University, Department of Economics, revised 2018.
    4. Daniele Fabbri & Silvana Robone, 2010. "The geography of hospital admission in a national health service with patient choice," Health Economics, John Wiley & Sons, Ltd., vol. 19(9), pages 1029-1047, September.
    5. Giuseppe Ricciardo Lamonica & Barbara Zagaglia, 2013. "The determinants of internal mobility in Italy, 1995-2006," Demographic Research, Max Planck Institute for Demographic Research, Rostock, Germany, vol. 29(16), pages 407-440.
    6. Oshan, Taylor M., 2020. "The spatial structure debate in spatial interaction modeling: 50 years on," OSF Preprints 42vxn, Center for Open Science.
    7. Rodolfo Metulini & Roberto Patuelli & Daniel A. Griffith, 2018. "A Spatial-Filtering Zero-Inflated Approach to the Estimation of the Gravity Model of Trade," Econometrics, MDPI, vol. 6(1), pages 1-15, February.
    8. Daisuke Murakami & Daniel Griffith, 2015. "Random effects specifications in eigenvector spatial filtering: a simulation study," Journal of Geographical Systems, Springer, vol. 17(4), pages 311-331, October.
    9. François Bavaud, 2013. "Testing spatial autocorrelation in weighted networks: the modes permutation test," Journal of Geographical Systems, Springer, vol. 15(3), pages 233-247, July.
    10. Fen-Jiao Wang & Chang-Lin Mei & Zhi Zhang & Qiu-Xia Xu, 2022. "Testing for Local Spatial Association Based on Geographically Weighted Interpolation of Geostatistical Data with Application to PM2.5 Concentration Analysis," Sustainability, MDPI, vol. 14(21), pages 1-19, November.
    11. Jungmin Kim & Juyong Park & Wonjae Lee, 2018. "Why do people move? Enhancing human mobility prediction using local functions based on public records and SNS data," PLOS ONE, Public Library of Science, vol. 13(2), pages 1-29, February.
    12. Rafael Lata & Sidonia Proff & Thomas Brenner, 2018. "The influence of distance types on co-patenting and co-publishing in the USA and Europe over time," The Annals of Regional Science, Springer;Western Regional Science Association, vol. 61(1), pages 49-71, July.
    13. Moniruzzaman, Md & Páez, Antonio, 2012. "Accessibility to transit, by transit, and mode share: application of a logistic model with spatial filters," Journal of Transport Geography, Elsevier, vol. 24(C), pages 198-205.
    14. Yongwan Chun & Daniel A. Griffith & Monghyeon Lee & Parmanand Sinha, 2016. "Eigenvector selection with stepwise regression techniques to construct eigenvector spatial filters," Journal of Geographical Systems, Springer, vol. 18(1), pages 67-85, January.
    15. Lan Hu & Yongwan Chun & Daniel A. Griffith, 2020. "Uncovering a positive and negative spatial autocorrelation mixture pattern: a spatial analysis of breast cancer incidences in Broward County, Florida, 2000–2010," Journal of Geographical Systems, Springer, vol. 22(3), pages 291-308, July.
    16. Luc Anselin, 2010. "Thirty years of spatial econometrics," Papers in Regional Science, Wiley Blackwell, vol. 89(1), pages 3-25, March.
    17. Julie Le Gallo & Antonio Páez, 2013. "Using Synthetic Variables in Instrumental Variable Estimation of Spatial Series Models," Environment and Planning A, , vol. 45(9), pages 2227-2242, September.
    18. Michael Tiefelsdorf & Daniel A Griffith, 2007. "Semiparametric Filtering of Spatial Autocorrelation: The Eigenvector Approach," Environment and Planning A, , vol. 39(5), pages 1193-1221, May.
    19. Cupido, Kyran & Jevtić, Petar & Paez, Antonio, 2020. "Spatial patterns of mortality in the United States: A spatial filtering approach," Insurance: Mathematics and Economics, Elsevier, vol. 95(C), pages 28-38.
    20. Min Xu & Chang-Lin Mei & Na Yan, 2014. "A note on the null distribution of the local spatial heteroscedasticity (LOSH) statistic," The Annals of Regional Science, Springer;Western Regional Science Association, vol. 52(3), pages 697-710, May.

    More about this item

    Keywords

    Moran coefficient; Geary ratio; Efficiency; Power; Geographic configuration; Join count statistics;
    All these keywords.

    JEL classification:

    • C12 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - Hypothesis Testing: General
    • C46 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods: Special Topics - - - Specific Distributions
    • C55 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Large Data Sets: Modeling and Analysis

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:kap:jgeosy:v:21:y:2019:i:2:d:10.1007_s10109-019-00293-3. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.