IDEAS home Printed from https://ideas.repec.org/a/gam/jijerp/v18y2020i1p231-d470716.html
   My bibliography  Save this article

Deeper Spatial Statistical Insights into Small Geographic Area Data Uncertainty

Author

Listed:
  • Daniel A. Griffith

    (School of Economic, Political and Policy Sciences, The University of Texas at Dallas, 800 West Campbell Road, Richardson, TX 75080, USA)

  • Yongwan Chun

    (School of Economic, Political and Policy Sciences, The University of Texas at Dallas, 800 West Campbell Road, Richardson, TX 75080, USA)

  • Monghyeon Lee

    (Memory Business Division, Samsung Electronics Co. Ltd., 1, Samsungjeonja-ro, Hwaseong-si, Gyeonggi-do 18448, Korea)

Abstract

Small areas refer to small geographic areas, a more literal meaning of the phrase, as well as small domains (e.g., small sub-populations), a more figurative meaning of the phrase. With post-stratification, even with big data, either case can encounter the problem of small local sample sizes, which tend to inflate local uncertainty and undermine otherwise sound statistical analyses. This condition is the opposite of that afflicting statistical significance in the context of big data. These two definitions can also occur jointly, such as during the standardization of data: small geographic units may contain small populations, which in turn have small counts in various age cohorts. Accordingly, big spatial data can become not-so-big spatial data after post-stratification by geography and, for example, by age cohorts. This situation can be ameliorated to some degree by the large volume of and high velocity of big spatial data. However, the variety of any big spatial data may well exacerbate this situation, compromising veracity in terms of bias, noise, and abnormalities in these data. The purpose of this paper is to establish deeper insights into big spatial data with regard to their uncertainty through one of the hallmarks of georeferenced data, namely spatial autocorrelation, coupled with small geographic areas. Impacts of interest concern the nature, degree, and mixture of spatial autocorrelation. The cancer data employed (from Florida for 2001–2010) represent a data category that is beginning to enter the realm of big spatial data; its volume, velocity, and variety are increasing through the widespread use of digital medical records.

Suggested Citation

  • Daniel A. Griffith & Yongwan Chun & Monghyeon Lee, 2020. "Deeper Spatial Statistical Insights into Small Geographic Area Data Uncertainty," IJERPH, MDPI, vol. 18(1), pages 1-16, December.
  • Handle: RePEc:gam:jijerp:v:18:y:2020:i:1:p:231-:d:470716
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/1660-4601/18/1/231/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/1660-4601/18/1/231/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Khalid Al-Ahmadi & Ali Al-Zahrani, 2013. "Spatial Autocorrelation of Cancer Incidence in Saudi Arabia," IJERPH, MDPI, vol. 10(12), pages 1-22, December.
    2. Àlex Costa & Albert Satorra & Eva Ventura, 2003. "An empirical evaluation of small area estimators," Economics Working Papers 674, Department of Economics and Business, Universitat Pompeu Fabra, revised Jun 2003.
    3. Jenish, Nazgul & Prucha, Ingmar R., 2009. "Central limit theorems and uniform laws of large numbers for arrays of random fields," Journal of Econometrics, Elsevier, vol. 150(1), pages 86-98, May.
    4. Monghyeon Lee & Yongwan Chun & Daniel A. Griffith, 2019. "An evaluation of kernel smoothing to protect the confidentiality of individual locations," International Journal of Urban Sciences, Taylor & Francis Journals, vol. 23(3), pages 335-351, July.
    5. Daniel A. Griffith, 2020. "A Family of Correlated Observations: From Independent to Strongly Interrelated Ones," Stats, MDPI, vol. 3(3), pages 1-19, June.
    6. Schelling, Thomas C, 1969. "Models of Segregation," American Economic Review, American Economic Association, vol. 59(2), pages 488-493, May.
    7. Daniel A. Griffith, 2003. "Spatial Autocorrelation and Spatial Filtering," Advances in Spatial Science, Springer, number 978-3-540-24806-4.
    8. Qing Luo & Daniel A. Griffith & Huayi Wu, 2019. "Spatial autocorrelation for massive spatial data: verification of efficiency and statistical power asymptotics," Journal of Geographical Systems, Springer, vol. 21(2), pages 237-269, June.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Jiping Cao & Hartwig H. Hochmair & Fisal Basheeh, 2022. "The Effect of Twitter App Policy Changes on the Sharing of Spatial Information through Twitter Users," Geographies, MDPI, vol. 2(3), pages 1-14, September.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Daniel A. Griffith & Yongwan Chun, 2022. "Some useful details about the Moran coefficient, the Geary ratio, and the join count indices of spatial autocorrelation," Journal of Spatial Econometrics, Springer, vol. 3(1), pages 1-30, December.
    2. Luc Anselin, 2010. "Thirty years of spatial econometrics," Papers in Regional Science, Wiley Blackwell, vol. 89(1), pages 3-25, March.
    3. Daniel A. Griffith, 2019. "Negative Spatial Autocorrelation: One of the Most Neglected Concepts in Spatial Statistics," Stats, MDPI, vol. 2(3), pages 1-28, August.
    4. Gautier, Pieter & van Vuuren, Aico & Siegmann, Arjen, 2007. "The Effect of the Theo van Gogh Murder on House Prices in Amsterdam," CEPR Discussion Papers 6175, C.E.P.R. Discussion Papers.
    5. El Machkouri, Mohamed & Volný, Dalibor & Wu, Wei Biao, 2013. "A central limit theorem for stationary random fields," Stochastic Processes and their Applications, Elsevier, vol. 123(1), pages 1-14.
    6. Francesco Andreoli & Eugenio Peluso, 2016. "So close yet so unequal: Reconsidering spatial inequality in U.S. cities," Working Papers 21/2016, University of Verona, Department of Economics.
    7. Jeremy Pais & Scott South & Kyle Crowder, 2009. "White Flight Revisited: A Multiethnic Perspective on Neighborhood Out-Migration," Population Research and Policy Review, Springer;Southern Demographic Association (SDA), vol. 28(3), pages 321-346, June.
    8. Gandica, Yerali & Gargiulo, Floriana & Carletti, Timoteo, 2016. "Can topology reshape segregation patterns?," Chaos, Solitons & Fractals, Elsevier, vol. 90(C), pages 46-54.
    9. Lindbeck, Assar, 1997. "Incentives and Social Norms in Household Behavior," American Economic Review, American Economic Association, vol. 87(2), pages 370-377, May.
    10. Karla Hoff & Arijit Sen, 2005. "Homeownership, Community Interactions, and Segregation," American Economic Review, American Economic Association, vol. 95(4), pages 1167-1189, September.
    11. Nick Drydakis, 2008. "Integrated Roma Earnings: A Multivariate Analysis for the Discrimination Hypothesis in Greece," Working Papers 0829, University of Crete, Department of Economics.
    12. Sauer, Johannes & Zilberman, David, 2009. "Innovation Behaviour At Farm Level – Selection And Identification," 83rd Annual Conference, March 30 - April 1, 2009, Dublin, Ireland 51073, Agricultural Economics Society.
    13. Steven N. Durlauf, 1996. "Statistical Mechanics Approaches to Socioeconomic Behavior," NBER Technical Working Papers 0203, National Bureau of Economic Research, Inc.
    14. Luis Alvarez & Bruno Ferman, 2020. "Inference in Difference-in-Differences with Few Treated Units and Spatial Correlation," Papers 2006.16997, arXiv.org, revised Apr 2023.
    15. Zhiwei Cui & Yan-An Hwang, 2017. "House exchange and residential segregation in networks," International Journal of Game Theory, Springer;Game Theory Society, vol. 46(1), pages 125-147, March.
    16. Patrick Bayer & Robert McMillan & Kim Rueben, 2004. "An Equilibrium Model of Sorting in an Urban Housing Market," NBER Working Papers 10865, National Bureau of Economic Research, Inc.
    17. Allouch, Nizar, 2017. "The cost of segregation in (social) networks," Games and Economic Behavior, Elsevier, vol. 106(C), pages 329-342.
    18. Tse-Chuan Yang & Stephen A Matthews, 2015. "Death by Segregation: Does the Dimension of Racial Segregation Matter?," PLOS ONE, Public Library of Science, vol. 10(9), pages 1-26, September.
    19. Joshua M. Epstein, 2007. "Agent-Based Computational Models and Generative Social Science," Introductory Chapters, in: Generative Social Science Studies in Agent-Based Computational Modeling, Princeton University Press.
    20. Reinhold Kosfeld & Christian Dreger & Hans-Friedrich Eckey, 2008. "On the stability of the German Beveridge curve: a spatial econometric perspective," The Annals of Regional Science, Springer;Western Regional Science Association, vol. 42(4), pages 967-986, December.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jijerp:v:18:y:2020:i:1:p:231-:d:470716. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.