IDEAS home Printed from https://ideas.repec.org/a/vrs/coecre/v19y2016i5p5-24n1.html
   My bibliography  Save this article

A Statistical Toolbox For Mining And Modeling Spatial Data

Author

Listed:
  • D’Aubigny Gérard

    (Professor at the University of Grenoble-Alpes France)

Abstract

Most data mining projects in spatial economics start with an evaluation of a set of attribute variables on a sample of spatial entities, looking for the existence and strength of spatial autocorrelation, based on the Moran’s and the Geary’s coefficients, the adequacy of which is rarely challenged, despite the fact that when reporting on their properties, many users seem likely to make mistakes and to foster confusion. My paper begins by a critical appraisal of the classical definition and rational of these indices. I argue that while intuitively founded, they are plagued by an inconsistency in their conception. Then, I propose a principled small change leading to corrected spatial autocorrelation coefficients, which strongly simplifies their relationship, and opens the way to an augmented toolbox of statistical methods of dimension reduction and data visualization, also useful for modeling purposes. A second section presents a formal framework, adapted from recent work in statistical learning, which gives theoretical support to our definition of corrected spatial autocorrelation coefficients. More specifically, the multivariate data mining methods presented here, are easily implementable on the existing (free) software, yield methods useful to exploit the proposed corrections in spatial data analysis practice, and, from a mathematical point of view, whose asymptotic behavior, already studied in a series of papers by Belkin & Niyogi, suggests that they own qualities of robustness and a limited sensitivity to the Modifiable Areal Unit Problem (MAUP), valuable in exploratory spatial data analysis.

Suggested Citation

  • D’Aubigny Gérard, 2016. "A Statistical Toolbox For Mining And Modeling Spatial Data," Comparative Economic Research, Sciendo, vol. 19(5), pages 5-24, December.
  • Handle: RePEc:vrs:coecre:v:19:y:2016:i:5:p:5-24:n:1
    DOI: 10.1515/cer-2016-0035
    as

    Download full text from publisher

    File URL: https://doi.org/10.1515/cer-2016-0035
    Download Restriction: no

    File URL: https://libkey.io/10.1515/cer-2016-0035?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Daniel A. Griffith, 2003. "Spatial Autocorrelation and Spatial Filtering," Advances in Spatial Science, Springer, number 978-3-540-24806-4.
    2. Daniel A. Griffith, 2000. "A linear regression solution to the spatial autocorrelation problem," Journal of Geographical Systems, Springer, vol. 2(2), pages 141-156, July.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Reinhold Kosfeld & Christian Dreger & Hans-Friedrich Eckey, 2008. "On the stability of the German Beveridge curve: a spatial econometric perspective," The Annals of Regional Science, Springer;Western Regional Science Association, vol. 42(4), pages 967-986, December.
    2. Daniel A. Griffith & Manfred M. Fischer, 2016. "Constrained Variants of the Gravity Model and Spatial Dependence: Model Specification and Estimation Issues," Advances in Spatial Science, in: Roberto Patuelli & Giuseppe Arbia (ed.), Spatial Econometric Interaction Modelling, chapter 0, pages 37-66, Springer.
    3. Hans-Friedrich Eckey & Reinhold Kosfeld & Matthias Türck, 2007. "Regionale Entwicklung mit und ohne räumliche Spillover-Effekte," Review of Regional Research: Jahrbuch für Regionalwissenschaft, Springer;Gesellschaft für Regionalforschung (GfR), vol. 27(1), pages 23-42, February.
    4. Gloria Alarcón-García & José Daniel Buendía Azorín & María del Mar Sánchez de la Vega, 2020. "Shadow economy and national culture: A spatial approach," Hacienda Pública Española / Review of Public Economics, IEF, vol. 232(1), pages 53-74, March.
    5. Daniele Fabbri & Silvana Robone, 2010. "The geography of hospital admission in a national health service with patient choice," Health Economics, John Wiley & Sons, Ltd., vol. 19(9), pages 1029-1047, September.
    6. Christoph Grimpe & Roberto Patuelli, 2011. "Regional knowledge production in nanomaterials: a spatial filtering approach," The Annals of Regional Science, Springer;Western Regional Science Association, vol. 46(3), pages 519-541, June.
    7. Yongwan Chun, 2008. "Modeling network autocorrelation within migration flows by eigenvector spatial filtering," Journal of Geographical Systems, Springer, vol. 10(4), pages 317-344, December.
    8. Umber, Marc P. & Grote, Michael H. & Frey, Rainer, 2014. "Same as it ever was? Europe's national borders and the market for corporate control," Journal of International Money and Finance, Elsevier, vol. 40(C), pages 109-127.
    9. Manfred M. Fischer & Daniel A. Griffith, 2008. "Modeling Spatial Autocorrelation In Spatial Interaction Data: An Application To Patent Citation Data In The European Union," Journal of Regional Science, Wiley Blackwell, vol. 48(5), pages 969-989, December.
    10. Roberto Patuelli & Norbert Schanne & Daniel A. Griffith & Peter Nijkamp, 2012. "Persistence Of Regional Unemployment: Application Of A Spatial Filtering Approach To Local Labor Markets In Germany," Journal of Regional Science, Wiley Blackwell, vol. 52(2), pages 300-323, May.
    11. Gloria Alarcón García & José Daniel Buendía Azorín & María del Mar Sánchez de la Vega, 2018. "Tax Evasion in Europe: An Analysis Based on Spatial Dependence," Social Science Quarterly, Southwestern Social Science Association, vol. 99(1), pages 7-23, March.
    12. Enrico Marelli & Roberto Patuelli & Marcello Signorelli, 2012. "Regional unemployment in the EU before and after the global crisis," Post-Communist Economies, Taylor & Francis Journals, vol. 24(2), pages 155-175, January.
    13. Timo Mitze & Falk Strotebeck, 2012. "What Drives Regional Cooperative Behavior in German Biotechnology? Embedding Social Network Analysis in a Regression Framework," ERSA conference papers ersa12p629, European Regional Science Association.
    14. Roberto Patuelli & Daniel A. Griffith & Michael Tiefelsdorf & Peter Nijkamp, 2011. "Spatial Filtering and Eigenvector Stability: Space-Time Models for German Unemployment Data," International Regional Science Review, , vol. 34(2), pages 253-280, April.
    15. Roberto Patuelli & Norbert Schanne & Daniel A. Griffith & Peter Nijkamp, 2012. "Persistence Of Regional Unemployment: Application Of A Spatial Filtering Approach To Local Labor Markets In Germany," Journal of Regional Science, Wiley Blackwell, vol. 52(2), pages 300-323, May.
    16. Moniruzzaman, Md & Páez, Antonio, 2012. "Accessibility to transit, by transit, and mode share: application of a logistic model with spatial filters," Journal of Transport Geography, Elsevier, vol. 24(C), pages 198-205.
    17. Buendía Azorín, José Daniel. & Sánchez De La Vega, Mª Del Mar, 2017. "Estimación del valor añadido bruto, dependencia espacial y datos de panel: Evidencia en el caso de los municipios de la Región de Murcia /Estimation of Gross Value Added, Spatial Dependence and Panel ," Estudios de Economia Aplicada, Estudios de Economia Aplicada, vol. 35, pages 315-340, Mayo.
    18. Matías Mayor & Roberto Patuelli, 2012. "Short-Run Regional Forecasts: Spatial Models through Varying Cross-Sectional and Temporal Dimensions," Advances in Spatial Science, in: Esteban Fernández Vázquez & Fernando Rubiera Morollón (ed.), Defining the Spatial Scale in Modern Regional Analysis, edition 127, chapter 0, pages 173-192, Springer.
    19. Eckey, Hans-Friedrich & Türck, Matthias, 2007. "Convergence of EU-Regions. A Literature Report," INVESTIGACIONES REGIONALES - Journal of REGIONAL RESEARCH, Asociación Española de Ciencia Regional, issue 10, pages 5-32.
    20. Daniel A Griffith, 2008. "Spatial-Filtering-Based Contributions to a Critique of Geographically Weighted Regression (GWR)," Environment and Planning A, , vol. 40(11), pages 2751-2769, November.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:vrs:coecre:v:19:y:2016:i:5:p:5-24:n:1. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Peter Golla (email available below). General contact details of provider: https://www.sciendo.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.