IDEAS home Printed from https://ideas.repec.org/a/kap/jgeosy/v26y2024i3d10.1007_s10109-024-00439-y.html
   My bibliography  Save this article

Rethinking the null hypothesis in significant colocation pattern mining of spatial flows

Author

Listed:
  • Mengjie Zhou

    (Hunan Normal University
    Hunan Key Laboratory of Geospatial Big Data Mining and Application)

  • Mengjie Yang

    (Hunan Normal University)

  • Tinghua Ai

    (Wuhan University)

  • Jiannan Cai

    (The Chinese University of Hong Kong)

  • Zhe Chen

    (Hunan Normal University)

Abstract

Spatial flows represent spatial interactions or movements. Mining colocation patterns of different types of flows may uncover the spatial dependences and associations among flows. Previous studies proposed a flow colocation pattern mining method and established a significance test under the null hypothesis of independence for the results. In fact, the definition of the null hypothesis is crucial in significance testing. Choosing an inappropriate null hypothesis may lead to misunderstandings about the spatial interactions between flows. In practice, the overall distribution patterns of different types of flows may be clustered. In these cases, the null hypothesis of independence will result in unconvincing results. Thus, considering the overall spatial pattern of flows, in this study, we changed the null hypothesis to random labeling to establish the statistical significance of flow colocation patterns. Furthermore, we compared and analyzed the impacts of different null hypotheses on flow colocation pattern mining through synthetic data tests with different preset patterns and situations. Additionally, we used empirical data from ride-hailing trips to show the practicality of the method.

Suggested Citation

  • Mengjie Zhou & Mengjie Yang & Tinghua Ai & Jiannan Cai & Zhe Chen, 2024. "Rethinking the null hypothesis in significant colocation pattern mining of spatial flows," Journal of Geographical Systems, Springer, vol. 26(3), pages 375-405, July.
  • Handle: RePEc:kap:jgeosy:v:26:y:2024:i:3:d:10.1007_s10109-024-00439-y
    DOI: 10.1007/s10109-024-00439-y
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10109-024-00439-y
    File Function: Abstract
    Download Restriction: Access to full text is restricted to subscribers.

    File URL: https://libkey.io/10.1007/s10109-024-00439-y?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Julian Besag & Peter J. Diggle, 1977. "Simple Monte Carlo Tests for Spatial Pattern," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 26(3), pages 327-333, November.
    2. Ceyhan, Elvan, 2009. "Overall and pairwise segregation tests based on nearest neighbor contingency tables," Computational Statistics & Data Analysis, Elsevier, vol. 53(8), pages 2786-2808, June.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Mousaei Sanjerehei, Mohammad, 2011. "Determination of an appropriate quadrat size and shape for detecting association between plant species," Ecological Modelling, Elsevier, vol. 222(10), pages 1790-1792.
    2. Dufour, Jean-Marie, 2006. "Monte Carlo tests with nuisance parameters: A general approach to finite-sample inference and nonstandard asymptotics," Journal of Econometrics, Elsevier, vol. 133(2), pages 443-477, August.
    3. François Bavaud, 2013. "Testing spatial autocorrelation in weighted networks: the modes permutation test," Journal of Geographical Systems, Springer, vol. 15(3), pages 233-247, July.
    4. Diego Giuliani & Giuseppe Arbia & Giuseppe Espa, 2014. "Weighting Ripley’s K-Function to Account for the Firm Dimension in the Analysis of Spatial Concentration," International Regional Science Review, , vol. 37(3), pages 251-272, July.
    5. Grabarnik, Pavel & Myllymäki, Mari & Stoyan, Dietrich, 2011. "Correct testing of mark independence for marked point patterns," Ecological Modelling, Elsevier, vol. 222(23), pages 3888-3894.
    6. LeSage, James & Banerjee, Sudipto & Fischer, Manfred M. & Congdon, Peter, 2009. "Spatial statistics: Methods, models & computation," Computational Statistics & Data Analysis, Elsevier, vol. 53(8), pages 2781-2785, June.
    7. Xiaolan Wu & Tony Grubesic, 2010. "Identifying irregularly shaped crime hot-spots using a multiobjective evolutionary algorithm," Journal of Geographical Systems, Springer, vol. 12(4), pages 409-433, December.
    8. repec:elg:eechap:14395_6 is not listed on IDEAS
    9. Junfu Zhang, 2003. "Revisiting Residential Segregation by Income: A Monte Carlo Test," International Journal of Business and Economics, School of Management Development, Feng Chia University, Taichung, Taiwan, vol. 2(1), pages 27-37, April.
    10. Zhu, Li-Xing & Neuhaus, Georg, 2003. "Conditional tests for elliptical symmetry," Journal of Multivariate Analysis, Elsevier, vol. 84(2), pages 284-298, February.
    11. Saroja Selvanathan, 1991. "Regional Consumption Patterns in Australia: A System‐Wide Analysis," The Economic Record, The Economic Society of Australia, vol. 67(4), pages 338-345, December.
    12. S Openshaw, 1979. "A Methodology for Using Models for Planning Purposes," Environment and Planning A, , vol. 11(8), pages 879-896, August.
    13. Diks Cees & Manzan Sebastiano, 2002. "Tests for Serial Independence and Linearity Based on Correlation Integrals," Studies in Nonlinear Dynamics & Econometrics, De Gruyter, vol. 6(2), pages 1-22, July.
    14. Ghorbani, Mohammad & Vafaei, Nafiseh & Dvořák, Jiří & Myllymäki, Mari, 2021. "Testing the first-order separability hypothesis for spatio-temporal point patterns," Computational Statistics & Data Analysis, Elsevier, vol. 161(C).
    15. Zack W. Almquist & Carter T. Butts, 2012. "Point process models for household distributions within small areal units," Demographic Research, Max Planck Institute for Demographic Research, Rostock, Germany, vol. 26(22), pages 593-632.
    16. M. Lieshout, 2006. "A J-Function for Marked Point Patterns," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 58(2), pages 235-259, June.
    17. Arbia, G. & Espa, G. & Giuliani, D. & Mazzitelli, A., 2012. "Clusters of firms in an inhomogeneous space: The high-tech industries in Milan," Economic Modelling, Elsevier, vol. 29(1), pages 3-11.
    18. Giuseppe Arbia & Giuseppe Espa & Diego Giuliani & Rocco Micciolo, 2017. "A spatial analysis of health and pharmaceutical firm survival," Journal of Applied Statistics, Taylor & Francis Journals, vol. 44(9), pages 1560-1575, July.
    19. Mugglestone, Moira A. & Renshaw, Eric, 1996. "A practical guide to the spectral analysis of spatial point processes," Computational Statistics & Data Analysis, Elsevier, vol. 21(1), pages 43-65, January.
    20. Giuseppe Espa & Giuseppe Arbia & Diego Giuliani, 2013. "Conditional versus unconditional industrial agglomeration: disentangling spatial dependence and spatial heterogeneity in the analysis of ICT firms’ distribution in Milan," Journal of Geographical Systems, Springer, vol. 15(1), pages 31-50, January.
    21. Subhankar Ghosh & Jayant Gupta & Arun Sharma & Shuai An & Shashi Shekhar, 2024. "Reducing False Discoveries in Statistically-Significant Regional-Colocation Mining: A Summary of Results," Papers 2407.02536, arXiv.org.

    More about this item

    Keywords

    Colocation pattern; Spatial flow; Statistical significance test; Null hypothesis; Random labeling;
    All these keywords.

    JEL classification:

    • C21 - Mathematical and Quantitative Methods - - Single Equation Models; Single Variables - - - Cross-Sectional Models; Spatial Models; Treatment Effect Models

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:kap:jgeosy:v:26:y:2024:i:3:d:10.1007_s10109-024-00439-y. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.