IDEAS home Printed from https://ideas.repec.org/a/eee/csdana/v191y2024ics016794732300186x.html
   My bibliography  Save this article

Integrating machine learning and Bayesian nonparametrics for flexible modeling of point pattern data

Author

Listed:
  • Heaton, Matthew J.
  • Dahl, Benjamin K.
  • Dayley, Caleb
  • Warr, Richard L.
  • White, Philip

Abstract

Two common approaches to analyze point pattern (location-only) data are mixture models and log-Gaussian Cox processes. The former provides a flexible model for the intensity surface at the expense of no covariate effect estimates while the latter estimates covariate effects at the expense of computation. A bridge is built between these two methods that leverages the strengths of both approaches. Namely, Bayesian nonparametrics are first used to flexibly model the intensity surface. The posterior draws of the fitted intensity surface are then transformed into the equivalent representation under the log-Gaussian Cox process approach. Using principles of machine learning, estimates of covariate effects are obtained. The proposed two-step approach results in accurate estimates of parameters, with proper uncertainty quantification, which is illustrated with real and simulated examples.

Suggested Citation

  • Heaton, Matthew J. & Dahl, Benjamin K. & Dayley, Caleb & Warr, Richard L. & White, Philip, 2024. "Integrating machine learning and Bayesian nonparametrics for flexible modeling of point pattern data," Computational Statistics & Data Analysis, Elsevier, vol. 191(C).
  • Handle: RePEc:eee:csdana:v:191:y:2024:i:c:s016794732300186x
    DOI: 10.1016/j.csda.2023.107875
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S016794732300186X
    Download Restriction: Full text for ScienceDirect subscribers only.

    File URL: https://libkey.io/10.1016/j.csda.2023.107875?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. D. Simpson & J. B. Illian & F. Lindgren & S. H. Sørbye & H. Rue, 2016. "Going off grid: computationally efficient inference for log-Gaussian Cox processes," Biometrika, Biometrika Trust, vol. 103(1), pages 49-70.
    2. Matthew J. Heaton & Abhirup Datta & Andrew O. Finley & Reinhard Furrer & Joseph Guinness & Rajarshi Guhaniyogi & Florian Gerber & Robert B. Gramacy & Dorit Hammerling & Matthias Katzfuss & Finn Lindgr, 2019. "A Case Study Competition Among Methods for Analyzing Large Spatial Data," Journal of Agricultural, Biological and Environmental Statistics, Springer;The International Biometric Society;American Statistical Association, vol. 24(3), pages 398-425, September.
    3. Jacob W. Mortensen & Matthew J. Heaton & Olga V. Wilhelmi, 2018. "Urban heat risk mapping using multiple point patterns in Houston, Texas," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 67(1), pages 83-102, January.
    4. Gneiting, Tilmann & Raftery, Adrian E., 2007. "Strictly Proper Scoring Rules, Prediction, and Estimation," Journal of the American Statistical Association, American Statistical Association, vol. 102, pages 359-378, March.
    5. Zhengyi Zhou & David S. Matteson & Dawn B. Woodard & Shane G. Henderson & Athanasios C. Micheas, 2015. "A Spatio-Temporal Point Process Model for Ambulance Demand," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 110(509), pages 6-15, March.
    6. Brian J. Reich & James S. Hodges & Vesna Zadnik, 2006. "Effects of Residual Smoothing on the Posterior of the Fixed Effects in Disease-Mapping Models," Biometrics, The International Biometric Society, vol. 62(4), pages 1197-1206, December.
    7. Jieying Jiao & Guanyu Hu & Jun Yan, 2021. "Heterogeneity pursuit for spatial point pattern with application to tree locations: A Bayesian semiparametric recourse," Environmetrics, John Wiley & Sons, Ltd., vol. 32(7), November.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Ying C. MacNab, 2018. "Some recent work on multivariate Gaussian Markov random fields," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 27(3), pages 497-541, September.
    2. Isabelle Grenier & Bruno Sansó & Jessica L. Matthews, 2024. "Multivariate nearest‐neighbors Gaussian processes with random covariance matrices," Environmetrics, John Wiley & Sons, Ltd., vol. 35(3), May.
    3. Paige, John & Fuglstad, Geir-Arne & Riebler, Andrea & Wakefield, Jon, 2022. "Bayesian multiresolution modeling of georeferenced data: An extension of ‘LatticeKrig’," Computational Statistics & Data Analysis, Elsevier, vol. 173(C).
    4. John M. Humphreys & Robert B. Srygley & David H. Branson, 2022. "Geographic Variation in Migratory Grasshopper Recruitment under Projected Climate Change," Geographies, MDPI, vol. 2(1), pages 1-19, January.
    5. Quan Vu & Yi Cao & Josh Jacobson & Alan R. Pearse & Andrew Zammit-Mangion, 2021. "Discussion on “Competition on Spatial Statistics for Large Datasets”," Journal of Agricultural, Biological and Environmental Statistics, Springer;The International Biometric Society;American Statistical Association, vol. 26(4), pages 614-618, December.
    6. Jennifer F. Bobb & Maricela F. Cruz & Stephen J. Mooney & Adam Drewnowski & David Arterburn & Andrea J. Cook, 2022. "Accounting for spatial confounding in epidemiological studies with individual‐level exposures: An exposure‐penalized spline approach," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 185(3), pages 1271-1293, July.
    7. Uddin, Md Nazir & Gaskins, Jeremy T., 2023. "Shared Bayesian variable shrinkage in multinomial logistic regression," Computational Statistics & Data Analysis, Elsevier, vol. 177(C).
    8. Bevilacqua, Moreno & Caamaño-Carrillo, Christian & Porcu, Emilio, 2022. "Unifying compactly supported and Matérn covariance functions in spatial statistics," Journal of Multivariate Analysis, Elsevier, vol. 189(C).
    9. Lu Zhang & Sudipto Banerjee, 2022. "Spatial factor modeling: A Bayesian matrix‐normal approach for misaligned data," Biometrics, The International Biometric Society, vol. 78(2), pages 560-573, June.
    10. Janine B. Illian & David F. R. P. Burslem, 2017. "Improving the usability of spatial point process methodology: an interdisciplinary dialogue between statistics and ecology," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 101(4), pages 495-520, October.
    11. Jorge Sicacha-Parada & Diego Pavon-Jordan & Ingelin Steinsland & Roel May & Bård Stokke & Ingar Jostein Øien, 2022. "A Spatial Modeling Framework for Monitoring Surveys with Different Sampling Protocols with a Case Study for Bird Abundance in Mid-Scandinavia," Journal of Agricultural, Biological and Environmental Statistics, Springer;The International Biometric Society;American Statistical Association, vol. 27(3), pages 562-591, September.
    12. G. Vicente & T. Goicoa & P. Fernandez‐Rasines & M. D. Ugarte, 2020. "Crime against women in India: unveiling spatial patterns and temporal trends of dowry deaths in the districts of Uttar Pradesh," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 183(2), pages 655-679, February.
    13. Lu Zhang & Sudipto Banerjee & Andrew O. Finley, 2021. "High‐dimensional multivariate geostatistics: A Bayesian matrix‐normal approach," Environmetrics, John Wiley & Sons, Ltd., vol. 32(4), June.
    14. Moreno Bevilacqua & Christian Caamaño‐Carrillo & Carlo Gaetan, 2020. "On modeling positive continuous data with spatiotemporal dependence," Environmetrics, John Wiley & Sons, Ltd., vol. 31(7), November.
    15. K. Shuvo Bakar & Nicholas Biddle & Philip Kokic & Huidong Jin, 2020. "A Bayesian spatial categorical model for prediction to overlapping geographical areas in sample surveys," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 183(2), pages 535-563, February.
    16. Azar, Pablo D. & Micali, Silvio, 2018. "Computational principal agent problems," Theoretical Economics, Econometric Society, vol. 13(2), May.
    17. Angelica Gianfreda & Francesco Ravazzolo & Luca Rossini, 2023. "Large Time‐Varying Volatility Models for Hourly Electricity Prices," Oxford Bulletin of Economics and Statistics, Department of Economics, University of Oxford, vol. 85(3), pages 545-573, June.
    18. Tobias Fissler & Yannick Hoga, 2024. "How to Compare Copula Forecasts?," Papers 2410.04165, arXiv.org.
    19. Davide Pettenuzzo & Francesco Ravazzolo, 2016. "Optimal Portfolio Choice Under Decision‐Based Model Combinations," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 31(7), pages 1312-1332, November.
    20. Rubio, F.J. & Steel, M.F.J., 2011. "Inference for grouped data with a truncated skew-Laplace distribution," Computational Statistics & Data Analysis, Elsevier, vol. 55(12), pages 3218-3231, December.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:csdana:v:191:y:2024:i:c:s016794732300186x. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/csda .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.