IDEAS home Printed from https://ideas.repec.org/a/eee/csdana/v161y2021ics0167947321000748.html
   My bibliography  Save this article

Combining heterogeneous spatial datasets with process-based spatial fusion models: A unifying framework

Author

Listed:
  • Wang, Craig
  • Furrer, Reinhard

Abstract

In modern spatial statistics, the structure of data has become more heterogeneous. Depending on the types of spatial data, different modeling strategies are used. For example, kriging approaches for geostatistical data; Gaussian Markov random field models for lattice data; or log Gaussian Cox process models for point-pattern data. Despite these different modeling choices, the nature of underlying data-generating (latent) processes is often the same, which can be represented by some continuous spatial surfaces. A unifying framework is introduced for process-based multivariate spatial fusion models. The framework can jointly analyze all three aforementioned types of spatial data or any combinations thereof. Moreover, the framework accommodates different likelihoods for geostatistical and lattice data. It is shown that some established approaches, such as linear models of coregionalization, can be viewed as special cases of the proposed framework. A flexible and scalable implementation using R-INLA is provided. Simulation studies confirm that the prediction of latent processes improves as one moves from univariate spatial models to multivariate spatial fusion models. The framework is illustrated via a case study using datasets from a cross-sectional study linked with a national cohort in Switzerland. The differences in underlying spatial risks between respiratory disease and lung cancer are examined in the case study.

Suggested Citation

  • Wang, Craig & Furrer, Reinhard, 2021. "Combining heterogeneous spatial datasets with process-based spatial fusion models: A unifying framework," Computational Statistics & Data Analysis, Elsevier, vol. 161(C).
  • Handle: RePEc:eee:csdana:v:161:y:2021:i:c:s0167947321000748
    DOI: 10.1016/j.csda.2021.107240
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0167947321000748
    Download Restriction: Full text for ScienceDirect subscribers only.

    File URL: https://libkey.io/10.1016/j.csda.2021.107240?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. D. Simpson & J. B. Illian & F. Lindgren & S. H. Sørbye & H. Rue, 2016. "Going off grid: computationally efficient inference for log-Gaussian Cox processes," Biometrika, Biometrika Trust, vol. 103(1), pages 49-70.
    2. Xiaoping Jin & Bradley P. Carlin & Sudipto Banerjee, 2005. "Generalized Hierarchical Multivariate CAR Models for Areal Data," Biometrics, The International Biometric Society, vol. 61(4), pages 950-961, December.
    3. Peter Diggle & Søren Lophaven, 2006. "Bayesian Geostatistical Design," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 33(1), pages 53-64, March.
    4. Qian Ren & Sudipto Banerjee, 2013. "Hierarchical Factor Models for Large Spatially Misaligned Data: A Low-Rank Predictive Process Approach," Biometrics, The International Biometric Society, vol. 69(1), pages 19-30, March.
    5. Abhirup Datta & Sudipto Banerjee & Andrew O. Finley & Alan E. Gelfand, 2016. "Hierarchical Nearest-Neighbor Gaussian Process Models for Large Geostatistical Datasets," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 111(514), pages 800-812, April.
    6. Sudipto Banerjee & Alan E. Gelfand & Andrew O. Finley & Huiyan Sang, 2008. "Gaussian predictive process models for large spatial data sets," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 70(4), pages 825-848, September.
    7. Julian Besag & Jeremy York & Annie Mollié, 1991. "Bayesian image restoration, with two applications in spatial statistics," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 43(1), pages 1-20, March.
    8. Kelsall J. & Eld J.W., 2002. "Modeling Spatial Variation in Disease Risk: A Geostatistical Approach," Journal of the American Statistical Association, American Statistical Association, vol. 97, pages 692-701, September.
    9. Håvard Rue & Sara Martino & Nicolas Chopin, 2009. "Approximate Bayesian inference for latent Gaussian models by using integrated nested Laplace approximations," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 71(2), pages 319-392, April.
    10. Sujit K. Sahu & Alan E. Gelfand & David M. Holland, 2010. "Fusing point and areal level space–time data with application to wet deposition," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 59(1), pages 77-103, January.
    11. Finn Lindgren & Håvard Rue & Johan Lindström, 2011. "An explicit link between Gaussian fields and Gaussian Markov random fields: the stochastic partial differential equation approach," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 73(4), pages 423-498, September.
    12. Martins, Thiago G. & Simpson, Daniel & Lindgren, Finn & Rue, Håvard, 2013. "Bayesian computing with INLA: New features," Computational Statistics & Data Analysis, Elsevier, vol. 67(C), pages 68-83.
    13. Noel Cressie & Gardar Johannesson, 2008. "Fixed rank kriging for very large spatial data sets," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 70(1), pages 209-226, February.
    14. Leonhard Knorr‐Held & Nicola G. Best, 2001. "A shared component model for detecting joint and selective clustering of two diseases," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 164(1), pages 73-85.
    15. Lindgren, Finn & Rue, Håvard, 2015. "Bayesian Spatial Modelling with R-INLA," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 63(i19).
    16. Geir-Arne Fuglstad & Daniel Simpson & Finn Lindgren & Håvard Rue, 2019. "Constructing Priors that Penalize the Complexity of Gaussian Random Fields," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 114(525), pages 445-452, January.
    17. Montserrat Fuentes & Adrian E. Raftery, 2005. "Model Evaluation and Spatial Interpolation by Bayesian Combination of Observations with Outputs from Numerical Models," Biometrics, The International Biometric Society, vol. 61(1), pages 36-45, March.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Chen, Yewen & Chang, Xiaohui & Luo, Fangzhi & Huang, Hui, 2023. "Additive dynamic models for correcting numerical model outputs," Computational Statistics & Data Analysis, Elsevier, vol. 187(C).
    2. Sameh Abdulah & Yuxiao Li & Jian Cao & Hatem Ltaief & David E. Keyes & Marc G. Genton & Ying Sun, 2023. "Large‐scale environmental data science with ExaGeoStatR," Environmetrics, John Wiley & Sons, Ltd., vol. 34(1), February.
    3. Peter A. Gao & Hannah M. Director & Cecilia M. Bitz & Adrian E. Raftery, 2022. "Probabilistic Forecasts of Arctic Sea Ice Thickness," Journal of Agricultural, Biological and Environmental Statistics, Springer;The International Biometric Society;American Statistical Association, vol. 27(2), pages 280-302, June.
    4. C. Forlani & S. Bhatt & M. Cameletti & E. Krainski & M. Blangiardo, 2020. "A joint Bayesian space–time model to integrate spatially misaligned air pollution data in R‐INLA," Environmetrics, John Wiley & Sons, Ltd., vol. 31(8), December.
    5. Mauricio Campos & Bo Li & Guillaume Lafontaine & Joseph Napier & Feng Sheng Hu, 2024. "Integrating Different Data Sources Using a Bayesian Hierarchical Model to Unveil Glacial Refugia," Journal of Agricultural, Biological and Environmental Statistics, Springer;The International Biometric Society;American Statistical Association, vol. 29(3), pages 576-600, September.
    6. John M. Humphreys & Robert B. Srygley & David H. Branson, 2022. "Geographic Variation in Migratory Grasshopper Recruitment under Projected Climate Change," Geographies, MDPI, vol. 2(1), pages 1-19, January.
    7. Jialuo Liu & Tingjin Chu & Jun Zhu & Haonan Wang, 2022. "Large spatial data modeling and analysis: A Krylov subspace approach," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 49(3), pages 1115-1143, September.
    8. Zilber, Daniel & Katzfuss, Matthias, 2021. "Vecchia–Laplace approximations of generalized Gaussian processes for big non-Gaussian spatial data," Computational Statistics & Data Analysis, Elsevier, vol. 153(C).
    9. I. Gede Nyoman Mindra Jaya & Henk Folmer, 2022. "Spatiotemporal high-resolution prediction and mapping: methodology and application to dengue disease," Journal of Geographical Systems, Springer, vol. 24(4), pages 527-581, October.
    10. Matthias Katzfuss & Joseph Guinness & Wenlong Gong & Daniel Zilber, 2020. "Vecchia Approximations of Gaussian-Process Predictions," Journal of Agricultural, Biological and Environmental Statistics, Springer;The International Biometric Society;American Statistical Association, vol. 25(3), pages 383-414, September.
    11. Bondo, Kristin J. & Rosenberry, Christopher S. & Stainbrook, David & Walter, W. David, 2024. "Comparing risk of chronic wasting disease occurrence using Bayesian hierarchical spatial models and different surveillance types," Ecological Modelling, Elsevier, vol. 493(C).
    12. Jonathan Wakefield & Taylor Okonek & Jon Pedersen, 2020. "Small Area Estimation for Disease Prevalence Mapping," International Statistical Review, International Statistical Institute, vol. 88(2), pages 398-418, August.
    13. Zhang, Shen & Liu, Xin & Tang, Jinjun & Cheng, Shaowu & Qi, Yong & Wang, Yinhai, 2018. "Spatio-temporal modeling of destination choice behavior through the Bayesian hierarchical approach," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 512(C), pages 537-551.
    14. Marchetti, Yuliya & Nguyen, Hai & Braverman, Amy & Cressie, Noel, 2018. "Spatial data compression via adaptive dispersion clustering," Computational Statistics & Data Analysis, Elsevier, vol. 117(C), pages 138-153.
    15. Matthew J. Heaton & Abhirup Datta & Andrew O. Finley & Reinhard Furrer & Joseph Guinness & Rajarshi Guhaniyogi & Florian Gerber & Robert B. Gramacy & Dorit Hammerling & Matthias Katzfuss & Finn Lindgr, 2019. "A Case Study Competition Among Methods for Analyzing Large Spatial Data," Journal of Agricultural, Biological and Environmental Statistics, Springer;The International Biometric Society;American Statistical Association, vol. 24(3), pages 398-425, September.
    16. André Victor Ribeiro Amaral & Elias Teixeira Krainski & Ruiman Zhong & Paula Moraga, 2024. "Model-Based Geostatistics Under Spatially Varying Preferential Sampling," Journal of Agricultural, Biological and Environmental Statistics, Springer;The International Biometric Society;American Statistical Association, vol. 29(4), pages 766-792, December.
    17. Mahdi Hosseinpouri & Majid Jafari Khaledi, 2019. "An area-specific stick breaking process for spatial data," Statistical Papers, Springer, vol. 60(1), pages 199-221, February.
    18. Huang Huang & Sameh Abdulah & Ying Sun & Hatem Ltaief & David E. Keyes & Marc G. Genton, 2021. "Competition on Spatial Statistics for Large Datasets," Journal of Agricultural, Biological and Environmental Statistics, Springer;The International Biometric Society;American Statistical Association, vol. 26(4), pages 580-595, December.
    19. Unn Dahlén & Johan Lindström & Marko Scholze, 2020. "Spatiotemporal reconstructions of global CO2‐fluxes using Gaussian Markov random fields," Environmetrics, John Wiley & Sons, Ltd., vol. 31(4), June.
    20. Silius M. Vandeskog & Sara Martino & Daniela Castro-Camilo & Håvard Rue, 2022. "Modelling Sub-daily Precipitation Extremes with the Blended Generalised Extreme Value Distribution," Journal of Agricultural, Biological and Environmental Statistics, Springer;The International Biometric Society;American Statistical Association, vol. 27(4), pages 598-621, December.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:csdana:v:161:y:2021:i:c:s0167947321000748. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/csda .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.