IDEAS home Printed from https://ideas.repec.org/a/bla/biomet/v78y2022i2p560-573.html
   My bibliography  Save this article

Spatial factor modeling: A Bayesian matrix‐normal approach for misaligned data

Author

Listed:
  • Lu Zhang
  • Sudipto Banerjee

Abstract

Multivariate spatially oriented data sets are prevalent in the environmental and physical sciences. Scientists seek to jointly model multiple variables, each indexed by a spatial location, to capture any underlying spatial association for each variable and associations among the different dependent variables. Multivariate latent spatial process models have proved effective in driving statistical inference and rendering better predictive inference at arbitrary locations for the spatial process. High‐dimensional multivariate spatial data, which are the theme of this article, refer to data sets where the number of spatial locations and the number of spatially dependent variables is very large. The field has witnessed substantial developments in scalable models for univariate spatial processes, but such methods for multivariate spatial processes, especially when the number of outcomes are moderately large, are limited in comparison. Here, we extend scalable modeling strategies for a single process to multivariate processes. We pursue Bayesian inference, which is attractive for full uncertainty quantification of the latent spatial process. Our approach exploits distribution theory for the matrix‐normal distribution, which we use to construct scalable versions of a hierarchical linear model of coregionalization (LMC) and spatial factor models that deliver inference over a high‐dimensional parameter space including the latent spatial process. We illustrate the computational and inferential benefits of our algorithms over competing methods using simulation studies and an analysis of a massive vegetation index data set.

Suggested Citation

  • Lu Zhang & Sudipto Banerjee, 2022. "Spatial factor modeling: A Bayesian matrix‐normal approach for misaligned data," Biometrics, The International Biometric Society, vol. 78(2), pages 560-573, June.
  • Handle: RePEc:bla:biomet:v:78:y:2022:i:2:p:560-573
    DOI: 10.1111/biom.13452
    as

    Download full text from publisher

    File URL: https://doi.org/10.1111/biom.13452
    Download Restriction: no

    File URL: https://libkey.io/10.1111/biom.13452?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Qian Ren & Sudipto Banerjee, 2013. "Hierarchical Factor Models for Large Spatially Misaligned Data: A Low-Rank Predictive Process Approach," Biometrics, The International Biometric Society, vol. 69(1), pages 19-30, March.
    2. Abhirup Datta & Sudipto Banerjee & Andrew O. Finley & Alan E. Gelfand, 2016. "Hierarchical Nearest-Neighbor Gaussian Process Models for Large Geostatistical Datasets," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 111(514), pages 800-812, April.
    3. Matthew J. Heaton & Abhirup Datta & Andrew O. Finley & Reinhard Furrer & Joseph Guinness & Rajarshi Guhaniyogi & Florian Gerber & Robert B. Gramacy & Dorit Hammerling & Matthias Katzfuss & Finn Lindgr, 2019. "A Case Study Competition Among Methods for Analyzing Large Spatial Data," Journal of Agricultural, Biological and Environmental Statistics, Springer;The International Biometric Society;American Statistical Association, vol. 24(3), pages 398-425, September.
    4. Finley, Andrew O. & Banerjee, Sudipto & Carlin, Bradley P., 2007. "spBayes: An R Package for Univariate and Multivariate Hierarchical Point-referenced Spatial Models," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 19(i04).
    5. Gneiting, Tilmann & Raftery, Adrian E., 2007. "Strictly Proper Scoring Rules, Prediction, and Estimation," Journal of the American Statistical Association, American Statistical Association, vol. 102, pages 359-378, March.
    6. Alan Gelfand & Alexandra Schmidt & Sudipto Banerjee & C. Sirmans, 2004. "Nonstationary multivariate process modeling through spatially varying coregionalization," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 13(2), pages 263-312, December.
    7. Nhu D. Le & Weimin Sun & James V. Zidek, 1997. "Bayesian Multivariate Spatial Interpolation with Data Missing by Design," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 59(2), pages 501-510.
    8. Gamerman, Dani & Moreira, Ajax R. B., 2004. "Multivariate spatial regression models," Journal of Multivariate Analysis, Elsevier, vol. 91(2), pages 262-281, November.
    9. Robert Thorndike, 1953. "Who belongs in the family?," Psychometrika, Springer;The Psychometric Society, vol. 18(4), pages 267-276, December.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Andrew O. Finley & Hans-Erik Andersen & Chad Babcock & Bruce D. Cook & Douglas C. Morton & Sudipto Banerjee, 2024. "Models to Support Forest Inventory and Small Area Estimation Using Sparsely Sampled LiDAR: A Case Study Involving G-LiHT LiDAR in Tanana, Alaska," Journal of Agricultural, Biological and Environmental Statistics, Springer;The International Biometric Society;American Statistical Association, vol. 29(4), pages 695-722, December.
    2. Sudipto Banerjee, 2023. "Discussion of “Saving Storage in Climate Ensembles: A Model-Based Stochastic Approach” by Huang Huang, Stefano Castruccio, Allison H. Baker and Marc Genton," Journal of Agricultural, Biological and Environmental Statistics, Springer;The International Biometric Society;American Statistical Association, vol. 28(2), pages 365-369, June.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Lu Zhang & Sudipto Banerjee & Andrew O. Finley, 2021. "High‐dimensional multivariate geostatistics: A Bayesian matrix‐normal approach," Environmetrics, John Wiley & Sons, Ltd., vol. 32(4), June.
    2. Tilman M. Davies & Sudipto Banerjee & Adam P. Martin & Rose E. Turnbull, 2022. "A nearest‐neighbour Gaussian process spatial factor model for censored, multi‐depth geochemical data," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 71(4), pages 1014-1043, August.
    3. Isabelle Grenier & Bruno Sansó & Jessica L. Matthews, 2024. "Multivariate nearest‐neighbors Gaussian processes with random covariance matrices," Environmetrics, John Wiley & Sons, Ltd., vol. 35(3), May.
    4. Guhaniyogi, Rajarshi & Banerjee, Sudipto, 2019. "Multivariate spatial meta kriging," Statistics & Probability Letters, Elsevier, vol. 144(C), pages 3-8.
    5. Xiaotian Zheng & Athanasios Kottas & Bruno Sansó, 2023. "Bayesian geostatistical modeling for discrete‐valued processes," Environmetrics, John Wiley & Sons, Ltd., vol. 34(7), November.
    6. Zhou Lan & Brian J. Reich & Joseph Guinness & Dipankar Bandyopadhyay & Liangsuo Ma & F. Gerard Moeller, 2022. "Geostatistical modeling of positive‐definite matrices: An application to diffusion tensor imaging," Biometrics, The International Biometric Society, vol. 78(2), pages 548-559, June.
    7. Matthias Katzfuss & Joseph Guinness & Wenlong Gong & Daniel Zilber, 2020. "Vecchia Approximations of Gaussian-Process Predictions," Journal of Agricultural, Biological and Environmental Statistics, Springer;The International Biometric Society;American Statistical Association, vol. 25(3), pages 383-414, September.
    8. Chen, Yewen & Chang, Xiaohui & Luo, Fangzhi & Huang, Hui, 2023. "Additive dynamic models for correcting numerical model outputs," Computational Statistics & Data Analysis, Elsevier, vol. 187(C).
    9. Zilber, Daniel & Katzfuss, Matthias, 2021. "Vecchia–Laplace approximations of generalized Gaussian processes for big non-Gaussian spatial data," Computational Statistics & Data Analysis, Elsevier, vol. 153(C).
    10. Heaton, Matthew J. & Dahl, Benjamin K. & Dayley, Caleb & Warr, Richard L. & White, Philip, 2024. "Integrating machine learning and Bayesian nonparametrics for flexible modeling of point pattern data," Computational Statistics & Data Analysis, Elsevier, vol. 191(C).
    11. Pascal Kundig & Fabio Sigrist, 2024. "A Spatio-Temporal Machine Learning Model for Mortgage Credit Risk: Default Probabilities and Loan Portfolios," Papers 2410.02846, arXiv.org.
    12. Alexandra Schmidt & Ajax Moreira & Steven Helfand & Thais Fonseca, 2009. "Spatial stochastic frontier models: accounting for unobserved local determinants of inefficiency," Journal of Productivity Analysis, Springer, vol. 31(2), pages 101-112, April.
    13. Ying C. MacNab, 2018. "Some recent work on multivariate Gaussian Markov random fields," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 27(3), pages 497-541, September.
    14. Cole, D. Austin & Gramacy, Robert B. & Ludkovski, Mike, 2022. "Large-scale local surrogate modeling of stochastic simulation experiments," Computational Statistics & Data Analysis, Elsevier, vol. 174(C).
    15. Bledar A. Konomi & Emily L. Kang & Ayat Almomani & Jonathan Hobbs, 2023. "Bayesian Latent Variable Co-kriging Model in Remote Sensing for Quality Flagged Observations," Journal of Agricultural, Biological and Environmental Statistics, Springer;The International Biometric Society;American Statistical Association, vol. 28(3), pages 423-441, September.
    16. C Emi Fergus & Andrew O Finley & Patricia A Soranno & Tyler Wagner, 2016. "Spatial Variation in Nutrient and Water Color Effects on Lake Chlorophyll at Macroscales," PLOS ONE, Public Library of Science, vol. 11(10), pages 1-20, October.
    17. Paige, John & Fuglstad, Geir-Arne & Riebler, Andrea & Wakefield, Jon, 2022. "Bayesian multiresolution modeling of georeferenced data: An extension of ‘LatticeKrig’," Computational Statistics & Data Analysis, Elsevier, vol. 173(C).
    18. Sebastain Awondo & Genti Kostandini, 2022. "Leveraging optimal portfolio of Drought-Tolerant Maize Varieties for weather index insurance and food security," The Geneva Risk and Insurance Review, Palgrave Macmillan;International Association for the Study of Insurance Economics (The Geneva Association), vol. 47(1), pages 45-65, March.
    19. Lucia Paci & Alan E. Gelfand & and María Asunción Beamonte & Pilar Gargallo & Manuel Salvador, 2020. "Spatial hedonic modelling adjusted for preferential sampling," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 183(1), pages 169-192, January.
    20. Shinichiro Shirota & Andrew O. Finley & Bruce D. Cook & Sudipto Banerjee, 2023. "Conjugate sparse plus low rank models for efficient Bayesian interpolation of large spatial data," Environmetrics, John Wiley & Sons, Ltd., vol. 34(1), February.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:biomet:v:78:y:2022:i:2:p:560-573. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: http://www.blackwellpublishing.com/journal.asp?ref=0006-341X .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.