IDEAS home Printed from https://ideas.repec.org/a/eee/jmvana/v98y2007i2p227-255.html
   My bibliography  Save this article

Estimation of high-dimensional prior and posterior covariance matrices in Kalman filter variants

Author

Listed:
  • Furrer, Reinhard
  • Bengtsson, Thomas

Abstract

This work studies the effects of sampling variability in Monte Carlo-based methods to estimate very high-dimensional systems. Recent focus in the geosciences has been on representing the atmospheric state using a probability density function, and, for extremely high-dimensional systems, various sample-based Kalman filter techniques have been developed to address the problem of real-time assimilation of system information and observations. As the employed sample sizes are typically several orders of magnitude smaller than the system dimension, such sampling techniques inevitably induce considerable variability into the state estimate, primarily through prior and posterior sample covariance matrices. In this article, we quantify this variability with mean squared error measures for two Monte Carlo-based Kalman filter variants: the ensemble Kalman filter and the ensemble square-root Kalman filter. Expressions of the error measures are derived under weak assumptions and show that sample sizes need to grow proportionally to the square of the system dimension for bounded error growth. To reduce necessary ensemble size requirements and to address rank-deficient sample covariances, covariance-shrinking (tapering) based on the Schur product of the prior sample covariance and a positive definite function is demonstrated to be a simple, computationally feasible, and very effective technique. Rules for obtaining optimal taper functions for both stationary as well as non-stationary covariances are given, and optimal taper lengths are given in terms of the ensemble size and practical range of the forecast covariance. Results are also presented for optimal covariance inflation. The theory is verified and illustrated with extensive simulations.

Suggested Citation

  • Furrer, Reinhard & Bengtsson, Thomas, 2007. "Estimation of high-dimensional prior and posterior covariance matrices in Kalman filter variants," Journal of Multivariate Analysis, Elsevier, vol. 98(2), pages 227-255, February.
  • Handle: RePEc:eee:jmvana:v:98:y:2007:i:2:p:227-255
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0047-259X(06)00118-7
    Download Restriction: Full text for ScienceDirect subscribers only
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Gneiting, Tilmann, 2002. "Compactly Supported Correlation Functions," Journal of Multivariate Analysis, Elsevier, vol. 83(2), pages 493-508, November.
    2. Furrer, Reinhard, 2005. "Covariance estimation under spatial dependence," Journal of Multivariate Analysis, Elsevier, vol. 94(2), pages 366-381, June.
    3. Gneiting, Tilmann, 1999. "Radial Positive Definite Functions Generated by Euclid's Hat," Journal of Multivariate Analysis, Elsevier, vol. 69(1), pages 88-119, April.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Frei, Marco & Künsch, Hans R., 2013. "Mixture ensemble Kalman filters," Computational Statistics & Data Analysis, Elsevier, vol. 58(C), pages 127-138.
    2. Wang, Xuanci & Zhang, Bin, 2024. "Target selection in shrinkage estimation of covariance matrix: A structural similarity approach," Statistics & Probability Letters, Elsevier, vol. 208(C).
    3. Lam, Clifford, 2008. "Estimation of large precision matrices through block penalization," LSE Research Online Documents on Economics 31543, London School of Economics and Political Science, LSE Library.
    4. Na Huang & Piotr Fryzlewicz, 2019. "NOVELIST estimator of large correlation and covariance matrices and their inverses," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 28(3), pages 694-727, September.
    5. Gautam Sabnis & Debdeep Pati & Anirban Bhattacharya, 2019. "Compressed Covariance Estimation with Automated Dimension Learning," Sankhya A: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 81(2), pages 466-481, December.
    6. Yi, Feng & Zou, Hui, 2013. "SURE-tuned tapering estimation of large covariance matrices," Computational Statistics & Data Analysis, Elsevier, vol. 58(C), pages 339-351.
    7. Konrad Furmańczyk, 2021. "Estimation of autocovariance matrices for high dimensional linear processes," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 84(4), pages 595-613, May.
    8. Xiaoping Zhou & Dmitry Malioutov & Frank J. Fabozzi & Svetlozar T. Rachev, 2014. "Smooth monotone covariance for elliptical distributions and applications in finance," Quantitative Finance, Taylor & Francis Journals, vol. 14(9), pages 1555-1571, September.
    9. Fang, Qian & Yu, Chen & Weiping, Zhang, 2020. "Regularized estimation of precision matrix for high-dimensional multivariate longitudinal data," Journal of Multivariate Analysis, Elsevier, vol. 176(C).
    10. Farnè, Matteo & Montanari, Angela, 2020. "A large covariance matrix estimator under intermediate spikiness regimes," Journal of Multivariate Analysis, Elsevier, vol. 176(C).
    11. Zhao, Junguang & Xu, Xingzhong, 2016. "A generalized likelihood ratio test for normal mean when p is greater than n," Computational Statistics & Data Analysis, Elsevier, vol. 99(C), pages 91-104.
    12. Huang, Na & Fryzlewicz, Piotr, 2018. "NOVELIST estimator of large correlation and covariance matrices and their inverses," LSE Research Online Documents on Economics 89055, London School of Economics and Political Science, LSE Library.
    13. Chen, Bei & Gel, Yulia R., 2010. "Autoregressive frequency detection using Regularized Least Squares," Journal of Multivariate Analysis, Elsevier, vol. 101(7), pages 1712-1727, August.
    14. Yu, Philip L.H. & Wang, Xiaohang & Zhu, Yuanyuan, 2017. "High dimensional covariance matrix estimation by penalizing the matrix-logarithm transformed likelihood," Computational Statistics & Data Analysis, Elsevier, vol. 114(C), pages 12-25.
    15. Jon Sætrom & Henning Omre, 2013. "Uncertainty Quantification in the Ensemble Kalman Filter," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 40(4), pages 868-885, December.
    16. Zhu, Xiaonan & Chen, Yu & Hu, Jie, 2024. "Estimation of banded time-varying precision matrix based on SCAD and group lasso," Computational Statistics & Data Analysis, Elsevier, vol. 189(C).
    17. Xue, Lingzhou & Zou, Hui, 2013. "Minimax optimal estimation of general bandable covariance matrices," Journal of Multivariate Analysis, Elsevier, vol. 116(C), pages 45-51.
    18. Zvi Bodie & Jérôme Detemple & Marcel Rindisbacher, 2009. "Life-Cycle Finance and the Design of Pension Plans," Annual Review of Financial Economics, Annual Reviews, vol. 1(1), pages 249-286, November.
    19. Deirdre Bloome & Daniel Schrage, 2021. "Covariance Regression Models for Studying Treatment Effect Heterogeneity Across One or More Outcomes: Understanding How Treatments Shape Inequality," Sociological Methods & Research, , vol. 50(3), pages 1034-1072, August.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Alonso-Malaver, C.E. & Porcu, E. & Giraldo, R., 2015. "Multivariate and multiradial Schoenberg measures with their dimension walks," Journal of Multivariate Analysis, Elsevier, vol. 133(C), pages 251-265.
    2. Moreva, Olga & Schlather, Martin, 2023. "Bivariate covariance functions of Pólya type," Journal of Multivariate Analysis, Elsevier, vol. 194(C).
    3. Moreno Bevilacqua & Alfredo Alegria & Daira Velandia & Emilio Porcu, 2016. "Composite Likelihood Inference for Multivariate Gaussian Random Fields," Journal of Agricultural, Biological and Environmental Statistics, Springer;The International Biometric Society;American Statistical Association, vol. 21(3), pages 448-469, September.
    4. Caamaño-Carrillo, Christian & Bevilacqua, Moreno & López, Cristian & Morales-Oñate, Víctor, 2024. "Nearest neighbors weighted composite likelihood based on pairs for (non-)Gaussian massive spatial data with an application to Tukey-hh random fields estimation," Computational Statistics & Data Analysis, Elsevier, vol. 191(C).
    5. Guella, Jean Carlo & Menegatto, Valdir Antonio & Porcu, Emilio, 2018. "Strictly positive definite multivariate covariance functions on spheres," Journal of Multivariate Analysis, Elsevier, vol. 166(C), pages 150-159.
    6. M. Ruiz-Medina & J. Angulo & G. Christakos & R. Fernández-Pascual, 2016. "New compactly supported spatiotemporal covariance functions from SPDEs," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 25(1), pages 125-141, March.
    7. Montero, José-María, 2018. "Geostatistics: Unde venis et quo vadis? /Geoestadística:¿De dónde vienes y a dónde vas?," Estudios de Economia Aplicada, Estudios de Economia Aplicada, vol. 36, pages 81-106, Enero.
    8. Gneiting, Tilmann, 2002. "Compactly Supported Correlation Functions," Journal of Multivariate Analysis, Elsevier, vol. 83(2), pages 493-508, November.
    9. Andrea Barth & Fred Espen Benth & Jurgen Potthoff, 2011. "Hedging of Spatial Temperature Risk with Market-Traded Futures," Applied Mathematical Finance, Taylor & Francis Journals, vol. 18(2), pages 93-117.
    10. Dawei Zhan & Huanlai Xing, 2020. "Expected improvement for expensive optimization: a review," Journal of Global Optimization, Springer, vol. 78(3), pages 507-544, November.
    11. Castañer, A. & Claramunt, M.M. & Lefèvre, C. & Loisel, S., 2015. "Discrete Schur-constant models," Journal of Multivariate Analysis, Elsevier, vol. 140(C), pages 343-362.
    12. Jialuo Liu & Tingjin Chu & Jun Zhu & Haonan Wang, 2022. "Large spatial data modeling and analysis: A Krylov subspace approach," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 49(3), pages 1115-1143, September.
    13. Barry, Ronald Paul, 2005. "The local approximation of variograms in by covariance functions," Statistics & Probability Letters, Elsevier, vol. 74(2), pages 171-177, September.
    14. Chiranjit Mukherjee & Prasad Kasibhatla & Mike West, 2014. "Spatially varying SAR models and Bayesian inference for high-resolution lattice data," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 66(3), pages 473-494, June.
    15. Lim, Chae Young & Stein, Michael, 2008. "Properties of spatial cross-periodograms using fixed-domain asymptotics," Journal of Multivariate Analysis, Elsevier, vol. 99(9), pages 1962-1984, October.
    16. Bevilacqua, Moreno & Caamaño-Carrillo, Christian & Porcu, Emilio, 2022. "Unifying compactly supported and Matérn covariance functions in spatial statistics," Journal of Multivariate Analysis, Elsevier, vol. 189(C).
    17. Bolin, David & Lindgren, Finn, 2013. "A comparison between Markov approximations and other methods for large spatial data sets," Computational Statistics & Data Analysis, Elsevier, vol. 61(C), pages 7-21.
    18. Sajti, Szilárd, 2023. "Domain-domain correlation functions used in off-specular scattering," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 623(C).
    19. Fadoua Balabdaoui & Jon A. Wellner, 2010. "Estimation of a k‐monotone density: characterizations, consistency and minimax lower bounds," Statistica Neerlandica, Netherlands Society for Statistics and Operations Research, vol. 64(1), pages 45-70, February.
    20. M. Bevilacqua & A. Fassò & C. Gaetan & E. Porcu & D. Velandia, 2016. "Covariance tapering for multivariate Gaussian random fields estimation," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 25(1), pages 21-37, March.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:jmvana:v:98:y:2007:i:2:p:227-255. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/wps/find/journaldescription.cws_home/622892/description#description .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.