IDEAS home Printed from https://ideas.repec.org/a/bla/biomet/v79y2023i2p1201-1212.html
   My bibliography  Save this article

A compound decision approach to covariance matrix estimation

Author

Listed:
  • Huiqin Xin
  • Sihai Dave Zhao

Abstract

Covariance matrix estimation is a fundamental statistical task in many applications, but the sample covariance matrix is suboptimal when the sample size is comparable to or less than the number of features. Such high‐dimensional settings are common in modern genomics, where covariance matrix estimation is frequently employed as a method for inferring gene networks. To achieve estimation accuracy in these settings, existing methods typically either assume that the population covariance matrix has some particular structure, for example, sparsity, or apply shrinkage to better estimate the population eigenvalues. In this paper, we study a new approach to estimating high‐dimensional covariance matrices. We first frame covariance matrix estimation as a compound decision problem. This motivates defining a class of decision rules and using a nonparametric empirical Bayes g‐modeling approach to estimate the optimal rule in the class. Simulation results and gene network inference in an RNA‐seq experiment in mouse show that our approach is comparable to or can outperform a number of state‐of‐the‐art proposals.

Suggested Citation

  • Huiqin Xin & Sihai Dave Zhao, 2023. "A compound decision approach to covariance matrix estimation," Biometrics, The International Biometric Society, vol. 79(2), pages 1201-1212, June.
  • Handle: RePEc:bla:biomet:v:79:y:2023:i:2:p:1201-1212
    DOI: 10.1111/biom.13686
    as

    Download full text from publisher

    File URL: https://doi.org/10.1111/biom.13686
    Download Restriction: no

    File URL: https://libkey.io/10.1111/biom.13686?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Feng, Long & Dicker, Lee H., 2018. "Approximate nonparametric maximum likelihood for mixture models: A convex optimization approach to fitting arbitrary multivariate mixing distributions," Computational Statistics & Data Analysis, Elsevier, vol. 122(C), pages 80-91.
    2. Joel Bun & Romain Allez & Jean-Philippe Bouchaud & Marc Potters, 2015. "Rotational invariant estimator for general noisy matrices," Papers 1502.06736, arXiv.org, revised Oct 2016.
    3. Schäfer Juliane & Strimmer Korbinian, 2005. "A Shrinkage Approach to Large-Scale Covariance Matrix Estimation and Implications for Functional Genomics," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 4(1), pages 1-32, November.
    4. Cai, Tony & Liu, Weidong, 2011. "Adaptive Thresholding for Sparse Covariance Matrix Estimation," Journal of the American Statistical Association, American Statistical Association, vol. 106(494), pages 672-684.
    5. Rothman, Adam J. & Levina, Elizaveta & Zhu, Ji, 2009. "Generalized Thresholding of Large Covariance Matrices," Journal of the American Statistical Association, American Statistical Association, vol. 104(485), pages 177-186.
    6. Roger Koenker & Ivan Mizera, 2014. "Convex Optimization, Shape Constraints, Compound Decisions, and Empirical Bayes Rules," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 109(506), pages 674-685, June.
    7. Fan, Jianqing & Fan, Yingying & Lv, Jinchi, 2008. "High dimensional covariance matrix estimation using a factor model," Journal of Econometrics, Elsevier, vol. 147(1), pages 186-197, November.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Bailey, Natalia & Pesaran, M. Hashem & Smith, L. Vanessa, 2019. "A multiple testing approach to the regularisation of large sample correlation matrices," Journal of Econometrics, Elsevier, vol. 208(2), pages 507-534.
    2. Lam, Clifford, 2020. "High-dimensional covariance matrix estimation," LSE Research Online Documents on Economics 101667, London School of Economics and Political Science, LSE Library.
    3. Gautam Sabnis & Debdeep Pati & Anirban Bhattacharya, 2019. "Compressed Covariance Estimation with Automated Dimension Learning," Sankhya A: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 81(2), pages 466-481, December.
    4. Huang, Na & Fryzlewicz, Piotr, 2018. "NOVELIST estimator of large correlation and covariance matrices and their inverses," LSE Research Online Documents on Economics 89055, London School of Economics and Political Science, LSE Library.
    5. Na Huang & Piotr Fryzlewicz, 2019. "NOVELIST estimator of large correlation and covariance matrices and their inverses," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 28(3), pages 694-727, September.
    6. Maurizio Daniele & Winfried Pohlmeier & Aygul Zagidullina, 2018. "Sparse Approximate Factor Estimation for High-Dimensional Covariance Matrices," Working Paper Series of the Department of Economics, University of Konstanz 2018-07, Department of Economics, University of Konstanz.
    7. Fan, Jianqing & Liao, Yuan & Shi, Xiaofeng, 2015. "Risks of large portfolios," Journal of Econometrics, Elsevier, vol. 186(2), pages 367-387.
    8. Seunghwan Lee & Sang Cheol Kim & Donghyeon Yu, 2023. "An efficient GPU-parallel coordinate descent algorithm for sparse precision matrix estimation via scaled lasso," Computational Statistics, Springer, vol. 38(1), pages 217-242, March.
    9. Ikeda, Yuki & Kubokawa, Tatsuya & Srivastava, Muni S., 2016. "Comparison of linear shrinkage estimators of a large covariance matrix in normal and non-normal distributions," Computational Statistics & Data Analysis, Elsevier, vol. 95(C), pages 95-108.
    10. Jin-Chuan Duan & Weimin Miao, 2016. "Default Correlations and Large-Portfolio Credit Analysis," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 34(4), pages 536-546, October.
    11. Du, Lilun & Lan, Wei & Luo, Ronghua & Zhong, Pingshou, 2018. "Factor-adjusted multiple testing of correlations," Computational Statistics & Data Analysis, Elsevier, vol. 128(C), pages 34-47.
    12. Xin Wang & Lingchen Kong & Liqun Wang & Zhaoqilin Yang, 2023. "High-Dimensional Covariance Estimation via Constrained L q -Type Regularization," Mathematics, MDPI, vol. 11(4), pages 1-20, February.
    13. Chen, Jia & Li, Degui & Linton, Oliver, 2019. "A new semiparametric estimation approach for large dynamic covariance matrices with multiple conditioning variables," Journal of Econometrics, Elsevier, vol. 212(1), pages 155-176.
    14. Jianqing Fan & Yuan Liao & Martina Mincheva, 2013. "Large covariance estimation by thresholding principal orthogonal complements," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 75(4), pages 603-680, September.
    15. Yan Zhang & Jiyuan Tao & Zhixiang Yin & Guoqiang Wang, 2022. "Improved Large Covariance Matrix Estimation Based on Efficient Convex Combination and Its Application in Portfolio Optimization," Mathematics, MDPI, vol. 10(22), pages 1-15, November.
    16. Dai, Chaoxing & Lu, Kun & Xiu, Dacheng, 2019. "Knowing factors or factor loadings, or neither? Evaluating estimators of large covariance matrices with noisy and asynchronous data," Journal of Econometrics, Elsevier, vol. 208(1), pages 43-79.
    17. Jingying Yang, 2024. "Element Aggregation for Estimation of High-Dimensional Covariance Matrices," Mathematics, MDPI, vol. 12(7), pages 1-16, March.
    18. Wang, Hanchao & Peng, Bin & Li, Degui & Leng, Chenlei, 2021. "Nonparametric estimation of large covariance matrices with conditional sparsity," Journal of Econometrics, Elsevier, vol. 223(1), pages 53-72.
    19. Yumou Qiu & Song Xi Chen, 2015. "Bandwidth Selection for High-Dimensional Covariance Matrix Estimation," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 110(511), pages 1160-1174, September.
    20. Shaoxin Wang & Hu Yang & Chaoli Yao, 2019. "On the penalized maximum likelihood estimation of high-dimensional approximate factor model," Computational Statistics, Springer, vol. 34(2), pages 819-846, June.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:biomet:v:79:y:2023:i:2:p:1201-1212. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: http://www.blackwellpublishing.com/journal.asp?ref=0006-341X .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.