IDEAS home Printed from https://ideas.repec.org/a/eee/csdana/v201y2025ics0167947324001269.html
   My bibliography  Save this article

Feasible model-based principal component analysis: Joint estimation of rank and error covariance matrix

Author

Listed:
  • Chan, Tak-Shing T.
  • Gibberd, Alex

Abstract

Real-world inputs to principal component analysis are often corrupted by temporally or spatially correlated errors. There are several methods to mitigate this, e.g., generalized least-square matrix decomposition and maximum likelihood approaches; however, they all require that the number of components or the error covariances to be known in advance, rendering the methods infeasible. To address this issue, a novel method is developed which estimates the number of components and the error covariances at the same time. The method is based on working covariance models, an idea adapted from generalized estimating equations, where the user only specifies the structural form of the error covariances. If the structural form is also unknown, working covariance selection can be used to search for the best structure from a user-defined library. Experiments on synthetic and real data confirm the efficacy of the proposed approach.

Suggested Citation

  • Chan, Tak-Shing T. & Gibberd, Alex, 2025. "Feasible model-based principal component analysis: Joint estimation of rank and error covariance matrix," Computational Statistics & Data Analysis, Elsevier, vol. 201(C).
  • Handle: RePEc:eee:csdana:v:201:y:2025:i:c:s0167947324001269
    DOI: 10.1016/j.csda.2024.108042
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0167947324001269
    Download Restriction: Full text for ScienceDirect subscribers only.

    File URL: https://libkey.io/10.1016/j.csda.2024.108042?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Jushan Bai & Serena Ng, 2002. "Determining the Number of Factors in Approximate Factor Models," Econometrica, Econometric Society, vol. 70(1), pages 191-221, January.
    2. Michael E. Tipping & Christopher M. Bishop, 1999. "Probabilistic Principal Component Analysis," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 61(3), pages 611-622.
    3. Choi, In, 2012. "Efficient Estimation Of Factor Models," Econometric Theory, Cambridge University Press, vol. 28(2), pages 274-308, April.
    4. Henk Kiers, 1997. "Weighted least squares fitting using ordinary least squares algorithms," Psychometrika, Springer;The Psychometric Society, vol. 62(2), pages 251-266, June.
    5. Genevera I. Allen & Logan Grosenick & Jonathan Taylor, 2014. "A Generalized Least-Square Matrix Decomposition," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 109(505), pages 145-159, March.
    6. Josse, Julie & Husson, François, 2012. "Selecting the number of components in principal component analysis using cross-validation approximations," Computational Statistics & Data Analysis, Elsevier, vol. 56(6), pages 1869-1879.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Matteo Barigozzi, 2023. "Asymptotic equivalence of Principal Components and Quasi Maximum Likelihood estimators in Large Approximate Factor Models," Papers 2307.09864, arXiv.org, revised Jun 2024.
    2. Matteo Barigozzi & Matteo Luciani, 2017. "Common Factors, Trends, and Cycles in Large Datasets," Finance and Economics Discussion Series 2017-111, Board of Governors of the Federal Reserve System (U.S.).
    3. Kim, Hyun Hak & Swanson, Norman R., 2018. "Mining big data using parsimonious factor, machine learning, variable selection and shrinkage methods," International Journal of Forecasting, Elsevier, vol. 34(2), pages 339-354.
    4. Matteo Barigozzi & Matteo Luciani, 2019. "Quasi Maximum Likelihood Estimation and Inference of Large Approximate Dynamic Factor Models via the EM algorithm," Papers 1910.03821, arXiv.org, revised Sep 2024.
    5. Matteo Barigozzi & Marc Hallin, 2023. "Dynamic Factor Models: a Genealogy," Papers 2310.17278, arXiv.org, revised Jan 2024.
    6. Hansen, Christian & Liao, Yuan, 2019. "The Factor-Lasso And K-Step Bootstrap Approach For Inference In High-Dimensional Economic Applications," Econometric Theory, Cambridge University Press, vol. 35(3), pages 465-509, June.
    7. Ma, Tao & Zhou, Zhou & Antoniou, Constantinos, 2018. "Dynamic factor model for network traffic state forecast," Transportation Research Part B: Methodological, Elsevier, vol. 118(C), pages 281-317.
    8. Jonas Krampe & Luca Margaritella, 2021. "Factor Models with Sparse VAR Idiosyncratic Components," Papers 2112.07149, arXiv.org, revised May 2022.
    9. In Choi & Dukpa Kim & Yun Jung Kim & Noh‐Sun Kwark, 2018. "A multilevel factor model: Identification, asymptotic theory and applications," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 33(3), pages 355-377, April.
    10. Catherine Doz & Peter Fuleky, 2019. "Dynamic Factor Models," Working Papers 2019-4, University of Hawaii Economic Research Organization, University of Hawaii at Manoa.
    11. Bai, Jushan & Liao, Yuan, 2016. "Efficient estimation of approximate factor models via penalized maximum likelihood," Journal of Econometrics, Elsevier, vol. 191(1), pages 1-18.
    12. Yunus Emre Ergemen, 2022. "Parametric Estimation of Long Memory in Factor Models," CREATES Research Papers 2022-10, Department of Economics and Business Economics, Aarhus University.
    13. Stock, J.H. & Watson, M.W., 2016. "Dynamic Factor Models, Factor-Augmented Vector Autoregressions, and Structural Vector Autoregressions in Macroeconomics," Handbook of Macroeconomics, in: J. B. Taylor & Harald Uhlig (ed.), Handbook of Macroeconomics, edition 1, volume 2, chapter 0, pages 415-525, Elsevier.
    14. Sung Hoon Choi, 2021. "Feasible Weighted Projected Principal Component Analysis for Factor Models with an Application to Bond Risk Premia," Papers 2108.10250, arXiv.org, revised May 2022.
    15. Urga, Giovanni & Wang, Fa, 2022. "Estimation and inference for high dimensional factor model with regime switching," MPRA Paper 113172, University Library of Munich, Germany.
    16. In Choi & Hanbat Jeong, 2019. "Model selection for factor analysis: Some new criteria and performance comparisons," Econometric Reviews, Taylor & Francis Journals, vol. 38(6), pages 577-596, July.
    17. Bai, Jushan & Wang, Peng, 2012. "Identification and estimation of dynamic factor models," MPRA Paper 38434, University Library of Munich, Germany.
    18. Bai, Jushan & Liao, Yuan, 2017. "Inferences in panel data with interactive effects using large covariance matrices," Journal of Econometrics, Elsevier, vol. 200(1), pages 59-78.
    19. Bada, Oualid & Kneip, Alois, 2014. "Parameter cascading for panel models with unknown number of unobserved factors: An application to the credit spread puzzle," Computational Statistics & Data Analysis, Elsevier, vol. 76(C), pages 95-115.
    20. Gregory Connor & Robert A Korajczyk, 2024. "Semi-Strong Factors in Asset Returns," Journal of Financial Econometrics, Oxford University Press, vol. 22(1), pages 70-93.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:csdana:v:201:y:2025:i:c:s0167947324001269. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/csda .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.