IDEAS home Printed from https://ideas.repec.org/a/spr/compst/v38y2023i2d10.1007_s00180-022-01270-z.html
   My bibliography  Save this article

Sparse online principal component analysis for parameter estimation in factor model

Author

Listed:
  • Guangbao Guo

    (Shandong University of Technology)

  • Chunjie Wei

    (Shandong University of Technology)

  • Guoqi Qian

    (University of Melbourne)

Abstract

Factor model has the capacity of reducing redundant information in real data analysis. Note that sparse principal component (SPC) method is developed to obtain sparse solutions from the model, online principal component (OPC) method is used to handle with online dimension reduction problem. It is worth considering how to obtain a sparse solution with online learning. In this paper we propose a novel sparse online principal component (SOPC) method for sparse parameter estimation in factor model, where we combine the advantages of the SPC and OPC methods in estimating the loading matrix and the idiosyncratic variance matrix. By integrating sparse modelling with online update, the SOPC is capable of finding the sparse solution through iterative online updating, leading to a consistent and easily interpretable solution. Stability and sensitivity of the SOPC are assessed through a simulation study. The method is then applied to analyze two real data sets concerning drug efficacy and human activity recognition.

Suggested Citation

  • Guangbao Guo & Chunjie Wei & Guoqi Qian, 2023. "Sparse online principal component analysis for parameter estimation in factor model," Computational Statistics, Springer, vol. 38(2), pages 1095-1116, June.
  • Handle: RePEc:spr:compst:v:38:y:2023:i:2:d:10.1007_s00180-022-01270-z
    DOI: 10.1007/s00180-022-01270-z
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s00180-022-01270-z
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s00180-022-01270-z?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Máximo Camacho & Rafael Doménech, 2012. "MICA-BBVA: a factor model of economic and financial indicators for short-term GDP forecasting," SERIEs: Journal of the Spanish Economic Association, Springer;Spanish Economic Association, vol. 3(4), pages 475-497, December.
    2. Aït-Sahalia, Yacine & Xiu, Dacheng, 2017. "Using principal component analysis to estimate a high dimensional factor model with high-frequency data," Journal of Econometrics, Elsevier, vol. 201(2), pages 384-399.
    3. Lin, Tsung-I & McLachlan, Geoffrey J. & Lee, Sharon X., 2016. "Extending mixtures of factor models using the restricted multivariate skew-normal distribution," Journal of Multivariate Analysis, Elsevier, vol. 143(C), pages 398-413.
    4. Quefeng Li & Guang Cheng & Jianqing Fan & Yuyan Wang, 2018. "Embracing the Blessing of Dimensionality in Factor Models," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 113(521), pages 380-389, January.
    5. Daniel Peña & Victor J. Yohai, 2016. "Generalized Dynamic Principal Components," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 111(515), pages 1121-1131, July.
    6. Nickolay T. Trendafilov & Sara Fontanella & Kohei Adachi, 2017. "Sparse Exploratory Factor Analysis," Psychometrika, Springer;The Psychometric Society, vol. 82(3), pages 778-794, September.
    7. Fan, Jianqing & Xue, Lingzhou & Yao, Jiawei, 2017. "Sufficient forecasting using factor models," Journal of Econometrics, Elsevier, vol. 201(2), pages 292-306.
    8. Lam, Clifford & Yao, Qiwei, 2012. "Factor modeling for high-dimensional time series: inference for the number of factors," LSE Research Online Documents on Economics 45684, London School of Economics and Political Science, LSE Library.
    9. Yacine Aït-Sahalia & Dacheng Xiu, 2019. "Principal Component Analysis of High-Frequency Data," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 114(525), pages 287-303, January.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Reiß, Markus & Winkelmann, Lars, 2021. "Inference on the maximal rank of time-varying covariance matrices using high-frequency data," Discussion Papers 2021/14, Free University Berlin, School of Business & Economics.
    2. Jianqing Fan & Kunpeng Li & Yuan Liao, 2020. "Recent Developments on Factor Models and its Applications in Econometric Learning," Papers 2009.10103, arXiv.org.
    3. Jaeheon Jung, 2019. "Estimating a Large Covariance Matrix in Time-varying Factor Models," Papers 1910.11965, arXiv.org.
    4. Paolella, Marc S. & Polak, Paweł & Walker, Patrick S., 2021. "A non-elliptical orthogonal GARCH model for portfolio selection under transaction costs," Journal of Banking & Finance, Elsevier, vol. 125(C).
    5. Ma, Shujie & Linton, Oliver & Gao, Jiti, 2021. "Estimation and inference in semiparametric quantile factor models," Journal of Econometrics, Elsevier, vol. 222(1), pages 295-323.
    6. Trucíos, Carlos & Hotta, Luiz K. & Valls Pereira, Pedro L., 2019. "On the robustness of the principal volatility components," Journal of Empirical Finance, Elsevier, vol. 52(C), pages 201-219.
    7. Choi, Sung Hoon & Kim, Donggyu, 2023. "Large volatility matrix analysis using global and national factor models," Journal of Econometrics, Elsevier, vol. 235(2), pages 1917-1933.
    8. Ruijun Bu & Degui Li & Oliver Linton & Hanchao Wang, 2022. "Nonparametric Estimation of Large Spot Volatility Matrices for High-Frequency Financial Data," Working Papers 202212, University of Liverpool, Department of Economics.
    9. Yu, Long & He, Yong & Zhang, Xinsheng, 2019. "Robust factor number specification for large-dimensional elliptical factor model," Journal of Multivariate Analysis, Elsevier, vol. 174(C).
    10. Dovonon, Prosper & Taamouti, Abderrahim & Williams, Julian, 2022. "Testing the eigenvalue structure of spot and integrated covariance," Journal of Econometrics, Elsevier, vol. 229(2), pages 363-395.
    11. Fayed Alshammri & Jiazhu Pan, 2021. "Moving dynamic principal component analysis for non-stationary multivariate time series," Computational Statistics, Springer, vol. 36(3), pages 2247-2287, September.
    12. Cheng, Mingmian & Swanson, Norman R. & Yang, Xiye, 2021. "Forecasting volatility using double shrinkage methods," Journal of Empirical Finance, Elsevier, vol. 62(C), pages 46-61.
    13. repec:cte:wsrepe:27047 is not listed on IDEAS
    14. Kim Christensen & Mikkel Slot Nielsen & Mark Podolskij, 2023. "High-dimensional estimation of quadratic variation based on penalized realized variance," Statistical Inference for Stochastic Processes, Springer, vol. 26(2), pages 331-359, July.
    15. Choi, Jungjun & Yang, Xiye, 2022. "Asymptotic properties of correlation-based principal component analysis," Journal of Econometrics, Elsevier, vol. 229(1), pages 1-18.
    16. Jiayuan Zhou & Feiyu Jiang & Ke Zhu & Wai Keung Li, 2019. "Time series models for realized covariance matrices based on the matrix-F distribution," Papers 1903.12077, arXiv.org, revised Jul 2020.
    17. Bu, R. & Li, D. & Linton, O. & Wang, H., 2022. "Nonparametric Estimation of Large Spot Volatility Matrices for High-Frequency Financial Data," Cambridge Working Papers in Economics 2218, Faculty of Economics, University of Cambridge.
    18. Kwangmin Jung & Donggyu Kim & Seunghyeon Yu, 2022. "Next generation models for portfolio risk management: An approach using financial big data," Journal of Risk & Insurance, The American Risk and Insurance Association, vol. 89(3), pages 765-787, September.
    19. Li, Hong & Porth, Lysa & Tan, Ken Seng & Zhu, Wenjun, 2021. "Improved index insurance design and yield estimation using a dynamic factor forecasting approach," Insurance: Mathematics and Economics, Elsevier, vol. 96(C), pages 208-221.
    20. Chen, Dachuan, 2024. "High frequency principal component analysis based on correlation matrix that is robust to jumps, microstructure noise and asynchronous observation times," Journal of Econometrics, Elsevier, vol. 240(1).
    21. Jianqing Fan & Yuan Liao & Han Liu, 2016. "An overview of the estimation of large covariance and precision matrices," Econometrics Journal, Royal Economic Society, vol. 19(1), pages 1-32, February.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:compst:v:38:y:2023:i:2:d:10.1007_s00180-022-01270-z. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.