IDEAS home Printed from https://ideas.repec.org/a/eee/econom/v240y2024i1s0304407624000472.html
   My bibliography  Save this article

High frequency principal component analysis based on correlation matrix that is robust to jumps, microstructure noise and asynchronous observation times

Author

Listed:
  • Chen, Dachuan

Abstract

This paper developed the high frequency estimation for the principal component analysis (PCA) based on correlation matrix. This estimation methodology is robust to jumps, microstructure noise and asynchronous observation times simultaneously, which is enabled by the newly proposed Truncated and Smoothed Two-Scales Realized Volatility (Truncated S-TSRV) estimator. The general framework of our methodology is constructed based on the estimation of realized spectral functions with respect to the spot correlation matrix. A new asymptotic representation for the element-wise estimation error of the spot correlation matrix estimate has been derived, resulting in a new bias correction term which is much more complex than that of the PCA based covariance matrix. Central limit theorem and rate of convergence have been developed for the bias-corrected estimator. The standard error estimator has also been proposed. As the empirical study of our methodology, we have constructed the first eigen-portfolio based on the eigenvector estimate corresponding to the largest eigenvalue in the spot correlation matrix. We regress the returns of first eigen-portfolio against that of the market ETF, which obtained non-significant alpha estimate and significant beta estimate which is very close to one.

Suggested Citation

  • Chen, Dachuan, 2024. "High frequency principal component analysis based on correlation matrix that is robust to jumps, microstructure noise and asynchronous observation times," Journal of Econometrics, Elsevier, vol. 240(1).
  • Handle: RePEc:eee:econom:v:240:y:2024:i:1:s0304407624000472
    DOI: 10.1016/j.jeconom.2024.105701
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0304407624000472
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.jeconom.2024.105701?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Jacod, Jean & Li, Yingying & Mykland, Per A. & Podolskij, Mark & Vetter, Mathias, 2009. "Microstructure noise in the continuous case: The pre-averaging approach," Stochastic Processes and their Applications, Elsevier, vol. 119(7), pages 2249-2276, July.
    2. Billio, Monica & Getmansky, Mila & Lo, Andrew W. & Pelizzon, Loriana, 2012. "Econometric measures of connectedness and systemic risk in the finance and insurance sectors," Journal of Financial Economics, Elsevier, vol. 104(3), pages 535-559.
    3. Zhang, Lan & Mykland, Per A. & Ait-Sahalia, Yacine, 2005. "A Tale of Two Time Scales: Determining Integrated Volatility With Noisy High-Frequency Data," Journal of the American Statistical Association, American Statistical Association, vol. 100, pages 1394-1411, December.
    4. Aït-Sahalia, Yacine & Mykland, Per A. & Zhang, Lan, 2011. "Ultra high frequency volatility estimation with dependent microstructure noise," Journal of Econometrics, Elsevier, vol. 160(1), pages 160-175, January.
    5. Aït-Sahalia, Yacine & Xiu, Dacheng, 2017. "Using principal component analysis to estimate a high dimensional factor model with high-frequency data," Journal of Econometrics, Elsevier, vol. 201(2), pages 384-399.
    6. Marco Avellaneda, 2020. "Hierarchical PCA and Applications to Portfolio Management," Remef - Revista Mexicana de Economía y Finanzas Nueva Época REMEF (The Mexican Journal of Economics and Finance), Instituto Mexicano de Ejecutivos de Finanzas, IMEF, vol. 15(1), pages 1-16, Enero - M.
    7. Dominik M. Rösch & Avanidhar Subrahmanyam & Mathijs A. van Dijk, 2017. "The Dynamics of Market Efficiency," The Review of Financial Studies, Society for Financial Studies, vol. 30(4), pages 1151-1187.
    8. Yacine Aït-Sahalia & Jean Jacod, 2014. "High-Frequency Financial Econometrics," Economics Books, Princeton University Press, edition 1, number 10261.
    9. Kollo, T. & Neudecker, H., 1993. "Asymptotics of Eigenvalues and Unit-Length Eigenvectors of Sample Variance and Correlation Matrices," Journal of Multivariate Analysis, Elsevier, vol. 47(2), pages 283-300, November.
    10. Dion Bongaerts & Frank De Jong & Joost Driessen, 2011. "Derivative Pricing with Liquidity Risk: Theory and Evidence from the Credit Default Swap Market," Journal of Finance, American Finance Association, vol. 66(1), pages 203-240, February.
    11. Ekkehart Boehmer & Dan Li & Gideon Saar, 2018. "The Competitive Landscape of High-Frequency Trading Firms," The Review of Financial Studies, Society for Financial Studies, vol. 31(6), pages 2227-2276.
    12. Xin-Bing Kong, 2017. "On the number of common factors with high-frequency data," Biometrika, Biometrika Trust, vol. 104(2), pages 397-410.
    13. Sydney C. Ludvigson & Serena Ng, 2009. "Macro Factors in Bond Risk Premia," The Review of Financial Studies, Society for Financial Studies, vol. 22(12), pages 5027-5067, December.
    14. Dick-Nielsen, Jens & Feldhütter, Peter & Lando, David, 2012. "Corporate bond liquidity before and after the onset of the subprime crisis," Journal of Financial Economics, Elsevier, vol. 103(3), pages 471-492.
    15. Pelger, Markus, 2019. "Large-dimensional factor modeling based on high-frequency observations," Journal of Econometrics, Elsevier, vol. 208(1), pages 23-42.
    16. Dachuan Chen & Per A. Mykland & Lan Zhang, 2020. "The Five Trolls Under the Bridge: Principal Component Analysis With Asynchronous and Noisy High Frequency Data," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 115(532), pages 1960-1977, December.
    17. Mykland, Per A. & Zhang, Lan & Chen, Dachuan, 2019. "The algebra of two scales estimation, and the S-TSRV: High frequency estimation that is robust to sampling times," Journal of Econometrics, Elsevier, vol. 208(1), pages 101-119.
    18. Per A. Mykland & Lan Zhang, 2017. "Assessment of Uncertainty in High Frequency Data: The Observed Asymptotic Variance," Econometrica, Econometric Society, vol. 85, pages 197-231, January.
    19. Dai, Chaoxing & Lu, Kun & Xiu, Dacheng, 2019. "Knowing factors or factor loadings, or neither? Evaluating estimators of large covariance matrices with noisy and asynchronous data," Journal of Econometrics, Elsevier, vol. 208(1), pages 43-79.
    20. Bollerslev, Tim & Meddahi, Nour & Nyawa, Serge, 2019. "High-dimensional multivariate realized volatility estimation," Journal of Econometrics, Elsevier, vol. 212(1), pages 116-136.
    21. Fang, C. & Krishnaiah, P. R., 1982. "Asymptotic distributions of functions of the eigenvalues of some random matrices for nonnormal populations," Journal of Multivariate Analysis, Elsevier, vol. 12(1), pages 39-63, March.
    22. Yacine Aït-Sahalia & Dacheng Xiu, 2019. "Principal Component Analysis of High-Frequency Data," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 114(525), pages 287-303, January.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Choi, Jungjun & Yang, Xiye, 2022. "Asymptotic properties of correlation-based principal component analysis," Journal of Econometrics, Elsevier, vol. 229(1), pages 1-18.
    2. Chen, Dachuan & Mykland, Per A. & Zhang, Lan, 2024. "Realized regression with asynchronous and noisy high frequency and high dimensional data," Journal of Econometrics, Elsevier, vol. 239(2).
    3. Bu, R. & Li, D. & Linton, O. & Wang, H., 2022. "Nonparametric Estimation of Large Spot Volatility Matrices for High-Frequency Financial Data," Cambridge Working Papers in Economics 2218, Faculty of Economics, University of Cambridge.
    4. Ruijun Bu & Degui Li & Oliver Linton & Hanchao Wang, 2022. "Nonparametric Estimation of Large Spot Volatility Matrices for High-Frequency Financial Data," Working Papers 202212, University of Liverpool, Department of Economics.
    5. Bollerslev, Tim & Meddahi, Nour & Nyawa, Serge, 2019. "High-dimensional multivariate realized volatility estimation," Journal of Econometrics, Elsevier, vol. 212(1), pages 116-136.
    6. Cai, T. Tony & Hu, Jianchang & Li, Yingying & Zheng, Xinghua, 2020. "High-dimensional minimum variance portfolio estimation based on high-frequency data," Journal of Econometrics, Elsevier, vol. 214(2), pages 482-494.
    7. Shin, Minseok & Kim, Donggyu & Fan, Jianqing, 2023. "Adaptive robust large volatility matrix estimation based on high-frequency financial data," Journal of Econometrics, Elsevier, vol. 237(1).
    8. Cheng, Mingmian & Swanson, Norman R. & Yang, Xiye, 2021. "Forecasting volatility using double shrinkage methods," Journal of Empirical Finance, Elsevier, vol. 62(C), pages 46-61.
    9. Sun, Yucheng & Xu, Wen & Zhang, Chuanhai, 2023. "Identifying latent factors based on high-frequency data," Journal of Econometrics, Elsevier, vol. 233(1), pages 251-270.
    10. Li, Y-N. & Chen, J. & Linton, O., 2021. "Estimation of Common Factors for Microstructure Noise and Efficient Price in a High-frequency Dual Factor Model," Cambridge Working Papers in Economics 2150, Faculty of Economics, University of Cambridge.
    11. Mykland, Per A. & Zhang, Lan, 2021. "The Observed Asymptotic Variance: Hard edges, and a regression approach," Journal of Econometrics, Elsevier, vol. 222(1), pages 411-428.
    12. Chen, Richard Y. & Mykland, Per A., 2017. "Model-free approaches to discern non-stationary microstructure noise and time-varying liquidity in high-frequency data," Journal of Econometrics, Elsevier, vol. 200(1), pages 79-103.
    13. Li, Z. Merrick & Laeven, Roger J.A. & Vellekoop, Michel H., 2020. "Dependent microstructure noise and integrated volatility estimation from high-frequency data," Journal of Econometrics, Elsevier, vol. 215(2), pages 536-558.
    14. Shen, Yiwen & Shi, Meiqi, 2024. "Intraday variation in cross-sectional stock comovement and impact of index-based strategies," Journal of Financial Markets, Elsevier, vol. 68(C).
    15. Dovonon, Prosper & Taamouti, Abderrahim & Williams, Julian, 2022. "Testing the eigenvalue structure of spot and integrated covariance," Journal of Econometrics, Elsevier, vol. 229(2), pages 363-395.
    16. Kim, Donggyu & Kong, Xin-Bing & Li, Cui-Xia & Wang, Yazhen, 2018. "Adaptive thresholding for large volatility matrix estimation based on high-frequency financial data," Journal of Econometrics, Elsevier, vol. 203(1), pages 69-79.
    17. Li, Yingying & Liu, Guangying & Zhang, Zhiyuan, 2022. "Volatility of volatility: Estimation and tests based on noisy high frequency data with jumps," Journal of Econometrics, Elsevier, vol. 229(2), pages 422-451.
    18. Richard Y. Chen & Per A. Mykland, 2015. "Model-Free Approaches to Discern Non-Stationary Microstructure Noise and Time-Varying Liquidity in High-Frequency Data," Papers 1512.06159, arXiv.org, revised Oct 2018.
    19. Li, Yingying & Xie, Shangyu & Zheng, Xinghua, 2016. "Efficient estimation of integrated volatility incorporating trading information," Journal of Econometrics, Elsevier, vol. 195(1), pages 33-50.
    20. Donggyu Kim & Minseog Oh, 2023. "Dynamic Realized Minimum Variance Portfolio Models," Papers 2310.13511, arXiv.org.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:econom:v:240:y:2024:i:1:s0304407624000472. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/jeconom .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.