IDEAS home Printed from https://ideas.repec.org/p/nbr/nberwo/21584.html
   My bibliography  Save this paper

Principal Component Analysis of High Frequency Data

Author

Listed:
  • Yacine Aït-Sahalia
  • Dacheng Xiu

Abstract

We develop the necessary methodology to conduct principal component analysis at high frequency. We construct estimators of realized eigenvalues, eigenvectors, and principal components and provide the asymptotic distribution of these estimators. Empirically, we study the high frequency covariance structure of the constituents of the S&P 100 Index using as little as one week of high frequency data at a time. The explanatory power of the high frequency principal components varies over time. During the recent financial crisis, the first principal component becomes increasingly dominant, explaining up to 60% of the variation on its own, while the second principal component drives the common variation of financial sector stocks.

Suggested Citation

  • Yacine Aït-Sahalia & Dacheng Xiu, 2015. "Principal Component Analysis of High Frequency Data," NBER Working Papers 21584, National Bureau of Economic Research, Inc.
  • Handle: RePEc:nbr:nberwo:21584
    Note: AP
    as

    Download full text from publisher

    File URL: http://www.nber.org/papers/w21584.pdf
    Download Restriction: no
    ---><---

    Other versions of this item:

    References listed on IDEAS

    as
    1. repec:hal:journl:peer-00815564 is not listed on IDEAS
    2. Jushan Bai & Serena Ng, 2002. "Determining the Number of Factors in Approximate Factor Models," Econometrica, Econometric Society, vol. 70(1), pages 191-221, January.
    3. Tao, Minjing & Wang, Yazhen & Yao, Qiwei & Zou, Jian, 2011. "Large Volatility Matrix Inference via Combining Low-Frequency and High-Frequency Approaches," Journal of the American Statistical Association, American Statistical Association, vol. 106(495), pages 1025-1040.
    4. Christensen, Kim & Kinnebrock, Silja & Podolskij, Mark, 2010. "Pre-averaging estimators of the ex-post covariance matrix in noisy diffusion models with non-synchronous data," Journal of Econometrics, Elsevier, vol. 159(1), pages 116-133, November.
    5. Stephen A. Ross, 2013. "The Arbitrage Theory of Capital Asset Pricing," World Scientific Book Chapters, in: Leonard C MacLean & William T Ziemba (ed.), HANDBOOK OF THE FUNDAMENTALS OF FINANCIAL DECISION MAKING Part I, chapter 1, pages 11-30, World Scientific Publishing Co. Pte. Ltd..
    6. Per A. Mykland & Lan Zhang, 2009. "Inference for Continuous Semimartingales Observed at High Frequency," Econometrica, Econometric Society, vol. 77(5), pages 1403-1445, September.
    7. Barndorff-Nielsen, Ole E. & Hansen, Peter Reinhard & Lunde, Asger & Shephard, Neil, 2011. "Multivariate realised kernels: Consistent positive semi-definite estimators of the covariation of equity prices with noise and non-synchronous trading," Journal of Econometrics, Elsevier, vol. 162(2), pages 149-169, June.
    8. Yacine Aït-Sahalia & Jean Jacod, 2014. "High-Frequency Financial Econometrics," Economics Books, Princeton University Press, edition 1, number 10261.
    9. Chamberlain, Gary & Rothschild, Michael, 1983. "Arbitrage, Factor Structure, and Mean-Variance Analysis on Large Asset Markets," Econometrica, Econometric Society, vol. 51(5), pages 1281-1304, September.
    10. Forni, Mario & Lippi, Marco, 2001. "The Generalized Dynamic Factor Model: Representation Theory," Econometric Theory, Cambridge University Press, vol. 17(6), pages 1113-1141, December.
    11. Mario Forni & Marc Hallin & Marco Lippi & Lucrezia Reichlin, 2000. "The Generalized Dynamic-Factor Model: Identification And Estimation," The Review of Economics and Statistics, MIT Press, vol. 82(4), pages 540-554, November.
    12. Bai, Z. D. & Silverstein, Jack W. & Yin, Y. Q., 1988. "A note on the largest eigenvalue of a large dimensional sample covariance matrix," Journal of Multivariate Analysis, Elsevier, vol. 26(2), pages 166-168, August.
    13. Connor, Gregory & Korajczyk, Robert A., 1988. "Risk and return in an equilibrium APT : Application of a new test methodology," Journal of Financial Economics, Elsevier, vol. 21(2), pages 255-289, September.
    14. repec:hal:journl:peer-00732537 is not listed on IDEAS
    15. Johnstone, Iain M. & Lu, Arthur Yu, 2009. "On Consistency and Sparsity for Principal Components Analysis in High Dimensions," Journal of the American Statistical Association, American Statistical Association, vol. 104(486), pages 682-693.
    16. Tao, Minjing & Wang, Yazhen & Chen, Xiaohong, 2013. "Fast Convergence Rates In Estimating Large Volatility Matrices Using High-Frequency Financial Data," Econometric Theory, Cambridge University Press, vol. 29(4), pages 838-856, August.
    17. Li, Baibing & Martin, Elaine B. & Morris, A. Julian, 2002. "On principal component analysis in L1," Computational Statistics & Data Analysis, Elsevier, vol. 40(3), pages 471-474, September.
    18. Jushan Bai, 2003. "Inferential Theory for Factor Models of Large Dimensions," Econometrica, Econometric Society, vol. 71(1), pages 135-171, January.
    19. Fama, Eugene F. & French, Kenneth R., 1993. "Common risk factors in the returns on stocks and bonds," Journal of Financial Economics, Elsevier, vol. 33(1), pages 3-56, February.
    20. Egloff, Daniel & Leippold, Markus & Wu, Liuren, 2010. "The Term Structure of Variance Swap Rates and Optimal Variance Swap Investments," Journal of Financial and Quantitative Analysis, Cambridge University Press, vol. 45(5), pages 1279-1310, October.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Aït-Sahalia, Yacine & Xiu, Dacheng, 2017. "Using principal component analysis to estimate a high dimensional factor model with high-frequency data," Journal of Econometrics, Elsevier, vol. 201(2), pages 384-399.
    2. Dai, Chaoxing & Lu, Kun & Xiu, Dacheng, 2019. "Knowing factors or factor loadings, or neither? Evaluating estimators of large covariance matrices with noisy and asynchronous data," Journal of Econometrics, Elsevier, vol. 208(1), pages 43-79.
    3. Zura Kakushadze & Willie Yu, 2016. "Multifactor Risk Models and Heterotic CAPM," Papers 1602.04902, arXiv.org, revised Mar 2016.
    4. Jianqing Fan & Yuan Liao & Martina Mincheva, 2013. "Large covariance estimation by thresholding principal orthogonal complements," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 75(4), pages 603-680, September.
    5. Zura Kakushadze, 2015. "Heterotic Risk Models," Papers 1508.04883, arXiv.org, revised Jan 2016.
    6. Zura Kakushadze & Willie Yu, 2016. "Statistical Risk Models," Papers 1602.08070, arXiv.org, revised Jan 2017.
    7. Massacci, Daniele, 2017. "Least squares estimation of large dimensional threshold factor models," Journal of Econometrics, Elsevier, vol. 197(1), pages 101-129.
    8. Bai, Jushan & Ando, Tomohiro, 2013. "Multifactor asset pricing with a large number of observable risk factors and unobservable common and group-specific factors," MPRA Paper 52785, University Library of Munich, Germany, revised Dec 2013.
    9. Gagliardini, Patrick & Ossola, Elisa & Scaillet, Olivier, 2019. "A diagnostic criterion for approximate factor structure," Journal of Econometrics, Elsevier, vol. 212(2), pages 503-521.
    10. repec:gnv:wpaper:unige:76321 is not listed on IDEAS
    11. Patrick Gagliardini & Elisa Ossola & Olivier Scaillet, 2016. "Time‐Varying Risk Premium in Large Cross‐Sectional Equity Data Sets," Econometrica, Econometric Society, vol. 84, pages 985-1046, May.
    12. Onatski, Alexei, 2012. "Asymptotics of the principal components estimator of large factor models with weakly influential factors," Journal of Econometrics, Elsevier, vol. 168(2), pages 244-258.
    13. Gagliardini, Patrick & Ossola, Elisa & Scaillet, Olivier, 2019. "Estimation of large dimensional conditional factor models in finance," Working Papers unige:125031, University of Geneva, Geneva School of Economics and Management.
    14. Jianqing Fan & Alex Furger & Dacheng Xiu, 2016. "Incorporating Global Industrial Classification Standard Into Portfolio Allocation: A Simple Factor-Based Large Covariance Matrix Estimator With High-Frequency Data," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 34(4), pages 489-503, October.
    15. Sun, Yucheng & Xu, Wen & Zhang, Chuanhai, 2023. "Identifying latent factors based on high-frequency data," Journal of Econometrics, Elsevier, vol. 233(1), pages 251-270.
    16. Jushan Bai & Shuzhong Shi, 2011. "Estimating High Dimensional Covariance Matrices and its Applications," Annals of Economics and Finance, Society for AEF, vol. 12(2), pages 199-215, November.
    17. Pelger, Markus, 2019. "Large-dimensional factor modeling based on high-frequency observations," Journal of Econometrics, Elsevier, vol. 208(1), pages 23-42.
    18. Li, Y-N. & Chen, J. & Linton, O., 2021. "Estimation of Common Factors for Microstructure Noise and Efficient Price in a High-frequency Dual Factor Model," Cambridge Working Papers in Economics 2150, Faculty of Economics, University of Cambridge.
    19. Tomohiro Ando & Ruey S. Tsay, 2009. "Model selection for generalized linear models with factor‐augmented predictors," Applied Stochastic Models in Business and Industry, John Wiley & Sons, vol. 25(3), pages 207-235, May.
    20. Mario Forni & Luca Gambetti & Luca Sala, 2014. "No News in Business Cycles," Economic Journal, Royal Economic Society, vol. 124(581), pages 1168-1191, December.
    21. Goyal, Amit & Pérignon, Christophe & Villa, Christophe, 2008. "How common are common return factors across the NYSE and Nasdaq?," Journal of Financial Economics, Elsevier, vol. 90(3), pages 252-271, December.

    More about this item

    JEL classification:

    • C22 - Mathematical and Quantitative Methods - - Single Equation Models; Single Variables - - - Time-Series Models; Dynamic Quantile Regressions; Dynamic Treatment Effect Models; Diffusion Processes
    • C55 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Large Data Sets: Modeling and Analysis
    • C58 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Financial Econometrics
    • G01 - Financial Economics - - General - - - Financial Crises

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nbr:nberwo:21584. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: the person in charge (email available below). General contact details of provider: https://edirc.repec.org/data/nberrus.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.