IDEAS home Printed from https://ideas.repec.org/a/eee/econom/v208y2019i1p43-79.html
   My bibliography  Save this article

Knowing factors or factor loadings, or neither? Evaluating estimators of large covariance matrices with noisy and asynchronous data

Author

Listed:
  • Dai, Chaoxing
  • Lu, Kun
  • Xiu, Dacheng

Abstract

We investigate estimators of factor-model-based large covariance (and precision) matrices using high-frequency data, which are asynchronous and potentially contaminated by the market microstructure noise. Our estimation strategies rely on the pre-averaging method with refresh time to solve the microstructure problems, while using three different specifications of factor models with a variety of thresholding methods, respectively, to battle the curse of dimensionality. To estimate a factor model, we either adopt the time-series regression (TSR) to recover loadings if factors are known, or use the cross-sectional regression (CSR) to recover factors from known loadings, or use the principal component analysis (PCA) if neither factors nor their loadings are assumed known. We compare the convergence rates in these scenarios using the joint in-fill and increasing dimensionality asymptotics. To evaluate the empirical trade-off between robustness to model misspecification and statistical efficiency among all 30 combinations of estimation strategies, we run a horse race on the out-of-sample portfolio allocation with Dow Jones 30, S&P 100, and S&P 500 index constituents, respectively, and find the pre-averaging-based strategy using TSR or PCA with location thresholding dominates, especially over the subsampling-based alternatives.

Suggested Citation

  • Dai, Chaoxing & Lu, Kun & Xiu, Dacheng, 2019. "Knowing factors or factor loadings, or neither? Evaluating estimators of large covariance matrices with noisy and asynchronous data," Journal of Econometrics, Elsevier, vol. 208(1), pages 43-79.
  • Handle: RePEc:eee:econom:v:208:y:2019:i:1:p:43-79
    DOI: 10.1016/j.jeconom.2018.09.005
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0304407618301702
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.jeconom.2018.09.005?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. repec:hal:journl:peer-00815564 is not listed on IDEAS
    2. Zou, Hui, 2006. "The Adaptive Lasso and Its Oracle Properties," Journal of the American Statistical Association, American Statistical Association, vol. 101, pages 1418-1429, December.
    3. Bibinger, Markus, 2012. "An estimator for the quadratic covariation of asynchronously observed Itô processes with noise: Asymptotic distribution theory," Stochastic Processes and their Applications, Elsevier, vol. 122(6), pages 2411-2453.
    4. Forni, Mario & Lippi, Marco, 2001. "The Generalized Dynamic Factor Model: Representation Theory," Econometric Theory, Cambridge University Press, vol. 17(6), pages 1113-1141, December.
    5. Jianqing Fan & Yuan Liao & Martina Mincheva, 2013. "Large covariance estimation by thresholding principal orthogonal complements," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 75(4), pages 603-680, September.
    6. Forni, Mario & Hallin, Marc & Lippi, Marco & Reichlin, Lucrezia, 2004. "The generalized dynamic factor model consistency and rates," Journal of Econometrics, Elsevier, vol. 119(2), pages 231-255, April.
    7. Doz, Catherine & Giannone, Domenico & Reichlin, Lucrezia, 2011. "A two-step estimator for large approximate dynamic factor models based on Kalman filtering," Journal of Econometrics, Elsevier, vol. 164(1), pages 188-205, September.
    8. Ikeda, Shin S., 2016. "A bias-corrected estimator of the covariation matrix of multiple security prices when both microstructure effects and sampling durations are persistent and endogenous," Journal of Econometrics, Elsevier, vol. 193(1), pages 203-214.
    9. Jushan Bai & Serena Ng, 2002. "Determining the Number of Factors in Approximate Factor Models," Econometrica, Econometric Society, vol. 70(1), pages 191-221, January.
    10. Gregory Connor & Matthias Hagmann & Oliver Linton, 2012. "Efficient Semiparametric Estimation of the Fama–French Model and Extensions," Econometrica, Econometric Society, vol. 80(2), pages 713-754, March.
    11. Stephen A. Ross, 2013. "The Arbitrage Theory of Capital Asset Pricing," World Scientific Book Chapters, in: Leonard C MacLean & William T Ziemba (ed.), HANDBOOK OF THE FUNDAMENTALS OF FINANCIAL DECISION MAKING Part I, chapter 1, pages 11-30, World Scientific Publishing Co. Pte. Ltd..
    12. Nikolaus Hautsch & Lada M. Kyj & Roel C. A. Oomen, 2012. "A blocking and regularization approach to high‐dimensional realized covariance estimation," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 27(4), pages 625-645, June.
    13. Hyungsik Roger Moon & Martin Weidner, 2015. "Linear Regression for Panel With Unknown Number of Factors as Interactive Fixed Effects," Econometrica, Econometric Society, vol. 83(4), pages 1543-1579, July.
    14. Tao, Minjing & Wang, Yazhen & Yao, Qiwei & Zou, Jian, 2011. "Large Volatility Matrix Inference via Combining Low-Frequency and High-Frequency Approaches," Journal of the American Statistical Association, American Statistical Association, vol. 106(495), pages 1025-1040.
    15. Barndorff-Nielsen, Ole E. & Hansen, Peter Reinhard & Lunde, Asger & Shephard, Neil, 2011. "Multivariate realised kernels: Consistent positive semi-definite estimators of the covariation of equity prices with noise and non-synchronous trading," Journal of Econometrics, Elsevier, vol. 162(2), pages 149-169, June.
    16. Jacod, Jean & Li, Yingying & Mykland, Per A. & Podolskij, Mark & Vetter, Mathias, 2009. "Microstructure noise in the continuous case: The pre-averaging approach," Stochastic Processes and their Applications, Elsevier, vol. 119(7), pages 2249-2276, July.
    17. Diebold, Francis X & Mariano, Roberto S, 2002. "Comparing Predictive Accuracy," Journal of Business & Economic Statistics, American Statistical Association, vol. 20(1), pages 134-144, January.
    18. Croux, Christophe & Renault, Eric & Werker, Bas, 2004. "Dynamic factor models," Journal of Econometrics, Elsevier, vol. 119(2), pages 223-230, April.
    19. repec:hal:journl:peer-00844811 is not listed on IDEAS
    20. Zhang, Lan, 2011. "Estimating covariation: Epps effect, microstructure noise," Journal of Econometrics, Elsevier, vol. 160(1), pages 33-47, January.
    21. Carhart, Mark M, 1997. "On Persistence in Mutual Fund Performance," Journal of Finance, American Finance Association, vol. 52(1), pages 57-82, March.
    22. Tao, Minjing & Wang, Yazhen & Chen, Xiaohong, 2013. "Fast Convergence Rates In Estimating Large Volatility Matrices Using High-Frequency Financial Data," Econometric Theory, Cambridge University Press, vol. 29(4), pages 838-856, August.
    23. Christensen, Kim & Kinnebrock, Silja & Podolskij, Mark, 2010. "Pre-averaging estimators of the ex-post covariance matrix in noisy diffusion models with non-synchronous data," Journal of Econometrics, Elsevier, vol. 159(1), pages 116-133, November.
    24. Aït-Sahalia, Yacine & Xiu, Dacheng, 2017. "Using principal component analysis to estimate a high dimensional factor model with high-frequency data," Journal of Econometrics, Elsevier, vol. 201(2), pages 384-399.
    25. Li, Yingying & Mykland, Per A. & Renault, Eric & Zhang, Lan & Zheng, Xinghua, 2014. "Realized Volatility When Sampling Times Are Possibly Endogenous," Econometric Theory, Cambridge University Press, vol. 30(3), pages 580-605, June.
    26. Kim, Donggyu & Wang, Yazhen & Zou, Jian, 2016. "Asymptotic theory for large volatility matrix estimation based on high-frequency financial data," Stochastic Processes and their Applications, Elsevier, vol. 126(11), pages 3527-3577.
    27. Shephard, Neil & Xiu, Dacheng, 2017. "Econometric analysis of multivariate realised QML: Estimation of the covariation of equity prices under asynchronous trading," Journal of Econometrics, Elsevier, vol. 201(1), pages 19-42.
    28. Connor, Gregory & Linton, Oliver, 2007. "Semiparametric estimation of a characteristic-based factor model of common stock returns," Journal of Empirical Finance, Elsevier, vol. 14(5), pages 694-717, December.
    29. Todorov, Viktor & Bollerslev, Tim, 2010. "Jumps and betas: A new framework for disentangling and estimating systematic risks," Journal of Econometrics, Elsevier, vol. 157(2), pages 220-235, August.
    30. Jianqing Fan & Jingjin Zhang & Ke Yu, 2012. "Vast Portfolio Selection With Gross-Exposure Constraints," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 107(498), pages 592-606, June.
    31. Yacine Aït-Sahalia & Jean Jacod, 2014. "High-Frequency Financial Econometrics," Economics Books, Princeton University Press, edition 1, number 10261.
    32. Fan J. & Li R., 2001. "Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties," Journal of the American Statistical Association, American Statistical Association, vol. 96, pages 1348-1360, December.
    33. Rosenberg, Barr, 1974. "Extra-Market Components of Covariance in Security Returns," Journal of Financial and Quantitative Analysis, Cambridge University Press, vol. 9(2), pages 263-274, March.
    34. Aït-Sahalia, Yacine & Fan, Jianqing & Xiu, Dacheng, 2010. "High-Frequency Covariance Estimates With Noisy and Asynchronous Financial Data," Journal of the American Statistical Association, American Statistical Association, vol. 105(492), pages 1504-1517.
    35. Jushan Bai, 2003. "Inferential Theory for Factor Models of Large Dimensions," Econometrica, Econometric Society, vol. 71(1), pages 135-171, January.
    36. Mario Forni & Marc Hallin & Marco Lippi & Lucrezia Reichlin, 2000. "The Generalized Dynamic-Factor Model: Identification And Estimation," The Review of Economics and Statistics, MIT Press, vol. 82(4), pages 540-554, November.
    37. Jonathan Brogaard & Terrence Hendershott & Ryan Riordan, 2014. "High-Frequency Trading and Price Discovery," The Review of Financial Studies, Society for Financial Studies, vol. 27(8), pages 2267-2306.
    38. Robert F. Engle & Jeffrey R. Russell, 1998. "Autoregressive Conditional Duration: A New Model for Irregularly Spaced Transaction Data," Econometrica, Econometric Society, vol. 66(5), pages 1127-1162, September.
    39. Fama, Eugene F. & French, Kenneth R., 2015. "A five-factor asset pricing model," Journal of Financial Economics, Elsevier, vol. 116(1), pages 1-22.
    40. Kalnina, Ilze & Linton, Oliver, 2008. "Estimating quadratic variation consistently in the presence of endogenous and diurnal measurement error," Journal of Econometrics, Elsevier, vol. 147(1), pages 47-59, November.
    41. Bibinger, Markus & Hautsch, Nikolaus & Malec, Peter & Reiss, Markus, 2013. "Estimating the quadratic covariation matrix from noisy observations: Local method of moments and efficiency," SFB 649 Discussion Papers 2013-017, Humboldt University Berlin, Collaborative Research Center 649: Economic Risk.
    42. Daniel, Kent & Titman, Sheridan, 1997. "Evidence on the Characteristics of Cross Sectional Variation in Stock Returns," Journal of Finance, American Finance Association, vol. 52(1), pages 1-33, March.
    43. Tao, Minjing & Wang, Yahzen & Yao, Qiwei & Zou, Jian, 2011. "Large volatility matrix inference via combining low-frequency and high-frequency approaches," LSE Research Online Documents on Economics 39321, London School of Economics and Political Science, LSE Library.
    44. Bai, Jushan & Ng, Serena, 2006. "Evaluating latent and observed factors in macroeconomics and finance," Journal of Econometrics, Elsevier, vol. 131(1-2), pages 507-537.
    45. Chamberlain, Gary & Rothschild, Michael, 1983. "Arbitrage, Factor Structure, and Mean-Variance Analysis on Large Asset Markets," Econometrica, Econometric Society, vol. 51(5), pages 1281-1304, September.
    46. Fama, Eugene F. & French, Kenneth R., 1993. "Common risk factors in the returns on stocks and bonds," Journal of Financial Economics, Elsevier, vol. 33(1), pages 3-56, February.
    47. Cai, Tony & Liu, Weidong, 2011. "Adaptive Thresholding for Sparse Covariance Matrix Estimation," Journal of the American Statistical Association, American Statistical Association, vol. 106(494), pages 672-684.
    48. Jianqing Fan & Alex Furger & Dacheng Xiu, 2016. "Incorporating Global Industrial Classification Standard Into Portfolio Allocation: A Simple Factor-Based Large Covariance Matrix Estimator With High-Frequency Data," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 34(4), pages 489-503, October.
    49. Christian Brownlees & Eulàlia Nualart & Yucheng Sun, 2018. "Realized networks," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 33(7), pages 986-1006, November.
    50. Alexei Onatski, 2010. "Determining the Number of Factors from Empirical Distribution of Eigenvalues," The Review of Economics and Statistics, MIT Press, vol. 92(4), pages 1004-1016, November.
    51. repec:hal:journl:peer-00732537 is not listed on IDEAS
    52. Rothman, Adam J. & Levina, Elizaveta & Zhu, Ji, 2009. "Generalized Thresholding of Large Covariance Matrices," Journal of the American Statistical Association, American Statistical Association, vol. 104(485), pages 177-186.
    53. Fan, Jianqing & Fan, Yingying & Lv, Jinchi, 2008. "High dimensional covariance matrix estimation using a factor model," Journal of Econometrics, Elsevier, vol. 147(1), pages 186-197, November.
    54. Rasmus Tangsgaard Varneskov, 2016. "Flat-Top Realized Kernel Estimation of Quadratic Covariation With Nonsynchronous and Noisy Asset Prices," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 34(1), pages 1-22, January.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Cai, T. Tony & Hu, Jianchang & Li, Yingying & Zheng, Xinghua, 2020. "High-dimensional minimum variance portfolio estimation based on high-frequency data," Journal of Econometrics, Elsevier, vol. 214(2), pages 482-494.
    2. Ruijun Bu & Degui Li & Oliver Linton & Hanchao Wang, 2022. "Nonparametric Estimation of Large Spot Volatility Matrices for High-Frequency Financial Data," Working Papers 202212, University of Liverpool, Department of Economics.
    3. Bu, R. & Li, D. & Linton, O. & Wang, H., 2022. "Nonparametric Estimation of Large Spot Volatility Matrices for High-Frequency Financial Data," Cambridge Working Papers in Economics 2218, Faculty of Economics, University of Cambridge.
    4. Chen, Dachuan, 2024. "High frequency principal component analysis based on correlation matrix that is robust to jumps, microstructure noise and asynchronous observation times," Journal of Econometrics, Elsevier, vol. 240(1).
    5. Erlin Guo & Cuixia Li & Fengqin Tang, 2023. "The Convergence Rates of Large Volatility Matrix Estimator Based on Noise, Jumps, and Asynchronization," Mathematics, MDPI, vol. 11(6), pages 1-11, March.
    6. Esparcia, Carlos & Escribano, Ana & Jareño, Francisco, 2024. "Assessing the crypto market stability after the FTX collapse: A study of high frequency volatility and connectedness," International Review of Financial Analysis, Elsevier, vol. 94(C).
    7. Sun, Yucheng & Xu, Wen & Zhang, Chuanhai, 2023. "Identifying latent factors based on high-frequency data," Journal of Econometrics, Elsevier, vol. 233(1), pages 251-270.
    8. Bollerslev, Tim & Meddahi, Nour & Nyawa, Serge, 2019. "High-dimensional multivariate realized volatility estimation," Journal of Econometrics, Elsevier, vol. 212(1), pages 116-136.
    9. Chen, Dachuan & Mykland, Per A. & Zhang, Lan, 2024. "Realized regression with asynchronous and noisy high frequency and high dimensional data," Journal of Econometrics, Elsevier, vol. 239(2).
    10. Tae-Hwy Lee & Ekaterina Seregina, 2020. "Learning from Forecast Errors: A New Approach to Forecast Combination," Working Papers 202024, University of California at Riverside, Department of Economics.
    11. Davide Lauria & W. Brent Lindquist & Svetlozar T. Rachev, 2023. "Enhancing CVaR portfolio optimisation performance with GAM factor models," Papers 2401.00188, arXiv.org.
    12. Li, Y-N. & Chen, J. & Linton, O., 2021. "Estimation of Common Factors for Microstructure Noise and Efficient Price in a High-frequency Dual Factor Model," Cambridge Working Papers in Economics 2150, Faculty of Economics, University of Cambridge.
    13. Li, Degui, 2024. "Estimation of Large Dynamic Covariance Matrices: A Selective Review," Econometrics and Statistics, Elsevier, vol. 29(C), pages 16-30.
    14. Donggyu Kim & Minseog Oh, 2024. "Dynamic Realized Minimum Variance Portfolio Models," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 42(4), pages 1238-1249, October.
    15. Esparcia, Carlos & Escribano, Ana & Jareño, Francisco, 2023. "Did cryptomarket chaos unleash Silvergate's bankruptcy? investigating the high-frequency volatility and connectedness behind the collapse," Journal of International Financial Markets, Institutions and Money, Elsevier, vol. 89(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Aït-Sahalia, Yacine & Xiu, Dacheng, 2017. "Using principal component analysis to estimate a high dimensional factor model with high-frequency data," Journal of Econometrics, Elsevier, vol. 201(2), pages 384-399.
    2. Li, Y-N. & Chen, J. & Linton, O., 2021. "Estimation of Common Factors for Microstructure Noise and Efficient Price in a High-frequency Dual Factor Model," Cambridge Working Papers in Economics 2150, Faculty of Economics, University of Cambridge.
    3. Gagliardini, Patrick & Ossola, Elisa & Scaillet, Olivier, 2019. "Estimation of large dimensional conditional factor models in finance," Working Papers unige:125031, University of Geneva, Geneva School of Economics and Management.
    4. Jianqing Fan & Alex Furger & Dacheng Xiu, 2016. "Incorporating Global Industrial Classification Standard Into Portfolio Allocation: A Simple Factor-Based Large Covariance Matrix Estimator With High-Frequency Data," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 34(4), pages 489-503, October.
    5. Kim, Donggyu & Song, Xinyu & Wang, Yazhen, 2022. "Unified discrete-time factor stochastic volatility and continuous-time Itô models for combining inference based on low-frequency and high-frequency," Journal of Multivariate Analysis, Elsevier, vol. 192(C).
    6. Jianqing Fan & Yuan Liao & Martina Mincheva, 2013. "Large covariance estimation by thresholding principal orthogonal complements," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 75(4), pages 603-680, September.
    7. Pelger, Markus, 2019. "Large-dimensional factor modeling based on high-frequency observations," Journal of Econometrics, Elsevier, vol. 208(1), pages 23-42.
    8. Sun, Yucheng & Xu, Wen & Zhang, Chuanhai, 2023. "Identifying latent factors based on high-frequency data," Journal of Econometrics, Elsevier, vol. 233(1), pages 251-270.
    9. Jianqing Fan & Yuan Liao & Han Liu, 2016. "An overview of the estimation of large covariance and precision matrices," Econometrics Journal, Royal Economic Society, vol. 19(1), pages 1-32, February.
    10. Fan, Jianqing & Kim, Donggyu, 2019. "Structured volatility matrix estimation for non-synchronized high-frequency financial data," Journal of Econometrics, Elsevier, vol. 209(1), pages 61-78.
    11. Shephard, Neil & Xiu, Dacheng, 2017. "Econometric analysis of multivariate realised QML: Estimation of the covariation of equity prices under asynchronous trading," Journal of Econometrics, Elsevier, vol. 201(1), pages 19-42.
    12. Xinyu Song, 2019. "Large Volatility Matrix Prediction with High-Frequency Data," Papers 1907.01196, arXiv.org, revised Sep 2019.
    13. Kim, Donggyu & Fan, Jianqing, 2019. "Factor GARCH-Itô models for high-frequency data with application to large volatility matrix prediction," Journal of Econometrics, Elsevier, vol. 208(2), pages 395-417.
    14. Yacine Aït-Sahalia & Dacheng Xiu, 2019. "Principal Component Analysis of High-Frequency Data," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 114(525), pages 287-303, January.
    15. Bai, Jushan & Liao, Yuan, 2012. "Efficient Estimation of Approximate Factor Models," MPRA Paper 41558, University Library of Munich, Germany.
    16. Jianqing Fan & Kunpeng Li & Yuan Liao, 2020. "Recent Developments on Factor Models and its Applications in Econometric Learning," Papers 2009.10103, arXiv.org.
    17. Jian, Zhihong & Deng, Pingjun & Zhu, Zhican, 2018. "High-dimensional covariance forecasting based on principal component analysis of high-frequency data," Economic Modelling, Elsevier, vol. 75(C), pages 422-431.
    18. Li, Kunpeng & Li, Qi & Lu, Lina, 2018. "Quasi maximum likelihood analysis of high dimensional constrained factor models," Journal of Econometrics, Elsevier, vol. 206(2), pages 574-612.
    19. Bai, Jushan & Liao, Yuan, 2016. "Efficient estimation of approximate factor models via penalized maximum likelihood," Journal of Econometrics, Elsevier, vol. 191(1), pages 1-18.
    20. Choi, Sung Hoon & Kim, Donggyu, 2023. "Large volatility matrix analysis using global and national factor models," Journal of Econometrics, Elsevier, vol. 235(2), pages 1917-1933.

    More about this item

    Keywords

    High-dimensional data; High-frequency data; Factor model; Pre-averaging estimator; Portfolio allocation; Low-rank plus sparse covariance matrix; Barra covariance matrix estimator;
    All these keywords.

    JEL classification:

    • C13 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - Estimation: General
    • C14 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - Semiparametric and Nonparametric Methods: General
    • C55 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Large Data Sets: Modeling and Analysis
    • C58 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Financial Econometrics
    • G01 - Financial Economics - - General - - - Financial Crises

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:econom:v:208:y:2019:i:1:p:43-79. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/jeconom .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.