IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2102.11341.html
   My bibliography  Save this paper

Bridging factor and sparse models

Author

Listed:
  • Jianqing Fan
  • Ricardo Masini
  • Marcelo C. Medeiros

Abstract

Factor and sparse models are two widely used methods to impose a low-dimensional structure in high-dimensions. However, they are seemingly mutually exclusive. We propose a lifting method that combines the merits of these two models in a supervised learning methodology that allows for efficiently exploring all the information in high-dimensional datasets. The method is based on a flexible model for high-dimensional panel data, called factor-augmented regression model with observable and/or latent common factors, as well as idiosyncratic components. This model not only includes both principal component regression and sparse regression as specific models but also significantly weakens the cross-sectional dependence and facilitates model selection and interpretability. The method consists of several steps and a novel test for (partial) covariance structure in high dimensions to infer the remaining cross-section dependence at each step. We develop the theory for the model and demonstrate the validity of the multiplier bootstrap for testing a high-dimensional (partial) covariance structure. The theory is supported by a simulation study and applications.

Suggested Citation

  • Jianqing Fan & Ricardo Masini & Marcelo C. Medeiros, 2021. "Bridging factor and sparse models," Papers 2102.11341, arXiv.org, revised Sep 2022.
  • Handle: RePEc:arx:papers:2102.11341
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2102.11341
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Barigozzi, Matteo & Hallin, Marc & Soccorsi, Stefano & von Sachs, Rainer, 2021. "Time-varying general dynamic factor models and the measurement of financial connectedness," Journal of Econometrics, Elsevier, vol. 222(1), pages 324-343.
    2. Matteo Barigozzi & Christian Brownlees, 2019. "NETS: Network estimation for time series," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 34(3), pages 347-364, April.
    3. Ledoit, Olivier & Wolf, Michael, 2004. "A well-conditioned estimator for large-dimensional covariance matrices," Journal of Multivariate Analysis, Elsevier, vol. 88(2), pages 365-411, February.
    4. Domenico Giannone & Michele Lenza & Giorgio E. Primiceri, 2021. "Economic Predictions With Big Data: The Illusion of Sparsity," Econometrica, Econometric Society, vol. 89(5), pages 2409-2437, September.
    5. Medeiros, Marcelo C. & Mendes, Eduardo F., 2016. "ℓ1-regularization of high-dimensional time-series models with non-Gaussian and heteroskedastic errors," Journal of Econometrics, Elsevier, vol. 191(1), pages 255-271.
    6. Bai, Jushan & Liao, Yuan, 2017. "Inferences in panel data with interactive effects using large covariance matrices," Journal of Econometrics, Elsevier, vol. 200(1), pages 59-78.
    7. Hyungsik Roger Moon & Martin Weidner, 2015. "Linear Regression for Panel With Unknown Number of Factors as Interactive Fixed Effects," Econometrica, Econometric Society, vol. 83(4), pages 1543-1579, July.
    8. Laurent Gobillon & Thierry Magnac, 2016. "Regional Policy Evaluation: Interactive Fixed Effects and Synthetic Controls," The Review of Economics and Statistics, MIT Press, vol. 98(3), pages 535-551, July.
    9. Barigozzi, Matteo & Hallin, Marc, 2020. "Generalized dynamic factor models and volatilities: Consistency, rates, and prediction intervals," Journal of Econometrics, Elsevier, vol. 216(1), pages 4-34.
    10. Jianqing Fan & Yuan Liao & Martina Mincheva, 2013. "Large covariance estimation by thresholding principal orthogonal complements," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 75(4), pages 603-680, September.
    11. Matteo Barigozzi & Marc Hallin, 2017. "A network analysis of the volatility of high dimensional financial series," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 66(3), pages 581-605, April.
    12. Diebold, Francis X. & Yılmaz, Kamil, 2014. "On the network topology of variance decompositions: Measuring the connectedness of financial firms," Journal of Econometrics, Elsevier, vol. 182(1), pages 119-134.
    13. Gagliardini, Patrick & Ossola, Elisa & Scaillet, Olivier, 2019. "A diagnostic criterion for approximate factor structure," Journal of Econometrics, Elsevier, vol. 212(2), pages 503-521.
    14. Shihao Gu & Bryan Kelly & Dacheng Xiu, 2020. "Empirical Asset Pricing via Machine Learning," Review of Finance, European Finance Association, vol. 33(5), pages 2223-2273.
    15. Gagliardini, Patrick & Ossola, Elisa & Scaillet, Olivier, 2019. "Estimation of large dimensional conditional factor models in finance," Working Papers unige:125031, University of Geneva, Geneva School of Economics and Management.
    16. Jushan Bai & Serena Ng, 2002. "Determining the Number of Factors in Approximate Factor Models," Econometrica, Econometric Society, vol. 70(1), pages 191-221, January.
    17. Kock, Anders Bredahl & Callot, Laurent, 2015. "Oracle inequalities for high dimensional vector autoregressions," Journal of Econometrics, Elsevier, vol. 186(2), pages 325-344.
    18. Matteo Barigozzi & Marc Hallin, 2016. "Generalized dynamic factor models and volatilities: recovering the market volatility shocks," Econometrics Journal, Royal Economic Society, vol. 19(1), pages 33-60, February.
    19. Guanhao Feng & Stefano Giglio & Dacheng Xiu, 2020. "Taming the Factor Zoo: A Test of New Factors," Journal of Finance, American Finance Association, vol. 75(3), pages 1327-1370, June.
    20. Olivier Ledoit & Michael Wolf, 2017. "Nonlinear Shrinkage of the Covariance Matrix for Portfolio Selection: Markowitz Meets Goldilocks," The Review of Financial Studies, Society for Financial Studies, vol. 30(12), pages 4349-4388.
    21. Shihao Gu & Bryan Kelly & Dacheng Xiu, 2020. "Empirical Asset Pricing via Machine Learning," The Review of Financial Studies, Society for Financial Studies, vol. 33(5), pages 2223-2273.
    22. Michael W. McCracken & Serena Ng, 2016. "FRED-MD: A Monthly Database for Macroeconomic Research," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 34(4), pages 574-589, October.
    23. Victor Chernozhukov & Denis Chetverikov & Kengo Kato & Aureo de Paula, 2019. "Inference on Causal and Structural Parameters using Many Moment Inequalities," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 86(5), pages 1867-1900.
    24. Barigozzi, Matteo & Hallin, Marc, 2017. "Generalized dynamic factor models and volatilities: estimation and forecasting," Journal of Econometrics, Elsevier, vol. 201(2), pages 307-321.
    25. Fama, Eugene F. & French, Kenneth R., 2015. "A five-factor asset pricing model," Journal of Financial Economics, Elsevier, vol. 116(1), pages 1-22.
    26. Chen, Song Xi & Zhang, Li-Xin & Zhong, Ping-Shou, 2010. "Tests for High-Dimensional Covariance Matrices," Journal of the American Statistical Association, American Statistical Association, vol. 105(490), pages 810-819.
    27. Fama, Eugene F. & French, Kenneth R., 1993. "Common risk factors in the returns on stocks and bonds," Journal of Financial Economics, Elsevier, vol. 33(1), pages 3-56, February.
    28. Andrews, Donald W K, 1991. "Heteroskedasticity and Autocorrelation Consistent Covariance Matrix Estimation," Econometrica, Econometric Society, vol. 59(3), pages 817-858, May.
    29. Alberto Abadie & Javier Gardeazabal, 2003. "The Economic Costs of Conflict: A Case Study of the Basque Country," American Economic Review, American Economic Association, vol. 93(1), pages 113-132, March.
    30. Andreou, Elena & Ghysels, Eric, 2021. "Predicting the VIX and the volatility risk premium: The role of short-run funding spreads Volatility Factors," Journal of Econometrics, Elsevier, vol. 220(2), pages 366-398.
    31. Victor Chernozhukov & Denis Chetverikov & Kengo Kato, 2012. "Gaussian approximations and multiplier bootstrap for maxima of sums of high-dimensional random vectors," Papers 1212.6906, arXiv.org, revised Jan 2018.
    32. Lam, Clifford & Fan, Jianqing, 2009. "Sparsistency and rates of convergence in large covariance matrix estimation," LSE Research Online Documents on Economics 31540, London School of Economics and Political Science, LSE Library.
    33. Jianqing Fan & Ricardo Masini & Marcelo C. Medeiros, 2022. "Do We Exploit all Information for Counterfactual Analysis? Benefits of Factor Models and Idiosyncratic Correction," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 117(538), pages 574-590, April.
    34. Seung C. Ahn & Alex R. Horenstein, 2013. "Eigenvalue Ratio Test for the Number of Factors," Econometrica, Econometric Society, vol. 81(3), pages 1203-1227, May.
    35. Alexei Onatski & Marcelo Moreira J. & Marc Hallin, 2011. "Asymptotic Power of Sphericity Tests for High-Dimensional Data," Working Papers ECARES ECARES 2011-018, ULB -- Universite Libre de Bruxelles.
    36. Fan, Jianqing & Ke, Yuan & Wang, Kaizheng, 2020. "Factor-adjusted regularized model selection," Journal of Econometrics, Elsevier, vol. 216(1), pages 71-85.
    37. Carvalho, Carlos & Masini, Ricardo & Medeiros, Marcelo C., 2018. "ArCo: An artificial counterfactual approach for high-dimensional panel time-series data," Journal of Econometrics, Elsevier, vol. 207(2), pages 352-380.
    38. Tobias J. Moskowitz & Mark Grinblatt, 1999. "Do Industries Explain Momentum?," Journal of Finance, American Finance Association, vol. 54(4), pages 1249-1290, August.
    39. Christian Brownlees & Guðmundur Stefán Guðmundsson & Gábor Lugosi, 2022. "Community Detection in Partial Correlation Network Models," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 40(1), pages 216-226, January.
    40. Abadie, Alberto & Diamond, Alexis & Hainmueller, Jens, 2010. "Synthetic Control Methods for Comparative Case Studies: Estimating the Effect of California’s Tobacco Control Program," Journal of the American Statistical Association, American Statistical Association, vol. 105(490), pages 493-505.
    41. Li, Weiming & Qin, Yingli, 2014. "Hypothesis testing for high-dimensional covariance matrices," Journal of Multivariate Analysis, Elsevier, vol. 128(C), pages 108-119.
    42. Ben S. Bernanke & Jean Boivin & Piotr Eliasz, 2005. "Measuring the Effects of Monetary Policy: A Factor-Augmented Vector Autoregressive (FAVAR) Approach," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 120(1), pages 387-422.
    43. Jianqing Fan & Quefeng Li & Yuyan Wang, 2017. "Estimation of high dimensional mean regression in the absence of symmetry and light tail assumptions," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 79(1), pages 247-265, January.
    44. Fan, Jianqing & Fan, Yingying & Lv, Jinchi, 2008. "High dimensional covariance matrix estimation using a factor model," Journal of Econometrics, Elsevier, vol. 147(1), pages 186-197, November.
    45. Jushan Bai, 2003. "Inferential Theory for Factor Models of Large Dimensions," Econometrica, Econometric Society, vol. 71(1), pages 135-171, January.
    46. Jushan Bai & Serena Ng, 2006. "Confidence Intervals for Diffusion Index Forecasts and Inference for Factor-Augmented Regressions," Econometrica, Econometric Society, vol. 74(4), pages 1133-1150, July.
    47. Doukhan, Paul & Louhichi, Sana, 1999. "A new weak dependence condition and applications to moment inequalities," Stochastic Processes and their Applications, Elsevier, vol. 84(2), pages 313-342, December.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Jonas Krampe & Luca Margaritella, 2021. "Factor Models with Sparse VAR Idiosyncratic Components," Papers 2112.07149, arXiv.org, revised May 2022.
    2. Marko Mlikota, 2022. "Cross-Sectional Dynamics Under Network Structure: Theory and Macroeconomic Applications," Papers 2211.13610, arXiv.org, revised Sep 2024.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Matteo Barigozzi & Christian Brownlees, 2019. "NETS: Network estimation for time series," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 34(3), pages 347-364, April.
    2. De Nard, Gianluca & Zhao, Zhao, 2022. "A large-dimensional test for cross-sectional anomalies:Efficient sorting revisited," International Review of Economics & Finance, Elsevier, vol. 80(C), pages 654-676.
    3. Jianqing Fan & Kunpeng Li & Yuan Liao, 2020. "Recent Developments on Factor Models and its Applications in Econometric Learning," Papers 2009.10103, arXiv.org.
    4. Jonas Krampe & Luca Margaritella, 2021. "Factor Models with Sparse VAR Idiosyncratic Components," Papers 2112.07149, arXiv.org, revised May 2022.
    5. Yuan Liao & Xinjie Ma & Andreas Neuhierl & Zhentao Shi, 2023. "Economic Forecasts Using Many Noises," Papers 2312.05593, arXiv.org, revised Dec 2023.
    6. Zhaoxing Gao & Ruey S. Tsay, 2023. "Supervised Dynamic PCA: Linear Dynamic Forecasting with Many Predictors," Papers 2307.07689, arXiv.org.
    7. Gagliardini, Patrick & Ossola, Elisa & Scaillet, Olivier, 2019. "Estimation of large dimensional conditional factor models in finance," Working Papers unige:125031, University of Geneva, Geneva School of Economics and Management.
    8. De Nard, Gianluca & Zhao, Zhao, 2023. "Using, taming or avoiding the factor zoo? A double-shrinkage estimator for covariance matrices," Journal of Empirical Finance, Elsevier, vol. 72(C), pages 23-35.
    9. Miao, Ke & Phillips, Peter C.B. & Su, Liangjun, 2023. "High-dimensional VARs with common factors," Journal of Econometrics, Elsevier, vol. 233(1), pages 155-183.
    10. Li, Xingyu & Shen, Yan & Zhou, Qiankun, 2024. "Confidence intervals of treatment effects in panel data models with interactive fixed effects," Journal of Econometrics, Elsevier, vol. 240(1).
    11. Tae-Hwy Lee & Ekaterina Seregina, 2024. "Optimal Portfolio Using Factor Graphical Lasso," Journal of Financial Econometrics, Oxford University Press, vol. 22(3), pages 670-695.
    12. Xiong, Ruoxuan & Pelger, Markus, 2023. "Large dimensional latent factor modeling with missing observations and applications to causal inference," Journal of Econometrics, Elsevier, vol. 233(1), pages 271-301.
    13. Jianqing Fan & Ricardo Masini & Marcelo C. Medeiros, 2022. "Do We Exploit all Information for Counterfactual Analysis? Benefits of Factor Models and Idiosyncratic Correction," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 117(538), pages 574-590, April.
    14. Fan, Qingliang & Wu, Ruike & Yang, Yanrong & Zhong, Wei, 2024. "Time-varying minimum variance portfolio," Journal of Econometrics, Elsevier, vol. 239(2).
    15. Fan, Jianqing & Ke, Yuan & Liao, Yuan, 2021. "Augmented factor models with applications to validating market risk factors and forecasting bond risk premia," Journal of Econometrics, Elsevier, vol. 222(1), pages 269-294.
    16. Fan, Jianqing & Jiang, Bai & Sun, Qiang, 2022. "Bayesian factor-adjusted sparse regression," Journal of Econometrics, Elsevier, vol. 230(1), pages 3-19.
    17. Fan, Jianqing & Liao, Yuan & Shi, Xiaofeng, 2015. "Risks of large portfolios," Journal of Econometrics, Elsevier, vol. 186(2), pages 367-387.
    18. Gagliardini, Patrick & Ossola, Elisa & Scaillet, Olivier, 2019. "A diagnostic criterion for approximate factor structure," Journal of Econometrics, Elsevier, vol. 212(2), pages 503-521.
    19. Francisco Peñaranda & Enrique Sentana, 2024. "Portfolio management with big data," Working Papers wp2024_2411, CEMFI.
    20. Barigozzi, Matteo & Hallin, Marc & Luciani, Matteo & Zaffaroni, Paolo, 2024. "Inferential theory for generalized dynamic factor models," Journal of Econometrics, Elsevier, vol. 239(2).

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2102.11341. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.