IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2407.09738.html
   My bibliography  Save this paper

Sparse Asymptotic PCA: Identifying Sparse Latent Factors Across Time Horizon

Author

Listed:
  • Zhaoxing Gao

Abstract

This paper introduces a novel sparse latent factor modeling framework using sparse asymptotic Principal Component Analysis (APCA) to analyze the co-movements of high-dimensional panel data over time. Unlike existing methods based on sparse PCA, which assume sparsity in the loading matrices, our approach posits sparsity in the factor processes while allowing non-sparse loadings. This is motivated by the fact that financial returns typically exhibit universal and non-sparse exposure to market factors. Unlike the commonly used $\ell_1$-relaxation in sparse PCA, the proposed sparse APCA employs a truncated power method to estimate the leading sparse factor and a sequential deflation method for multi-factor cases under $\ell_0$-constraints. Furthermore, we develop a data-driven approach to identify the sparsity of risk factors over the time horizon using a novel cross-sectional cross-validation method. We establish the consistency of our estimators under mild conditions as both the dimension $N$ and the sample size $T$ grow. Monte Carlo simulations demonstrate that the proposed method performs well in finite samples. Empirically, we apply our method to daily S&P 500 stock returns (2004--2016) and identify nine risk factors influencing the stock market.

Suggested Citation

  • Zhaoxing Gao, 2024. "Sparse Asymptotic PCA: Identifying Sparse Latent Factors Across Time Horizon," Papers 2407.09738, arXiv.org, revised Jan 2025.
  • Handle: RePEc:arx:papers:2407.09738
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2407.09738
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. John Y. Campbell & Martin Lettau & Burton G. Malkiel & Yexiao Xu, 2001. "Have Individual Stocks Become More Volatile? An Empirical Exploration of Idiosyncratic Risk," Journal of Finance, American Finance Association, vol. 56(1), pages 1-43, February.
    2. Lippi, Marco & Reichlin, Lucrezia & Hallin, Marc & Forni, Mario, 2000. "Reference Cycles: The NBER Methodology Revisited," CEPR Discussion Papers 2400, C.E.P.R. Discussion Papers.
    3. Xavier Gabaix, 2014. "A Sparsity-Based Model of Bounded Rationality," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 129(4), pages 1661-1710.
    4. Chamberlain, Gary & Rothschild, Michael, 1983. "Arbitrage, Factor Structure, and Mean-Variance Analysis on Large Asset Markets," Econometrica, Econometric Society, vol. 51(5), pages 1281-1304, September.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Cavit Pakel & Neil Shephard & Kevin Sheppard & Robert F. Engle, 2021. "Fitting Vast Dimensional Time-Varying Covariance Models," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 39(3), pages 652-668, July.
    2. Goyal, Amit & Pérignon, Christophe & Villa, Christophe, 2008. "How common are common return factors across the NYSE and Nasdaq?," Journal of Financial Economics, Elsevier, vol. 90(3), pages 252-271, December.
    3. Zura Kakushadze & Willie Yu, 2016. "Multifactor Risk Models and Heterotic CAPM," Papers 1602.04902, arXiv.org, revised Mar 2016.
    4. Rajnish Mehra & Sunil Wahal & Daruo Xie, 2021. "Is idiosyncratic risk conditionally priced?," Quantitative Economics, Econometric Society, vol. 12(2), pages 625-646, May.
    5. George Kapetanios & Massimiliano Marcellino, 2009. "A parametric estimation method for dynamic factor models of large dimensions," Journal of Time Series Analysis, Wiley Blackwell, vol. 30(2), pages 208-238, March.
    6. Zhuo Chen & Gregory Connor & Robert A Korajczyk, 2018. "A Performance Comparison of Large-n Factor Estimators," The Review of Asset Pricing Studies, Society for Financial Studies, vol. 8(1), pages 153-182.
    7. André Nunes Maranhão, 2024. "Brazilian Business Cycle Analysis in a High-Dimensional and Time-Irregular Span Context," Journal of Business Cycle Research, Springer;Centre for International Research on Economic Tendency Surveys (CIRET), vol. 20(1), pages 1-58, August.
    8. Geert Bekaert & Robert J. Hodrick & Xiaoyan Zhang, 2009. "International Stock Return Comovements," Journal of Finance, American Finance Association, vol. 64(6), pages 2591-2626, December.
    9. Massimiliano Marcellino & George Kapetanios, 2006. "The Role of Search Frictions and Bargaining for Inflation Dynamics," Working Papers 305, IGIER (Innocenzo Gasparini Institute for Economic Research), Bocconi University.
    10. Camelia M. Kuhnen, 2015. "Asymmetric Learning from Financial Information," Journal of Finance, American Finance Association, vol. 70(5), pages 2029-2062, October.
    11. George Kapetanios & Massimiliano Marcellino, 2003. "A Comparison of Estimation Methods for Dynamic Factor Models of Large Dimensions," Working Papers 489, Queen Mary University of London, School of Economics and Finance.
    12. Zhaoxing Gao & Ruey S. Tsay, 2021. "Divide-and-Conquer: A Distributed Hierarchical Factor Approach to Modeling Large-Scale Time Series Data," Papers 2103.14626, arXiv.org.
    13. Christophe Van Nieuwenhuyze, 2006. "A generalised dynamic factor model for the Belgian economy - Useful business cycle indicators and GDP growth forecasts," Working Paper Research 80, National Bank of Belgium.
    14. Ambrogio Cesa-Bianchi & M Hashem Pesaran & Alessandro Rebucci & Stijn Van Nieuwerburgh, 2020. "Uncertainty and Economic Activity: A Multicountry Perspective," The Review of Financial Studies, Society for Financial Studies, vol. 33(8), pages 3393-3445.
    15. Zura Kakushadze, 2015. "Heterotic Risk Models," Papers 1508.04883, arXiv.org, revised Jan 2016.
    16. Gregory Connor & Lisa R. Goldberg & Robert A. Korajczyk, 2010. "Portfolio Risk Analysis," Economics Books, Princeton University Press, edition 1, number 9224.
    17. Jones, Christopher S., 2001. "Extracting factors from heteroskedastic asset returns," Journal of Financial Economics, Elsevier, vol. 62(2), pages 293-325, November.
    18. Umberto Triacca & Fulvia Focker, 2014. "Estimating overnight volatility of asset returns by using the generalized dynamic factor model approach," Decisions in Economics and Finance, Springer;Associazione per la Matematica, vol. 37(2), pages 235-254, October.
    19. Zura Kakushadze & Willie Yu, 2016. "Statistical Risk Models," Papers 1602.08070, arXiv.org, revised Jan 2017.
    20. Bai, Jushan, 2004. "Estimating cross-section common stochastic trends in nonstationary panel data," Journal of Econometrics, Elsevier, vol. 122(1), pages 137-183, September.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2407.09738. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.