IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2004.13612.html
   My bibliography  Save this paper

Denise: Deep Robust Principal Component Analysis for Positive Semidefinite Matrices

Author

Listed:
  • Calypso Herrera
  • Florian Krach
  • Anastasis Kratsios
  • Pierre Ruyssen
  • Josef Teichmann

Abstract

The robust PCA of covariance matrices plays an essential role when isolating key explanatory features. The currently available methods for performing such a low-rank plus sparse decomposition are matrix specific, meaning, those algorithms must re-run for every new matrix. Since these algorithms are computationally expensive, it is preferable to learn and store a function that nearly instantaneously performs this decomposition when evaluated. Therefore, we introduce Denise, a deep learning-based algorithm for robust PCA of covariance matrices, or more generally, of symmetric positive semidefinite matrices, which learns precisely such a function. Theoretical guarantees for Denise are provided. These include a novel universal approximation theorem adapted to our geometric deep learning problem and convergence to an optimal solution to the learning problem. Our experiments show that Denise matches state-of-the-art performance in terms of decomposition quality, while being approximately $2000\times$ faster than the state-of-the-art, principal component pursuit (PCP), and $200 \times$ faster than the current speed-optimized method, fast PCP.

Suggested Citation

  • Calypso Herrera & Florian Krach & Anastasis Kratsios & Pierre Ruyssen & Josef Teichmann, 2020. "Denise: Deep Robust Principal Component Analysis for Positive Semidefinite Matrices," Papers 2004.13612, arXiv.org, revised Jun 2023.
  • Handle: RePEc:arx:papers:2004.13612
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2004.13612
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Fulvio Corsi & Stefano Peluso & Francesco Audrino, 2015. "Missing in Asynchronicity: A Kalman‐em Approach for Multivariate Realized Covariance Estimation," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 30(3), pages 377-397, April.
    2. Calypso Herrera & Florian Krach & Josef Teichmann, 2020. "Local Lipschitz Bounds of Deep Neural Networks," Papers 2004.13135, arXiv.org, revised Feb 2023.
    3. Yacine Aït-Sahalia & Dacheng Xiu, 2019. "Principal Component Analysis of High-Frequency Data," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 114(525), pages 287-303, January.
    4. Aït-Sahalia, Yacine & Fan, Jianqing & Xiu, Dacheng, 2010. "High-Frequency Covariance Estimates With Noisy and Asynchronous Financial Data," Journal of the American Statistical Association, American Statistical Association, vol. 105(492), pages 1504-1517.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Donelli, Nicola & Peluso, Stefano & Mira, Antonietta, 2021. "A Bayesian semiparametric vector Multiplicative Error Model," Computational Statistics & Data Analysis, Elsevier, vol. 161(C).
    2. Carlo Campajola & Fabrizio Lillo & Daniele Tantari, 2019. "Unveiling the relation between herding and liquidity with trader lead-lag networks," Papers 1909.10807, arXiv.org, revised Mar 2020.
    3. Ruijun Bu & Degui Li & Oliver Linton & Hanchao Wang, 2022. "Nonparametric Estimation of Large Spot Volatility Matrices for High-Frequency Financial Data," Working Papers 202212, University of Liverpool, Department of Economics.
    4. Boudt, Kris & Laurent, Sébastien & Lunde, Asger & Quaedvlieg, Rogier & Sauri, Orimar, 2017. "Positive semidefinite integrated covariance estimation, factorizations and asynchronicity," Journal of Econometrics, Elsevier, vol. 196(2), pages 347-367.
    5. Liu, Cheng & Tang, Cheng Yong, 2014. "A quasi-maximum likelihood approach for integrated covariance matrix estimation with high frequency data," Journal of Econometrics, Elsevier, vol. 180(2), pages 217-232.
    6. repec:cte:wsrepe:es142416 is not listed on IDEAS
    7. Bahcivan, Hulusi & Karahan, Cenk C., 2022. "High frequency correlation dynamics and day-of-the-week effect: A score-driven approach in an emerging market stock exchange," International Review of Financial Analysis, Elsevier, vol. 80(C).
    8. Michael Ho & Jack Xin, 2016. "Sparse Kalman Filtering Approaches to Covariance Estimation from High Frequency Data in the Presence of Jumps," Papers 1602.02185, arXiv.org, revised Apr 2016.
    9. Bu, R. & Li, D. & Linton, O. & Wang, H., 2022. "Nonparametric Estimation of Large Spot Volatility Matrices for High-Frequency Financial Data," Cambridge Working Papers in Economics 2218, Faculty of Economics, University of Cambridge.
    10. Shephard, Neil & Xiu, Dacheng, 2017. "Econometric analysis of multivariate realised QML: Estimation of the covariation of equity prices under asynchronous trading," Journal of Econometrics, Elsevier, vol. 201(1), pages 19-42.
    11. Harry-Paul Vander Elst & David Veredas, 2014. "Disentangled Jump-Robust Realized Covariances and Correlations with Non-Synchronous Prices," Working Papers ECARES ECARES 2014-35, ULB -- Universite Libre de Bruxelles.
    12. Ulrich Hounyo, 2014. "Bootstrapping integrated covariance matrix estimators in noisy jump-diffusion models with non-synchronous trading," CREATES Research Papers 2014-35, Department of Economics and Business Economics, Aarhus University.
    13. Giuseppe Buccheri & Giacomo Bormetti & Fulvio Corsi & Fabrizio Lillo, 2018. "A Score-Driven Conditional Correlation Model for Noisy and Asynchronous Data: an Application to High-Frequency Covariance Dynamics," Papers 1803.04894, arXiv.org, revised Mar 2019.
    14. Katerina Papagiannouli, 2022. "A Lepskiĭ-type stopping rule for the covariance estimation of multi-dimensional Lévy processes," Statistical Inference for Stochastic Processes, Springer, vol. 25(3), pages 505-535, October.
    15. Altmeyer, Randolf & Bibinger, Markus, 2015. "Functional stable limit theorems for quasi-efficient spectral covolatility estimators," Stochastic Processes and their Applications, Elsevier, vol. 125(12), pages 4556-4600.
    16. Iara da Silva & Caroline Fernanda Hei Wikuats & Elizabeth Mie Hashimoto & Leila Droprinchinski Martins, 2022. "Effects of Environmental and Socioeconomic Inequalities on Health Outcomes: A Multi-Region Time-Series Study," IJERPH, MDPI, vol. 19(24), pages 1-22, December.
    17. Jin, Xin & Maheu, John M., 2016. "Bayesian semiparametric modeling of realized covariance matrices," Journal of Econometrics, Elsevier, vol. 192(1), pages 19-39.
    18. Peter Reinhard Hansen & Guillaume Horel & Asger Lunde & Ilya Archakov, 2015. "A Markov Chain Estimator of Multivariate Volatility from High Frequency Data," CREATES Research Papers 2015-19, Department of Economics and Business Economics, Aarhus University.
    19. Calypso Herrera & Florian Krach & Pierre Ruyssen & Josef Teichmann, 2021. "Optimal Stopping via Randomized Neural Networks," Papers 2104.13669, arXiv.org, revised Dec 2023.
    20. Neil Shephard & Dacheng Xiu, 2012. "Econometric analysis of multivariate realised QML: efficient positive semi-definite estimators of the covariation of equity prices," Economics Series Working Papers 604, University of Oxford, Department of Economics.
    21. Kim Christensen & Ulrich Hounyo & Zhi Liu, 2024. "A nonparametric test for diurnal variation in spot correlation processes," Papers 2408.02757, arXiv.org.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2004.13612. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.